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intel. 


Intel microprocessors and peripherals provide a broad 
range of time-saving, energy-efficient, high-perform- 
ance solutions to designers of both mobile and desktop 
microprocessor-based systems. Intel's microprocessor/ 
peripheral interface delivers time and performance 
ad- 
vantages to the designers of microprocessor-based sys- 
tems, meeting their demand for greater performance, 
lower power consumption and a wider variety of built- 
in features for their customers. 


Intel offers an entire product line of microprocessors 
for desktop and mobile systems, ranging from the 
25 MHz version of the Intel486TM SX processor to the 
high performance 66 MHz Intel486 DX2 processor. As 
Intel maintains its leadership in MIPS for microproces- 
sor-based systems, it has managed to couple this superi- 
or performance with its sophisticated energy-efficient 
SL technology to meet the requirements of the EPA's 
Energy Star guidelines. 


Intel has combined SL technology with desktop per- 
formance to enhance its entire Intel486 microprocessor 
family. Now, with built-in power management, SL En- 
hanced Intel486 CPU-based mobile systems achieve in- 
creases in battery life without compromising perform- 
ance, while maintaining full compatibility with existing 
Intel architecture 
processors. SL Enhanced 
Intel486 
CPU s for mobile and desktop systems are available in a 
full range of speeds (25 MHz to 66 MHz), packages 
(PGA, SQFP, PQFP), and voltages (5V, 3.3V) to meet 
any system design requirement. 


Intel's universal motherboard for Intel486 SX, DX, and 
DX2 microprocessor-based desktop systems and scala- 
ble architecture for mobile systems reduces the develop- 
ment effort required to produce an entire product line 
of high-performance systems, thereby reducing time to 
market. 


Intel offers a wide variety of off-the-shelf components 
to fulfill the requirements of system designers while 
simplifying the implementation 
of their designs. Off- 
the-shelf system solutions greatly decrease the potential 
risk of costly project delays due to component incom- 
patibility. These system solutions greatly reduce the 
amount of time required to design, debug, manufacture 
and test microprocessor-based systems. 


At Intel, the rate of failure for devices is carefully 
tracked. Highest reliability is a tangible goal that trans- 
lates to higher reliability for your product, 
reduced 
downtime, and reduced repair costs. And as more and 
more functions are integrated into fewer components, 
the resulting system requires less power, produces less 
heat, 
and requires fewer mechanical 
connections- 
again resulting in greater system reliability. 


Using proven, reliable off-the-shelf components will re- 
duce design costs, manufacturing 
costs, and time to 
market while increasing project viability and product 
reliability. 
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INTRODUCTION 
TO THE Intel486™ 
MICROPROCESSOR 
FAMILY 


In addition to the standard Intel486TMmicroprocessors 
(Inte1486 SX, Intel486 DX and Inte1486 DX2 CPUs), 
Intel is enhancing its entire Inte1486 microprocessor 
family with the energy-efficient SL technology. The 
new SL Enhanced Intel486 microprocessor family en- 
ables built-in power management while maintaining full 
compatibility with existing Intel architecture 
proces- 


sors. 


The SL Enhanced Intel486 microprocessor packages 
are identified with the following laser mark: "&E". If 
the Intel486 SX, DX or DX2 microprocessor is laser- 
marked with "&E" on the package, refer to the SL 
Enhanced Intel486 Microprocessor Data Sheet Adden- 
dum on p. 2-715 for SL Enhanced Intel486 microproc- 
essor specifications and power management features. 


• 


• 
IEEE 1149.1 Boundary 
Scan 
Compatibility 


• 
High Performance 
Design 
- 
50 MHz/66 
MHz Core Speed 
Using 
25 MHz/33 
MHz Bus Clocks 
- 
RISC Integer 
Core with Frequent 
Instructions 
Executing 
In One Core 
Clock 
- 
80, 106 Mbyte/sec 
Burst Bus 
- 
Dynamic 
Bus Sizing for 8-, 16-, and 
32-Blt Busses 
- 
Complete 
32-Blt Architecture 


• 
Multiprocessor 
Support 
- 
Cache Consistency 
Protocols 
- 
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A5 
Q12 
03 
H2 
BEO# 
K15 
TM5 
B14 
B10 
C4 
B3 
A6 
515 
04 
M3 
BEU 
J16 
B12 
C5 
B4 
A7 
Q13 
05 
J2 
BE2# 
J15 
813 
E2 
B5 
A8 
R13 
06 
L2 
BE3# 
F17 
C10 
E16 
E1 
A9 
Q11 
07 
L3 
BLA5T# 
R16 
C12 
G2 
E17 
Al0 
513 
08 
F2 
BOFF# 
017 
C13 
G16 
G1 
All 
R12 
09 
01 
BROY# 
H15 
G15 
H16 
G17 
A12 
57 
010 
E3 
BREQ 
Q15 
R17 
J1 
H1 
A13 
Q10 
011 
C1 
B58# 
016 
54 
K2 
H17 
A14 
55 
012 
G3 
B516# 
C17 
K16 
K1 
A15 
R7 
013 
02 
CLK 
C3 
L16 
K17 
A16 
Q9 
014 
K3 
O/C# 
M15 
M2 
L1 
A17 
Q3 
015 
F3 
OPO 
N3 
M16 
L17 
A18 
R5 
016 
J3 . 
OP1 
F1 
P16 
M1 
A19 
Q4 
017 
03 
OP2 
H3 
R3 
M17 
A20 
Q8 
018 
C2 
OP3 
A5 
R6 
P17 
A21 
Q5 
019 
B1 
EA05# 
B17 
R8 
Q2 
A22 
Q7 
020 
A1 
FERR# 
C14 
R9 
R4 
A23 
83 
021 
B2 
FLU5H# 
C15 
R10 
56 
A24 
Q6 
022 
A2 
HLOA 
P15 
R11 
58 
A25 
R2 
023 
A4 
HOLO 
E15 
R14 
59 
A26 
52 
024 
A6 
IGNNE# 
A15 
510 
A27 
51 
025 
B6 
INTR 
A16 
511 
A28 
R1 
026 
C7 
KEN# 
F15 
512 
A29 
P2 
027 
C6 
LOCK # 
N15 
514 
A30 
P3 
028 
C8 
M/IO# 
N16 
A31 
Q1 
029 
A8 
NMI 
B15 
030 
C9 
'PCO 
J17 
031 
B8 
PCHK# 
Q17 
PWT 
L15 
PLOCK # 
Q16 
ROY# 
F16 
RE5ET 
C16 
UP#(l) 
C11 
W/R# 
N17 


NOTE: 
1. These pins were No-Connects on the 25 MHz and 33 MHz Intel486 OX microprocessors. For compatibility with old 
designs they can still be left unconnected. 


Symbol 
Type 
Name and Function 


CLK 
I 
Clock provides 
the fundamental 
timing for the bus interface 
unit and is multiplied 
by 


two (2x) to provide the internal frequency 
for the Intel486 
OX2. All external 
timing 
parameters 
are specified 
with respect to the rising edge of CLK. 


ADDRESS 
BUS 


A31-A4 
I/O 
A31-A2 
are the address 
lines of the microprocessor. 
A31-A2, 
together 
with the 
A2-A3 
0 
byte enables 
BEOII-BE311, 
define the physical area of memory or input/output 
space accessed. 
Address 
lines A31-A4 
are used to drive addresses 
into the 
microprocessor 
to perform 
cache line invalidations. 
Input signals must meet setup 
and hold times t22 and t23. A31-A2 
are not driven during bus or address 
hold. 


BEO-311 
0 
The byte enable signals indicate 
active bytes during read and write cycles. 
Ouring 
the first cycle of a cache fill, the external 
system should assume that all byte 


enables 
are active. BE311 applies to 024-031, 
BE211 applies to 016-023, 
BE1 II 
applies to 08-015 
and BEOII applies to 00-07. 
BEOII-BE311 
are active LOW and 


are not driven during bus hold. 


DATA BUS 


031-00 
I/O 
These are the data lines for the Intel486 
OX2 microprocessor. 
Lines 00-07 
define 
the least significant 
byte of the data bus while lines 024-031 
define the most 
significant 
byte of the data bus. These signals must meet setup and hold times t22 
and t23 for proper operation 
on reads. These pins are driven during the second 
and 
subsequent 
clocks of write cycles. 


DATA 
PARITY 


OPO-OP3 
I/O 
There is one data parity pin for each byte of the data bus. Oata parity is generated 
on all write data cycles with the same timing as the data driven by the Intel486 
OX2 
microprocessor. 
Even parity information 
must be driven back into the 
microprocessor 
on the data parity pins with the same timing as read information 
to 
insure that the correct 
parity check status is indicated 
by the Intel486 
OX2 
microprocessor. 
The signals read on these pins do not affect program 
execution. 


Input signals must meet setup and hold times t22 and t23' OPO-OP3 
should be 
connected 
to Vcc through 
a pullup resistor in systems which do not use parity. 


OPO-OP3 
are active HIGH and are driven during the second and subsequent 
clocks 


of write cycles. 


PCHKII 
0 
Parity Status is driven on the PCHK II pin the clock after ready for read operations. 
The parity status is for data sampled 
at the end of the previous clock. A parity error 
is indicated 
by PCHKII 
being LOW. Parity status is only checked 
for enabled 
bytes 
as indicated 
by the byte enable and bus size signals. PCHK II is valid only in the 
clock immediately 
after read data is returned 
to the microprocessor. 
At all other 
times PCHK II is inactive 
(HIGH). PCHK II is never floated. 
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Symbol 
Type 
Name and Function 


BUS CYCLE 
DEFINITION 


M/IO# 
0 
The memory/input-output, 
data/control 
and write/read 
lines are the primary bus 
O/C# 
0 
definition 
signals. These signals are driven valid as the AOS# 
signal is asserted. 


W/R# 
0 
M/IO# 
O/C# 
W/R# 
Bus Cycle 
Initiated 


0 
0 
0 
Interrupt 
Acknowledge 
0 
0 
1 
Halt/Special 
Cycle 
0 
1 
0 
I/O Read 
0 
1 
1 
I/O Write 
1 
0 
0 
Code Read 
1 
0 
1 
Reserved 
1 
1 
0 
Memory 
Read 
1 
1 
1 
Memory Write 


The bus definition 
signals are not driven during bus hold and follow the timing of the 
address 
bus. Refer to Section 
7.2.11 for a description 
of the special bus cycles. 


LOCK # 
0 
The bus lock pin indicates 
that the current bus cycle is locked. The Intel486 
OX2 
microprocessor 
will not allow a bus hold when LOCK # is asserted 
(but address 
holds are allowed). 
LOCK # goes active in the first clock of the first locked bus cycle 
and goes inactive after the last clock of the last locked bus cycle. The last locked 
cycle ends when ready is returned. 
LOCK # is active LOW and is not driven during 
bus hold. Locked read cycles will not be transformed 
into cache fill cycles if KEN # is 
returned 
active. 


PLOCK # 
0 
The pseudo-lock 
pin indicates 
that the current bus transaction 
requires more than 
one bus cycle to complete. 
Examples 
of such operations 
are floating 
point long 
reads and writes (64 bits), segment 
table descriptor 
reads (64 bits), in addition 
to 
cache line fills (128 bits). The Intel486 
OX2 microprocessor 
will drive PLOCK # active 
until the addresses 
for the last bus cycle of the transaction 
have been driven 
regardless 
of whether 
ROY # or BROY # have been returned. 
Normally 
PLOCK # and BLAST # are inverse of each other. However 
during the first 
bus cycle of a 64-bit floating 
point write, both PLOCK # and BLAST # will be 
asserted. 
PLOCK# 
is a function 
of the BS8#, 
BS16# 
and KEN# 
inputs. PLOCK# 
should be 
sampled 
only in the clock ready is returned. 
PLOCK # is active LOW and is not 
driven during bus hold. 


BUS CONTROL 


AOS# 
0 
The address status output indicates 
that a valid bus cycle definition 
and address 
are 
available 
on the cycle definition 
lines and address bus. AOS# 
is driven active in the 
same clock as the addresses 
are driven. AOS# 
is active LOW and is not driven 
during bus hold. 


ROY# 
I 
The non-burst 
ready input indicates 
that the current bus cycle is complete. 
ROY # 
indicates 
that the external system has presented 
valid data on the data pins in 
response 
to a read or that the external 
system has accepted 
data from the Intel486 
OX2 microprocessor 
in response 
to a write. ROY # is ignored when the bus is idle 
and at the end of the first clock of the bus cycle. 
ROY # is active during address 
hold. Data can be returned 
to the processor 
while 
AHOLO is active. 
ROY # is active LOW, and is not provided 
with an internal 
pullup resistor. 
ROY # 
must satisfy setup and hold times t16 and t17 for proper chip operation. 


intel~ 


Symbol 
Type 
Name and Function 


BURST CONTROL 


BRDY# 
I 
The burst ready input performs 
the same function 
during a burst cycle that ROY # 
performs 
during a non-burst 
cycle. BRDY # indicates 
that the external 
system has 
presented 
valid data in response 
to a read or that the external 
system has accepted 
data 
in response 
to a write. BRDY # is ignored when the bus is idle and at the end of the first 
clock in a bus cycle. 
BRDY # is sampled 
in the second and subsequent 
clocks of a burst cycle. The data 
presented 
on the data bus will be strobed 
into the microprocessor 
when BRDY # is 
sampled 
active. If ROY # is returned 
simultaneously 
with BRDY #, BRDY # is ignored 
and the burst cycle is prematurely 
interrupted 
BRDY # is active LOW and is provided 
with a small pullup resistor. 
BRDY # must satisfy 
the setup and hold times t16 and tn. 


BLAST # 
0 
The burst last signal indicates 
that the next time BRDY # is returned 
the burst bus cycle 
is complete. 
BLAST # is active for both burst and non~burst bus cycles. BLAST # is 
active LOW and is not driven during bus hold. 


INTERRUPTS 


RESET 
I 
The reset input forces the Intel486 
DX2 microprocessor 
to begin execution 
at a known 
state. The microprocessor 
cannot 
begin execution 
of instructions 
until at least 1 ms after 
Vcc and CLK have reached 
their proper DC and AC specifications. 
The RESET pin 
should remain active during this time to insure proper microprocessor 
operation. 
RESET 
is active HIGH. RESET is asynchronous 
but must meet setup and hold times t20 and t21 
for recognition 
in any specific 
clock. 


INTR 
I 
The maskable 
interrupt 
indicates 
that an external 
interrupt 
has been generated. 
If the 
internal interrupt 
flag is set in EFLAGS, active interrupt 
processing 
will be initiated. 
The 
Intel486 
DX2 microprocessor 
will generate 
two locked interrupt 
acknowledge 
bus cycles 
in response 
to the INTR pin going active. INTR must remain active until the interrupt 
acknowledges 
have been performed 
to assure that the interrupt 
is rec.ognized. 


INTR is active HIGH and is not provided with an internal pulldown 
resistor. 
INTR is 
asynchronous, 
but must meet setup and hold times t20 and t21 for recognition 
in any 
specific 
clock. 


NMI 
I 
The non-maskable 
interrupt 
request signal indicates 
that an external 
non-maskable 
interrupt 
has been generated. 
NMI is rising edge sensitive. 
NMI must be held LOW for at 
least four CLK periods before this rising edge. NMI is not provided with an internal 
pulldown 
resistor. 
NMI is asynchronous, 
but must meet setup and hold times t20 and t21 
for recognition 
in any specific 
clock. 


BUS ARBITRATION 


BREQ 
0 
The internal 
cycle pending 
signal indicates 
that the Intel486 
DX2 microprocessor 
has 
internally 
generated 
a bus request. 
BREQ is generated 
whether 
or not the Intel486 
DX2 
microprocessor 
is driving the bus. BREQ is active HIGH and is never floated. 


HOLD 
I 
The bus hold request 
allows another 
bus master complete 
control 
of the Intel486 
DX2 
microprocessor 
bus. In response 
to HOLD going active the Intel486 
DX2 microprocessor 
will float most of its output and input/output 
pins. HLDA will be asserted 
after completing 
the current bus cycle, burst cycle or sequence 
of locked cycles. The Intel486 
DX2 
microprocessor 
will remain in this state until HOLD is deasserted. 
HOLD is active high 
and is not provided 
with an internal pulldown 
resistor. 
HOLD must satisfy setup and hold 
times t18 and t19 for proper operation. 


HLDA 
0 
Hold acknowledge 
goes active in response 
to a hold request presented 
on the HOLD 
pin. HLDA indicates 
that the Intel486 
DX2 microprocessor 
has given the bus to another 
local bus master. HLDA is driven active in the same clock that the Intel486 
DX2 
microprocessor 
floats its bus. HLDA is driven inactive when leaving bus hold. HLDA is 
active HIGH and remains driven during bus hold. 


•• 


intel~ 


Symbol 
Type 
Name and Function 


BUS ARBITRATION 
(Continued) 


BOFFIF 
I 
The backoff 
input forces the Intel486 
DX2 microprocessor 
to float its bus in the next 
clock. The microprocessor 
will float all pins normally floated 
during bus hold but HLDA 
will not be asserted 
in response 
to BOFF IF. BOFF IF has higher priority than ROY IF or 
BRDYIF; if both are returned 
in the same clock, BOFFIF takes effect. The 
microprocessor 
remains in bus hold until BOFF IF is negated. 
If a bus cycle was in 
progress 
when BOFF IF was asserted 
the cycle will be restarted. 
BOFF IF is active LOW 
and must meet setup and hold times t18 and t19 for proper operation. 


CACHE INVALIDATION 


AHOLD 
I 
The address hold request allows another 
bus master access to the Intel486 
DX2 
microprocessor's 
address 
bus for a cache invalidation 
cycle. The Intel486 
DX2 
microprocessor 
will stop driving its address bus in the clock following 
AHOLD 
going 
active. Only th~ address bus will be floated 
during address 
hold, the remainder 
of the 
bus will remain active. AHOLD is active HIGH and is provided 
with a small internal 
pulldown 
resistor. 
For proper operation 
AHOLD 
must meet setup and hold times t18 and 


t19· 


EADSIF 
I 
This signal indicates 
that a valid external address has been driven onto the Intel486 
DX2 
microprocessor 
address pins. This address will be used to perform 
an internal cache 
invalidation 
cycle. EADSIF is active LOW and is provided with an internal pullup resistor. 


EADSIF must satisfy setup and hold times t12 and t13 for proper operation. 


CACHE CONTROL 


KEN IF 
I 
TI'\I:lcache enable pin is used to determine 
whether 
the current cycle is cacheable. 
When the Intel486 
DX2 microprocessor 
generates 
a cycle that can be cached 
and 
KEN IF is active, the cycle will become 
a cache line fill cycle. Returning 
KEN IF active one 
clock before ready during the last read in the cache line fill will cause the line to be 
placed in the on-chip cache. KEN IF is active LOW and is provided with a small internal 
pullup resistor. 
KEN IF must satisfy setup and hold times t14 and t15 for proper operation. 


FLUSH IF 
I 
The cache flush input forces the Intel486 DX2 microprocessor 
to flush its entire internal 
cache. FLUSH IF is active low and need only be asserted 
for one clock. FLUSH IF is 
asynchronous 
but setup and hold times t20 and t21 must be met for recognition 
in any 
specific 
clock. FLUSH IF being sampled 
low in the clock before the falling edge of 
RESET causes the Intel486 
DX2 microprocessor 
to enter the tri-state 
test mode. 


PAGE CACHEABILITY 


PWT 
0 
The page write-through 
and page cache disable pins reflect the state of the page 
PCD 
0 
attribute 
bits, PWT and PCD, in the page table entry or page directory 
entry. If paging is 
disabled 
or for cycles that are not paged, PWT and PCD reflect the state of the PWT and 
PCD bits in control 
register 3. PWT and PCD have the same timing as the cycle definition 
pins (MilO IF, D/CIF and W/RIF). 
PWTand 
PCD are active HIGH and are not driven 
during bus hold. PCD is masked by the cache disable bit (CD) in Control 
Register O. 


NUMERIC 
ERROR REPORTING 


FERRIF 
0 
The floating point error pin is driven active when a floating 
point error occurs. 
FERR IF is 
similar to the ERROR IF pin on the Intel387™ 
math coprocessor. 
FERR IF is included 
for 
compatibility 
with systems 
using DOS type floating 
point error reporting. 
FERR IF will not 
go active if FP errors are masked 
in FPU register. 
FERRIF is active LOW, and i~ not 
floated 
during bus hold. 


Symbol 
Type 
Name and Function 


NUMERIC 
ERROR REPORTING 
(Continued) 


IGNNE# 
I 
When the ignore numeric error pin is asserted 
the Intel486 
OX2 microprocessor 
will 
ignore a numeric error and continue 
executing 
non-control 
floating 
point instructions, 
but 


FERR # will still be activated 
by the Intel486 
OX2. When IGNNE # is deasserted 
the 
Intel486 
OX2 microprocessor 
will freeze on a non-control 
floating 
point instruction, 
if a 
previous floating 
point instruction 
caused an error. IGNNE # has no effect when the NE 
bit in control 
register 0 is set. IGNNE# 
is active LOW and is provided 
with a small 
internal 
pullup resistor. 
IGNNE# 
is asynchronous 
but setup and hold times t20 and t21 
must be met to insure recognition 
on any specific clock. 


BUS SIZE CONTROL 


8S16# 
I 
The bus size 16 and bus size 8 pins (bus sizing pins) cause the Intel486 
OX2 
8S8# 
I 
microprocessor 
to run multiple bus cycles to complete 
a request from devices that 
cannot 
provide or accept 32 bits of data in a single cycle. The bus sizing pins are 
sampled 
every clock. The state of these pins in the clock before ready is used by the 
Intel486 OX2 microprocessor 
to determine 
the bus size. These signals are active LOW 
and are provided with internal 
pullup resistors. 
These inputs must satisfy setup and hold 
times t14 and t15 for proper operation. 


ADDRESS 
MASK 


A20M# 
I 
When the address bit 20 mask pin is asserted, 
the Intel486 
OX2 microprocessor 
masks 
physical 
address bit 20 (A20) before performing 
a lookup to the internal cache or driving 
a memory cycle on the bus. A20M # emulates 
the address wraparound 
at one Mbyte 
which occurs on the 8086. A20M # is active LOW and should be asserted 
only when the 
processor 
is in real mode. This pin is asynchronous 
but should meet setup and hold 
times t20 and t21 for recognition 
in any specific 
clock. For proper operation, 
A20M # 
should be sampled 
high at the falling edge of RESET. 


TEST ACCESS 
PORT 


TCK 
I 
Test Clock is an input to the Intel486 
OX2 CPU and provides 
the clocking 
function 
required by the JT AG boundary 
scan feature. TCK is used to clock state information 
and 


data into and out of the component. 
State select information 
and data are clocked 
into 
the component 
on the rising edge of TCK on TMS and TOI, respectively. 
Data is clocked 
out of the part on the falling edge of TCK on TOO. 


TOI 
I 
Test Data Input is the serial input used to shift JT AG instructions 
and data into the 
component. 
TOI is sampled 
on the rising edge of TCK, during the SHIFT-IR 
and the 
SHIFT-DR 
TAP controller 
states. During all other tap controller 
states, TOI is a "don't 


care". 


TOO 
0 
Test Data Output 
is the serial output used to shift JT AG instructions 
and data out of the 


component. 
TOO is driven on the falling edge of TCK during the SHIFT-IR 
and 
SHIFT-DR 
TAP controller 
states. At all other times TOO is driven to the high impedance 


state. 


TMS 
I 
Test Mode Select 
is decoded 
by the JTAG TAP (Tap Access 
Port) to select the 
operation 
of the test logic. TMS is sampled 
on the rising edge of TCK. To guarantee 
deterministic 
behavior of the TAP controller 
TMS is provided with an internal 
pull-up 
resistor. 


•• 


intel~ 


Symbol 
Type 
Name and Function 


POWER DOWN MODE (UPGRADE 
PROCESSOR 
SUPPORn 


UP# 
I 
The Upgrade Present 
pin forces the Intel486 
DX2 to 3-state all of its outputs and enter 
the power down mode. When the Upgrade Present pin is sampled 
asserted 
by the CPU in 
the clock before the falling edge of RESET, the power down mode is enabled. 
UP# 
has 
no effect on the power down status except during this edge. The CPU is also forced to 3- 
state all of it's outputs 
immediately 
in response 
to this signal. The UP# 
signal must 
remain asserted 
in order to keep the pins 3-stated. 
UP # is active low and is provided 
with 
an internal 
pull-up resistor. 


Name 
Active 
When 
Level 
Floated 


BREQ 
HIGH 
HLDA 
HIGH 
BEO#-BE3# 
LOW 
Bus Hold 
PWT, PCD 
HIGH 
Bus Hold 
W/R#, 
D/C#, 
M/IO# 
HIGH 
Bus Hold 
LOCK# 
LOW 
Bus Hold 
PLOCK # 
LOW 
Bus Hold 
ADS# 
LOW 
Bus Hold 
BLAST # 
LOW 
Bus Hold 
PCHK# 
LOW 
FERR# 
LOW 
RES_B 
LOW 
A2-A3 
HIGH 
Bus, Address 
Hold 


Name 
Active 
When 
Level 
Floated 


DO-D31 
HIGH 
Bus Hold 
DPO-DP3 
HIGH 
Bus Hold 
A4-A31 
HIGH 
Bus, Address 
Hold 


Name 
Active 
Synchronousl 
Level 
Asynchronous 


CLK 
RESET 
HIGH 
Asynchronous 
HOLD 
HIGH 
Synchronous 
AHOLD 
HIGH 
Synchronous 
EADS# 
LOW 
Synchronous 
BOFF# 
LOW 
Synchronous 
FLUSH# 
LOW 
Asynchronous 
A20M# 
LOW 
Asynchronous 
BS16#, 
BS8# 
LOW 
Synchronous 
KEN# 
LOW 
Synchronous 
RDY# 
LOW 
Synchronous 
BRDY# 
LOW 
Synchronous 
INTR 
HIGH 
Asynchronous 
NMI 
HIGH 
Asynchronous 
IGNNE# 
LOW 
Asynchronous 
RESJ 
LOW 
Asynchronous 
UP# 
LOW 
Asynchronous 


Name 
Input or 
sampledl 
Output 
Driven On 


TCK 
Input 
N/A 


TOI 
Input 
Rising Edge of TCK 


TDO 
Output 
Falling Edge of TCK 


TMS 
Input 
Rising Edge of TCK 


The Intel486 OX2 microprocessor is a fully compati- 
ble member of the Intel486 family. 


The Intel486 microprocessor family is a 32·bit archi- 
tecture with on-chip memory management, floating 
point and cache memory units. 


The Intel486 OX microprocessor contains all the 
features of the Intel386™ microprocessor with en- 
hancements to increase performance. The instruc- 
tion set includes the complete Intel386 microproces- 
sor instruction set along with extensions to serve 
new applications. The on-chip memory management 
unit (MMU) is completely compatible with the In- 
tel386 microprocessor MMU. The Intel486 OX mi- 
croprocessor brings the Intel387™ math coproces- 
sor on-chip. All software written for the Intel386 mi- 
croprocessor, Intel387 math coprocessor and previ- 
ous members of the 86/87 architectural family will 
run on the Intel486 OX microprocessor without any 
modifications. 


Several enhancements have been added to the In- 
tel486 OX microprocessor to increase performance. 
On-chip cache memory allows frequently used data 
and code to be stored on-chip reducing accesses to 
the external bus. A clock doubler has been added to 
speed up internal operations to twice that of an In- 
tel486 OX microprocessor running with the same 
bus clock. RiSe design techniques have been used 
to reduce instruction cycle times. A burst bus feature 
enables fast cache fills. All of these features, com- 
bined, lead to performance greater than triple that of 
a Intel386 microprocessor. 


The memory management unit (MMU) consists of a 
segmentation unit and a paging unit. Segmentation 
allows management of the logical address space by 
providing easy data and code relocatibility and effi- 
cient sharing of global resources. The paging mech- 
anism operates beneath segmentation and is trans- 
parent to the segmentation process. Paging is op- 
tional and can be disabled by system software. Each 
segment can be divided into one or more 4 Kbyte 
segments. To implement a virtual memory system, 
the Intel486 OX microprocessor supports full restart- 
ability for all page and segment faults. 


Memory is organized into one or more variable 
length segments, each up to four gigabytes (232 
bytes) in size. A segment can have attributes associ- 
ated with it which include its location, size, type (I.e., 
stack, code or data), and protection characteristics. 
Each task on a Intel486 OX microprocessor can 
have a maximum of 16,381 segments, each up to 
four gigabytes in size. Thus each task has a maxi- 
mum of 64 terabytes (trillion bytes) of virtual memo- 
ry. 


The segmentation unit provides four-levels of pro- 
tection for isolating and protecting applications and 
the operating system from each other. The hardware 
enforced protection allows the design of systems 
with a high degree of integrity. 


The Intel486 OX microprocessor has two modes of 
operation: Real Address Mode (Real Mode) and Pro- 
tected 
Mode 
Virtual 
Address 
Mode 
(Protected 
Mode). In Real Mode the Intel486 OX microproces- 
sor operates as a very fast 8086. Real Mode is re- 
quired primarily to set up the processor for Protected 
Mode operation. Protected Mode provides access to 
the sophisticated memory management paging and 
privilege capabilities of the processor. 


Within Protected Mode, software can perform a task 
switch to enter into tasks designated as Virtual 8086 
Mode tasks. Each virtual 8086 task behaves with 
8086 semantics, allowing 8086 software (an applica- 
tion program or an entire operating system) to exe· 
cute. 


The on-chip floating point unit operates in parallel 
with the arithmetic and logic unit and provides arith- 
metic instructions for a variety of numeric data types. 
It executes numerous built-in transcendental func- 
tions (e.g., tangent, sine, cosine, and log functions). 
The floating point unit fully conforms to the ANSI/ 
IEEE standard 754-1985 for floating point arithmetic. 


The on-chip cache is 8 Kbytes in size. It is 4-way set 
associative and follows a write-through policy. The 
on-chip cache includes features to provide flexibility 
in external memory system design. Individual pages 
can be designated as cacheable or non-cacheable 
by software or hardware. The cache can also be en- 
abled and disabled by software or hardware. 


Finally the Intel486 OX2 microprocessor has fea- 
tures to facilitate high performance hardware de- 
signs. The 1x bus clock eases high frequency board 
level designs. While the 2x clock doubler improves 
execution performance without increasing the board 
design complexity. This 2x clock doubler enhances 
all operations operating out of the cache and/or not 
blocked by external bus accesses. The burst bus 
feature enables fast cache fills. These features are 
described beginning in Section 6. 


The Intel486 OX microprocessor register set in- 
cludes all the registers contained in the Intel386 mi- 
croprocessor and the Intel387 math coprocessor. 
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The register set can be split into the following cate- 
gories: 


Base Architecture Registers 
General Purpose Registers 
Instruction Pointer 
Flags Register 
Segment Registers 


Systems Level Registers 
Control Registers 
System Address Registers 


Floating Point Registers 
Data Registers 
Tag Word 
Status Word 
Instruction and Data Pointers 
Control Word 


General 
Purpose 
Registers 


31 
24123 
16 
15 
617 
0 


AH 
Ax 
AL 
EAX 


BH 
~X 
BL 
EBX 


CH 
~X 
CL 
ECX 


OH 
DX 
OL 
EOX 


Sl 
ESI 


01 
EOI 


BP 
EBP 


SP 
ESP 


Segment 
Registers 
15 
0 


CS 
Code Segment 


55 
Stack 
Segment 
00) 
ES 
Data Segments 
FS 


GS 


Instruction 
Pointer 
31 
16 
15 
0 
I 
I 
IP 
I EIP 


Flags Register 
I 
I 
FLAGS 
I EFLAGS 
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The base architecture and floating point registers 
are accessible by the applications program. The sys- 
tem level registers are only accessible at privilege 
level 0 and are used by the systems level program. 
The debug and test registers are also only accessi- 
ble at privilege level O. 


Figure 2.1 shows the Intel486 OX2 microprocessor 
base architecture registers. The contents of these 
registers are task-specific and are automatically 
loaded with a new context upon a task switch opera- 
tion. 


The base architecture includes six directly accessi- 
ble descriptors, each specifying a segment up to 
4 Gbytes in size. The descriptors are indicated by 
the selector values placed in the Intel486 OX2 mi- 
croprocessor segment registers. Various selector 
values can be loaded as a program executes. 


The selectors are also task-specific, so the segment 
registers are automatically loaded with new context 
upon a task switch operation. 


2.1.1.1 
General 
Purpose 
Registers 


The eight 32-bit general 
purpose registers are 
shown in Figure 2.1. These registers hold data or 
address quantities. The general purpose registers 
can support data operands of 1, 8, 16 and 32 bits, 
and bit fields of 1 to 32 bits. Address operands of 16 
and 32 bits are supported. The 32-bit registers are 
named EAX, EBX, ECX, EOX, ESI, EOI, EBP and 
ESP. 


The least significant 16 bits of the general purpose 
registers can be accessed separately by using the 
16-bit names of the registers AX, BX, CX, OX, SI, 01, 
BP and SP. The upper 16 bits of the register are not 
changed when the lower 16 bits are accessed sepa- 
rately. 


Finally 8-bit operations can individually access the 
lowest byte (bits 0-7) and the higher byte (bits 8- 
15) of the general purpose registers AX, BX, CX and 
OX.The lowest bytes are named AL, BL, CL and OL 
respectively. The higher bytes are named AH, BH, 
CH and OH respectively. The individual byte acces- 
sibility offers additional flexibility for data operations 
but is not used for effective address calculation. 


The instruction pointer, shown in Figure 2.1, is a 32- 
bit register named EIP. EIP holds the offset of the 
next instruction to be executed. The offset is always 
relative to the base of the code segment (CS). The 
lower 16 bits (bits 0-15) of the EIP contain the 16-bit 


CARRY 
FLAG 


PARITY 
FLAG 


AUXILIARY 
CARRY 


ZERO 
FLAG 


SIGN 
FLAG 


TRAP 
FLAG 


ALIGN WENT 
CHECK 


VIRTUAL 
WODE 


RESUWE 
FLAG 


NESTED 
TASK 
FLAG 


I/o 
PRIVILEGE 
LEVEL 


OVERFLOW 


DIRECTION 
FLAG 


INTERRUPT 
ENABLE 


NOTE: 
o indicates 
Intel Reserved: 
do not define; 
see Section 
2.1.6. 


instruction pointer named IP,which is used for 16-bit 
addressing. 
register and it enables faults on accesses to misa- 
ligned data. 
AC 
(Alignment Check, bit 18) 
The AC bit enables the generation of faults if a 
memory reference is to a misaligned address. 
Alignment faults are enabled when AC is set 
to 1. A mis-aligned address is a word access 
to an odd address, a dword access to an ad- 
dress that is not on a dword boundary, or an 
8-byte reference to an address that is not on a 
64-bit word boundary. See Section 7.1.6 for 
more information on operand alignment. 
Alignment faults are only generated by pro- 
grams running at privilege level 3. The AC bit 
setting is ignored at privilege levels 0, 1 and 2. 
Note that references to the descriptor tables 
(for selector loads), or the task state segment 
(TSS), are implicitly level 0 references even if 
the instructions causing the references are 
executed at level 3. Alignment faults are re- 
ported through interrupt 17, with an error code 
of o. Table 2.1 gives the alignment required 
for 
the 
Intel486 
OX microprocessor 
data 
types. 


Table 2.1. Data Type Alignment 
Requirements 


The 
flags 
register 
is 
a 
32-bit 
register 
named 
EFLAGS. The defined bits and bit fields within 
EFLAGS control certain operations and indicate 
status of the Intel486 DX2 microprocessor. The low- 
er 16 bits (bits 0-15) of EFLAGS contain the 16-bit 
register named FLAGS, which is most useful when 
executing 8086 and 80286 code. EFLAGS is shown 
in Figure 2.2. 


EFLAGS bits 1, 3, 5, 15 and 19-31 are "undefined". 
When these bits are stored during interrupt process- 
ing or with a PUSHF instruction (push flags onto 
stack), a one is stored in bit 1 and zeros in bits 3, 5, 
15 and 19-31. 


The EFLAGS register in the Intel486 OX microproc- 
essor contains a new bit not previously defined. The 
new bit, AC, is defined in the upper 16 bits of the 


Memory 
Access 
Alignment 
(Byte Boundary) 


Word 
2 
Dword 
4 
Single Precision Real 
4 
Double Precision Real 
8 
Extended Precision Real 
8 
Selector 
2 
48-Bit Segmented Pointer 
4 
32-Bit Flat Pointer 
4 
32-Bit Segmented Pointer 
2 
48-Bit "Pseudo-Descriptor" 
4 
FSTENVIFLDENV Save Area 
4/2 (On Operand Size) 
FSAVE/FRSTOR Save Area 
4/2 (On Operand Size) 
Bit String 
4 


• 


IMPLEMENTATiON 
NOTE: 
Several instructions on the Intel486 DX microproc- 
essor generate misaligned references, even if their 
memory address is aligned. For example, on the In- 
tel486 DX microprocessor, the SGDT/SIDT (store 
global/interrupt descriptor table) instruction reads/ 
writes two bytes, and then reads/writes four bytes 
from a "pseudo-descriptor" 
at the given address. 


The Intel486 DX microprocessor will generate misa- 
ligned references unless the address is on a 2 mod 
4 boundary. The FSAVE and FRSTOR instructions 
(floating point save and restore state) will generate 
misaligned references for one-half of the register 
save/restore cycles. The Intel486 DX microproces- 
sor will not cause any AC faults if the effective ad- 
dress given in the instruction has the proper align- 
ment. 


VM 
(Virtual 8086 Mode, bit 17) 
The VM bit provides Virtual 8086 Mode within 
Protected Mode. If set while the Intel486 DX 
microprocessor is in Protected Mode, the In- 
tel486 DX microprocessor will switch to Virtual 
8086 operation, handling segment loads as 
the 8086 does, but generating exception 13 
faults on privileged opcodes. The VM bit can 
be set only in Protected Mode, by the IRET 
instruction (if current privilege level = 0) and 
by task switches at any privilege level. The 
VM bit is unaffected by POPF. PUSHF always 
pushes a 0 in this bit, even if executing in Vir- 
tual 8086 Mode. The EFLAGS image pushed 
during interrupt processing or saved during 
task switches will contain a 1 in this bit if the 
interrupted code was executing as a Virtual 
8086 Task. 
RF 
(Resume Flag, bit 16) 
The RF flag is used in conjunction with the 
debug register breakpoints. It is checked at 
instruction boundaries before breakpoint pro- 
cessing. When RF is set, it causes any debug 
fault to be ignored on the next instruction. RF 
is then automatically reset at the successful 
completion of every instruction (no faults are 
signalled) except the IRET instruction, the 
POPF instruction, (and JMP, CALL, and INT 
instructiqns causing a task switch). These in- 
structions set RF to the value specified by the 
memory image. For example, at the end of the 
breakpoint service routine, the IRET instruc- 
tion can pop an EFLAG image having the RF 
bit set and resume the program's execution at 
the breakpoint address without generating an- 
other breakpoint fault on the same location. 
NT 
(Nested Task, bit 14) 
This flag applies to Protected Mode. NT is set 
to indicate that the execution of this task is 
nested within another task. If set, it indicates 


that the current nested task's Task State Seg- 
ment (TSS) has a valid back link to the previ- 
ous task's TSS. This bit is set or reset by con- 
trol transfers to other tasks. The value of NT 
in EFLAGS is tested by the IRET instruction to 
determine whether to do an inter-task return 
or an intra-task return. A POPF or an IRET 
instruction will affect the setting of this bit ac- 
cording to the image popped, at any privilege 
level. 


10PL (Input/Output Privilege Level, bits 12-13) 
This two-bit field applies to Protected Mode. 
10PL indicates the numerically maximum CPL 
(current privilege level) value permitted to ex- 
ecute I/O instructions without generating an 
exception 13 fault or consulting the I/O Per- 
mission Bitmap. It also indicates the maximum 
CPL value allowing alteration of the IF (INTR 
Enable Flag) bit when new values are popped 
into the EFLAG register. POPF and IRET in- 
struction can alter the 10PLfield when execut- 
ed at CPL = O.Task switches can always al- 
ter the 10PL field, when the new flag image is 
loaded from the incoming task's TSS. 
OF 
(Overflow Flag, bit 11) 
OF is set if the operation resulted in a signed 
overflow. Signed overflow occurs when the 
operation resulted in carry/borrow 
Into the 
sign bit (high-order bit) of the result but did not 
result in a carry/borrow out of the high-order 
bit, or vice-versa. For 8-, 16-, 32-bit opera- 
tions, OF is set according to overflow at bit 7, 
15, 31, respectively. 
DF 
(Direction Flag, bit 10) 
DF defines whether ESI and/or EDI registers 
postdecrement or postincrement during the 
string instructions. Postincrement occurs if DF 
is reset. Postdecrement occurs if DF is set. 


IF 
(INTR Enable Flag, bit 9) 
The IF flag, when set, allows recognition of 
external interrupts signalled on the INTR pin. 
When IF is reset, external interrupts signalled 
on the INTR are not recognized. 10PL indi- 
cates the maximum CPL value allowing altera- 
tion of the IF bi~when new values are popped 
into EFLAGS or FLAGS. 
TF 
(Trap Enable Flag, bit 8) 
TF controls the generation of exception 1 trap 
when single-stepping through code. When TF 
is set, the Intel486 DX microprocessor gener- 
ates an exception 1 trap after the next instruc- 
tion is executed. When TF is reset, exception 
1 traps occur only as a function of the break- 
point addresses loaded into debug registers 
DRO-DR3. 


SF 
(Sign Flag, bit 7) 


SF is set if the high-order 
bit of the result 
is 
set, 
it is reset 
otherwise. 
For 8-, 16-, 32-bit 
operations, 
SF reflects 
the state of bit 7, 15, 
31 respectively. 


ZF 
(Zero Flag, bit 6) 


ZF is set if all bits of the result are O. Other- 
wise it is reset. 


AF 
(Auxiliary 
Carry Flag, bit 4) 


The Auxiliary 
Flag is used to simplify the addi- 
tion 
and 
subtraction 
of packed 
BCD 
quanti- 
ties. 
AF is set if the operation 
resulted 
in a 
carry out of bit 3 (addition) 
or a borrow into bit 
3 (subtraction). 
Otherwise 
AF is reset. 
AF is 
affected 
by carry out of, or borrow 
into bit 3 
only, regardless 
of overall 
operand 
length: 
8, 
16 or 32 bits. 


PF 
(Parity Flags, bit 2) 


PF is set if the low-order 
eight bits of the oper- 
ation contains 
an even number of "1's" 
(even 
parity). 
PF is reset 
if the low-order 
eight 
bits 
have odd parity. 
PF is a function 
of only the 
low-order 
eight 
bits, 
regardless 
of 
operand 
size. 


CF 
(Carry Flag, bit 0) 


CF is set if the operation 
resulted 
in a carry 
out of (addition), 
or a borrow into (subtraction) 
the high-order 
bit. Otherwise 
CF is reset. 
For 
8-, 16- or 32-bit operations, 
CF is set accord- 
ing to carry/borrow 
at bit 7, 15 or 31, respec- 
tively. 


SEGMENT 
REGISTERS 


NOTE: 
In these 
descriptions, 
"set" 
means 
"set 
to 1," and 
"reset" 
means 
"reset 
to 0." 


2.1.1.4 
Segment 
Registers 


Six 16-bit segment 
registers 
hold segment 
selector 
values 
identifying 
the currently 
addressable 
memory 
segments. 
In protected 
mode, 
each 
segment 
may 
range 
in size from 
one byte up to the entire 
linear 
and 
physical 
address 
space 
of 
the 
machine, 
4 
Gbytes 
(232 bytes). 
In real address 
mode, the maxi- 
mum segment 
size is fixed at 64 Kbytes (216 bytes). 


The 
six addressable 
segments 
are defined 
by the 
segment 
registers 
CS, SS, DS, ES, FS and GS. The 
selector 
in CS indicates 
the current 
code 
segment; 
• 
the selector 
in SS indicates 
the current 
stack 
seg- 
ment; the selectors 
in DS, ES, FS and GS indicate 
the current 
data segments. 


2.1.1.5 
Segment 
Descriptor 
Cache 
Registers 


The segment 
descriptor 
cache registers 
are not pro- 
grammer 
visible, 
yet it is very useful 
to understand 
their 
content. 
A 
programmer 
invisible 
descriptor 
cache register 
is associated 
with each programmer- 


visible 
segment 
register, 
as shown 
by Figure 
2.3. 


Each descriptor 
cache 
register 
holds a 32-bit 
base 
address, a 32-bit segment 
limit, and the other neces- 
sary segment 
attributes. 


Other 
\ 
Segment 
Attributes 
from Descriptor 


Selector 


Selector 


Selector 


Selector 


Selector 


Selector 


o 
Physical Base Address 
Segment 
Limit 


CS- 


SS- 


DS- 


ES- 
FS- 


GS- 


- 
- 
- 
- - - 
- - - 
- - - 
- - - 


Figure 
2.3. Intei486™ 
OX Microprocessor 
Segment 
Registers 
and Associated 
Descriptor 
Cache 
Registers 


When a selector 
value is loaded 
into a segment 
reg- 
ister, the associated 
descriptor 
cache register 
is au- 
tomatically 
updated 
with the correct 
information. 
In 
Real Address 
Mode, only the base address 
is updat- 
ed directly 
(by shifting 
the selector 
value four bits to 
the left), since the segment 
maximum 
limit and attri- 
butes 
are fixed 
in Real 
Mode. 
In Protected 
Mode, 
the base address, 
the limit, and the attributes 
are all 
updated 
per the contents 
of the segment 
descriptor 


I indexed 
by the selector. 


Whenever 
a memory 
reference 
occurs, the segment 
descriptor 
cache 
register 
associated 
with 
the seg- 
ment 
being 
used is automatically 
involved 
with the 
memory 
reference. 
The 
32-bit 
segment 
base 
ad- 
dress 
becomes 
a component 
of the linear address 
calculation, 
the 32-M 
limit is used for the limit-check 
operation, 
and 
the 
attributes 
are checked 
against 
the type of memory 
reference 
requested. 


The system level registers, 
Figure 2.4, control 
opera- 
tion of the on-chip 
cache, 
the on-chip 
floating 
point 
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unit (FPU) and the segmentation 
and paging mecha- 
nisms. 
These 
registers 
are only accessible 
to pro- 
grams 
running 
at privilege 
level D, the highest 
privi- 
lege level. 


The system 
level registers 
include three control 
reg- 
isters 
and 
four 
segmentation 
base 
registers. 
The 
three control 
registers 
are CRD, CR2 and CR3. CR1 
is reserved 
for future 
Intel processors. 
The four seg- 
mentation 
base registers 
are the Global 
Descriptor 
Table Register 
(GDTR), the Interrupt 
Descriptor 
Ta- 
ble Register 
(IDTR), the Local Descriptor 
Table Reg- 
ister (LDTR) 
and the Task State 
Segment 
Register 
(TR). 


2.1.2.1 Control Registers 


Control Register 0 (CRO) 


CRD, shown 
in Figure 2.5, contains 
1D bits for con- 
trol and status 
purposes. 
Five of the bits defined 
in 
the Intel486 
DX microprocessor's 
CRD are newly de- 
fined. 
The new bits are CD, NW, AM, WP and NE. 


The function 
of the bits in CRD can be categorized 
as follows: 


31 
24123 
16115 
817 
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SYSTEM 
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NOTE: 
indicates 
Intel reserved: 
Do not define: 
See Section 
2.1.6 


Intel486 
OX Microprocessor 
Operating 
Modes: PG, 
PE (Table 2.2) 


On-Chip 
Cache 
Control 
Modes: 
CD, NW (Table 2.3) 


On-Floating 
Point Unit Control: TS, EM, MP, NE 


(Table 2.4) 


Alignment 
Check 
Control: 
AM 


Supervisor 
Write Protect: 
WP 


PG 
PE 
Mode 


0 
0 
REAL Mode. Exact 8086 semantics, 
with 32-bit extensions 
available 
with 
prefixes. 


0 
1 
Protected 
Mode. Exact 80286 
semantics, 
plus 32-bit extensions 
through 
both prefixes and "default" 
prefix setting associated 
with code 
segment 
descriptors. 
Also, a sub- 
mode is defined to support a virtual 
8086 within the context 
of the 
extended 
80286 protection 
model. 


1 
0 
UNDEFINED. 
Loading CRO with this 
combination 
of PG and PE bits will 
raise a GP fault with error code O. 


1 
1 
Paged Protected 
Mode. All the 


facilities 
of Protected 
mode, with 
paging enabled 
underneath 
segmentation. 


CD 
NW 
Operating 
Mode 


1 
1 
Cache fills disabled, 
write-through 
and 


invalidates 
disabled. 


1 
0 
Cache fills disabled, 
write-through 
and 
invalidates 
enabled. 


0 
1 
INVALID. 
If CRO is loaded with this 
configuration 
of bits, a GP fault with 


error code is raised. 


0 
0 
Cache fills enabled, 
write-through 
and 


invalidates 
enabled. 


CROBIT 
Instruction 
Type 


EM 
TS 
MP 
Floating-Point 
Walt 


0 
0 
0 
Execute 
Execute 
0 
0 
1 
Execute 
Execute 
0 
1 
0 
Trap 7 
Execute 


0 
1 
1 
Trap 7 
Trap 7 


1 
0 
0 
Trap 7 
Execute 


1 
0 
1 
Trap 7 
Execute 


1 
1 
0 
Trap 7 
Execute 


1 
1 
1 
Trap 7 
Trap 7 


The low-order 
16 bits of CRO are also known as the 


Machine 
Status 
Word 
(MSW), for compatibility 
with 


the 80286 protected 
mode. LMSW and SMSW (load 


and 
store 
MSW) 
instructions 
are taken 
as special 


aliases of the load and store CRO operations, 
where 


only the low-order 
16 bits of CRO are involved. 
The 


LMSW 
and SMSW 
instructions 
in the 
Intel486 
OX 


microprocessor 
work 
in an identical 
fashion 
to the 


LMSW 
and 
SMSW 
instructions 
in the 
80286 
(I.e., 


they only operate 
on the low-order 
16 bits of CRO 


and ignores 
the new bits). New Intel486 
OX micro- 


processor 
operating 
systems 
should 
use the 
MOV 


CRO, Reg instruction. 


The defined 
CRO bits are described 
below. 


PG 
(Paging Enable, 
bit 31) 


The PG bit is used to indicate 
whether 
paging is 2 


enabled 
(PG=1) 
or disabled 
(PG=O). 
See Ta- 


ble 2.2. 


CD 
(Cache 
Disable, 
bit 30) 


The CD bit is used to enable the on-chip 
cache. 


When 
CD = 1, the 
cache 
will 
not 
be filled 
on 


cache misses. When CD=O, 
cache fills may be 


performed 
on misses. 
See Table 2.3. 


The state of the CD bit, the cache enable 
input 


pin (KEN#), 
and the relevant 
page cache 
dis- 


able 
(PCD) 
bit determine 
if a line read 
in re- 


sponse 
to a cache 
miss will be installed 
in the 


cache. 
A line is installed 
in the 
cache 
only 
if 


CD = 0 and KEN # and PCD are both zero. The 
relevant 
PCD bit comes 
from 
either 
the 
page 


table entry, page directory 
entry or control 
reg- 


ister 3. Refer to Section 
5.6 for more details 
on 


page cacheability. 


CD is set to one after RESET. 


NW (Not Write-Through, 
bit 29) 


The 
NW 
bit 
enables 
on-chip 
cache 
write- 


throughs 
and write-invalidate 
cycles 
(NW = 0). 


When 
NW = 0, all writes, 
including 
cache 
hits, 


are sent out to the pins. 
Invalidate 
cycles 
are 


enabled 
when 
NW = O. During an invalidate 
cy- 


cle a line will be removed 
from the cache 
if the 


invalidate 
address 
hits in the cache. 
See Table 


2.3. 


When 
NW = 1, write-throughs 
and 
write-invali- 


date cycles are disabled. 
A write will not be sent 


to the pins if the write 
hits in the cache. 
With 


NW = 1 the only write cycles that reach the ex- 
ternal 
bus 
are 
cache 
misses. 
Write 
hits 
with 


NW= 
1 will never update 
main memory. 
Invali- 


date cycles 
are ignored 
when NW = 1. 


AM 
(Alignment 
Mask, bit 18) 


The 
AM 
bit 
controls 
whether 
the 
alignment 


check (AC) bit in the flag register 
(EFLAGS) 
can 


allow 
an alignment 
fault. 
AM = 0 disables 
the 


AC bit. AM = 1 enables 
the AC bit. AM = 0 is the 


Intel386 
microprocessor 
compatible 
mode. 


Intel386 microprocessor software may load in- 
correct data into the AC bit in the EFLAGS reg- 
ister. Setting AM = 0 will prevent AC faults from 
occurring before the Intel486 DX microproces- 
sor has created the AC interrupt service routine. 


WP (Write Protect, bit 16) 
WP protects read-only pages from supervisor 
write access. The Intel386 microprocessor al- 
lows a read-only page to be written from privi- 
lege levels 0-2. The Intel486 DX microproces- 
sor is compatible with the Intel386 microproces- 
sor when WP= O. WP= 1 forces a fault on a 
write to a read-only page from any privilege lev- 
el. Operating systems with Copy-on-Write fea- 
tures can be supported with the WP bit. Refer 
to Section 4.5.3 for further details on use of the 
WP bit. 


NE (Numerics Exception, bit 5) 
The NE bit controls whether unmasked floating 
point exceptions (UFPE) are handled through 
interrupt vector 16 (NE= 1) or through an exter- 
nal interrupt (NE=.0). NE= 0 (default at reset) 
supports the DOS operating system error re- 
porting scheme from the 8087, 80287 and In- 
tel387 
math coprocessor. In DOS systems, 
math coprocessor errors are reported via exter- 
nal interrupt vector 13. DOS uses interrupt vec- 
tor 16 for an operating system call. Refer to 
Sections 6.2.13 and 7.2.14 for more information 
on floating point error reporting. 
For any UFPE the floating point error output pin 
(FERR#) will be driven active. 
For NE= 0, the Intel486 DX microprocessor 
works in conjunction with the ignore numeric er- 
ror input (IGNNE#) 
and the FERR# 
output 
pins. When a UFPE occurs and the IGNNE# 
input is inactive, the Intel486 DX microproces- 
sor freezes immediately before executing the 
next floating point instruction. An external inter- 
rupt controller will supply an interrupt vector 
when FERR# is driven active. The UFPE is ig- 
nored if IGNNE# is active and floating point ex- 
ecution continues. 


NOTE: 
The freeze does not take place if the next in- 
struction 
is one of the 
control 
instructions 
FNCLEX, 
FNINIT, 
FNSAVE, 
FNSTENV, 
FNSTCW, FNSTSW, FNSTSW AX, 
FNENI, 
FNDISI and FNSETPM. The freeze does occur 
if the next instruction is WAIT. 
For NE= 1, any UFPE will result in a software 
interrupt 16, immediately before executing the 
next non-control floating point or WAIT instruc- 
tion. The ignore numeric error input (IGNNE#) 
signal will be ignored. 


TS 
(Task Switched, bit 3) 
The TS bit is set whenever a task switch opera- 
tion is performed. Execution of a floating point 
instruction with TS = 1 will cause a device not 
available (DNA) fault (trap vector 7). If TS= 1 
and MP= 1 (monitor coprocessor in CRO) a 
WAIT instruction will cause a DNA fault. See 
Table 2.4. 


EM (Emulate Coprocessor, bit 2) 
The EM bit determines whether floating point 
instructions are trapped (EM= 1) or executed. If 
EM= 1, all floating point instructions will cause 
fault 7. 


NOTE: 
WAIT instructions are not affected by the state 
of EM. See Table 2.4. 
MP (Monitor Coprocessor, bit 1) 
The MP bit is used in conjunction with the TS bit 
to determine if WAIT instructions should trap. If 
MP= 1 and TS= 1, WAIT instructions cause 
fault 7. Refer to Table 2.4. The TS bit is set to 1 
on task switches by the Intel486 DX microproc- 
essor. Floating point instructions are not affect- 
ed by the state of the MP bit. It is recommended 
that the MP bit be set to one for the normal 
operation of the Intel486 DX microprocessor. 


PE (Protection Enable, bit 0) 
The PE bit enables the segment based protec- 
tion mechanism. If PE= 1 protection is enabled. 
When PE= 0 the Intel486 DX microprocessor 
operates in REAL mode, with segment based 
protection disabled, and addresses formed as 
in an 8086. Refer to Table 2.2. 


All new CRObits added to the Intel386 and Intel486 
DX microprocessors, except for ET and NE, are up- 
ward compatible with the 80286 because they are in 
register bits not defined in the 80286. For strict com- 
patibility with the 80286, the load machine status 
word (LMSW) instruction is defined to not change 
the ET or NE bits. 


CR1 is reserved for use in future Intel microproces- 
sors. 


CR2, shown in Figure 2.6, holds the 32-bit linear ad- 
dress that caused the last page fault detected. The 
error code pushed onto the page fault handler's 
stack when it is invoked provides additional status 
information on this page fault. 
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P 
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PAGE 
DIRECTORY 
BASE 
REGISTER 
o 0 o 
0 o 0 0 
C 
W o 
0 
0 
D 
T 


NOTE: 
o indicates 
Intel reserved: 
Do not define; 
See Section 
2.1.6. 


CR3, 
shown 
in Figure 
2.6, 
contains 
the 
physical 
base 
address 
of the 
page 
directory 
table. 
The 
In- 
tel486 
OX microprocessor 
page directory 
is always 
page aligned 
(4 Kbyte-aligned). 
This alignment 
is en- 
forced 
by only storing 
bits 20-31 
in CR3. 


In the Intel486 
OX microprocessor 
CR3 contains 
two 
new bits, page write-through 
(PWT) (bit 3) and page 


. cache 
disable 
(PCO) (bit 4). The 
page table 
entry 
(PTE) and page directory 
entry 
(POE) also contain 
PWT 
and 
PCO bits. 
PWT 
and 
PCO control 
page 
cacheability. 
When 
a page 
is accessed 
in external 
memory, 
the state of PWT and PCO are driven 
out 
on the PWT and PCO pins. The source 
of PWT and 
PCO can be CR3, the 
PTE or the 
POE. PWT and 
PCO are sourced 
from CR3 when the POE is being 
updated. 
When paging is disabled 
(PG = 0 in CRO), 
PCO and PWT are assumed 
to be 0, regardless 
of 
their state in CR3. 


A task 
switch 
through 
a task 
state 
segment 
(TSS) 
which changes 
the values in CR3, or an explicit 
load 
into CR3 with 
any value, 
will invalidate 
all cached 
page table entries 
in the translation 
lookaside 
buffer 
(TLB). 


The page directory 
base address 
in CR3 is a physi- 
cal address. 
The 
page directory 
can be paged 
out 
while its associated 
task is suspended, 
but the oper- 
ating system 
must ensure that the page directory 
is 
resident 
in physical 
memory 
before 
the task is dis- 
patched. 
The entry in the TSS for CR3 has a physi- 
cal address, 
with no provision 
for a present 
bit. This 
means 
that 
the 
page 
directory 
for a task 
must 
be 
resident 
in physical 
memory. 
The CR3 
image 
in a 
TSS must point to this area, before 
the task can be 
dispatched 
through 
its TSS. 


2.1.2.2 
System 
Address 
Registers 


Four special 
registers 
are defined 
to reference 
the 
tables 
or 
segments 
supported 
by the 
80286, 
In- 
tel386 
and 
Intel486 
OX microprocessor 
protection 
model. These tables 
or segments 
are; 


GOT (Global 
Descriptor 
Table) 
lOT (Interrupt 
Descriptor 
Table) 
LOT (Local 
Descriptor 
Table) 
TSS (Task State Segment) 


The 
addresses 
of these 
tables 
and 
segments 
are 


stored 
in special 
registers, 
the System 
Address 
and 


System 
Segment 
Registers, 
illustrated 
in Figure 2.4. 


These 
registers 
are named GOTR, IOTR, LOTR and 
TR respectively. 
Section 
4, Protected 
Mode 
Archi- 
tecture, 
describes 
the use of these 
registers. 


The GOTR and IOTR hold the 32-bit linear base ad- 
dress 
and 16-bit limit of the GOT and lOT, respec- 


tively. 


Since the GOT and lOT segments 
are global 
to all 


tasks in the system, the GOT and lOT are defined 
by 


32-bit linear addresses 
(subject to page translation 
if 


paging 
is enabled) 
and 16-bit limit values. 


System 
Segment 
Registers: 
LDTR and TR 


The 
LOTR and TR hold the 
16-bit 
selector 
for the 
LOT descriptor 
and the TSS descriptor, 
respectively. 


Since the LOT and TSS segments 
are task specific 


segments, 
the LOT and TSS are defined 
by selector 
values 
stored 
in the system 
segment 
registers. 


NOTE: 
A programmer-invisible 
segment 
descriptor 
register 


is associated 
with each system 
segment 
register. 


• 


Figure 2.7 shows the floating point register set. The 
on-chip FPU contains eight data registers, a tag 
word, a control register, a status register, an instruc- 
tion pointer and a data pointer. 
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1 
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The operation of the Intel486 OX microprocessor's 
on-chip floating point unit is exactly the same as the 
Intel387 math coprocessor. Software written for the 
Intel387 math coprocessor will run on the on-chip 
floating point unit (FPU) without any modifications. 


2.1.3.1 Data Registers 


Floating point computations use the Intel486 OX mi- 
croprocessor's FPU data registers. These eight 80- 
bit registers provide the equivalent capacity of twen- 
ty 32-bit registers. Each of the eight data registers is 


intel~ 


divided into "fields" corresponding to the FPU's ex- 
tended-precision data type. 


The FPU's register set can be accessed either as a 
stack, with instructions operating on the top one or 
two stack elements, or as a fixed register set, with 
instructions operating on explicitly designated regis- 
ters. The TOP field in the status word identifies the 
current top-of-stack register. A "push" 
operation 
decrements TOP by one and loads a value into the 
new top register. A "pop" operation stores the value 
from the current top register and then increments 
TOP by one. Like other Intel486 OX microprocessor 
stacks in memory, the FPU register stack grows 
"down" toward lower-addressed registers. 


Instructions may address the data registers either 
implicitly or explicitly. Many instructions operate on 
the register at the TOP of the stack. These instruc- 
tions implicitly address the register at which TOP 
points. Other instructions allow the programmer to 
explicitly specify which register to use. This explicit 
register addressing is also relative to TOP. 


The tag word marks the content of each numeric 
data register, as shown in Figure 2.8. Each two-bit 
tag represents one of the eight data registers. The 
principal function of the tag word is to optimize the 
FPUs performance and stack handling by making it 
possible to distinguish between empty and nonemp- 
ty register locations. It also enables exception han- 
dlers to check the contents of a stack location with- 
out the need to perform complex decoding of the 
actual data. 


2.1.3.3 Status Word 


The 16-bit status word reflects the overall state of 
the FPU. The status word is shown in Figure 2.9 and 
is located in the status register. 


15 
0 


TAG (7) 
I 
TAG (6) 
I 
TAG (5) 
I 
TAG (4) 
I 
TAG (3) 
I 
TAG (2) 
I 
TAG (1) 
I 
TAG (0) 


NOTE: 
The index i of tag(i) 
is not 
top-relative. 
A program 
typically 
uses the "top" 
field of Status 
Word to determine 
which 
tag(i) 
field refers 
to logical 
top of stack. 


TAG VALUES: 
00 = Valid 
01 = Zero 
10 = QNaN, 
SNaN, 
Infinity, 
Denormal 
and Unsupported 
Formats 
11 = Empty 
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EXCEPTION 
FLAGS: 


PRECISION 


UNDERFLOW 


OVERFLOW 


ZERO 
DIVIDE 


DENORt.4ALIZED 
OPERAND 


INVALID 
OPERATION 


ES is set if any unmasked 
exception 
bit is set; cleared 
otherwise. 


See Table 
2.5 for interpretation 
of condition 
code. 
TOP values: 
000 
~ 
Register 
0 is Top 
of Stack 
001 = Register 
1 is Top of Stack 


111 
~ 
Register 
7 is Top 
of Stack 
For definitions 
of exceptions, 
refer 
to the Section 
entitled 


"Exception 
Handling". 


The B bit (Busy, bit 15)is included for 8087 compati- 
bility. The B bit reflects the contents of the ES bit (bit 
7 of the status word). 


BUSY 


TOP 
OF STACK 
POINTER 


The four numeric condition code bits, CO-C3, are 
similar to the flags in EFLAGS. Instructions that per- 
form arithmetic operations update CO-C3 to reflect 
the outcome. The effects of these instructions on 
the condition codes are summarized in Tables 2.5 
through 2.8. 
Bits 13-11 (TOP) point to the FPU register that is 
the current top-of-stack. 
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Instruction 
CO (5) 
I 
C3(Z) 
C1 (A) 
C2(C) 


FPREM, FPREM1 
Three least significant 
bits 
Reduction 
(see Table 2.3) 
of quotient 
0= 
complete 
02 
ao 
01 
orO/U# 
1 = incomplete 


FCOM, FCOMP, 
FCOMPP, FTST, 
Result of comparison 
Zero 
Operand 
is not 
FUCOM, FUCOMP, 
(see Table 2.7) 
orO/U# 
comparable 
FUCOMPP, 
FICOM, 
(Table 2.7) 
FICOMP 


FXAM 
Operand 
class 
Sign 
Operand 
class 
(see Table 2.8) 
orO/U# 
(Table 2.8) 


FCHS, FABS, FXCH, 
FINCTOP, 
FDECTOP, 
Constant 
loads, 
UNDEFINED 
Zero 
UNDEFINED 
FXTRACT, 
FLD, 
orO/U# 
FILD, FBLD, 
FSTP (ext real) 


FIST, FBSTP, 
FRNDINT, 
FST, 
FSTP, FADD, FMUL, 
FDIV, FDIVR, 
UNDEFINED 
Roundup 
UNDEFINED 
FSUB, FSUBR, 
orO/U# 
FSCALE, 
FSORT, 
FPATAN, 
F2XM1, 
FYL2X, FYL2XP1 


FPTAN, FSIN 
Roundup 
Reduction 
FCOS, FSINCOS 
UNDEFINED 
orO/U#, 
0= 
complete 
undefined 
1 = incomplete 
ifC2 = 1 


FLDENV, 
FRSTOR 
Each bit loaded from memory 


FINIT 
Clears these bits 


FLDCW, FSTENV, 
FSTCW, FSTSW, 
UNDEFINED 
FCLEX, FSAVE 


O/U# 
When 
both 
IE and SF bits of status 
word 
are set, indicating 
a stack 
exception, 
this 
bit 
distinguishes 
between 
stack overflow 
(C1 = 1) and underflow 
(C1 = 0). 


Reduction 
If FPREM 
or FPREM1 
produces 
a remainder 
that 
is less than the modulus, 
reduction 
is 
complete. 
When 
reduction 
is incomplete 
the 
value 
at the 
top 
of the 
stack 
is a partial 
remainder, 
which 
can be used as input to further 
reduction. 
For FPTAN, 
FSIN, FCOS, and 
FSINCOS, 
the reduction 
bit is set if the operand 
at the top of the stack is too large. In this 
case the original 
operand 
remains 
at the top of the stack. 


Roundup 
When the PE bit of the status word is set, this bit indicates 
whether 
the last rounding 
in the 
instruction 
was upward. 


UNDEFINED 
Do not rely on finding 
any specific 
value in these 
bits. 


Condition 
Code 
Interpretation 
after FPREM and FPREM1 
C2 
C3 
C1 
CO 


Incomplete 
Reduction: 


1 
X 
X 
X 
further interaction 
required 
for complete 
reduction 


01 
ao 
02 
o MOD8 


0 
0 
0 
0 
0 
1 
0 
1 
Complete 
Reduction: 
1 
0 
0 
2 
0 
1 
1 
0 
3 
CO,C3, C1 contain three least 


0 
0 
1 
4 
significant 
bits of quotient 


0 
1 
1 
5 
1 
0 
1 
6 
1 
1 
1 
7 


Order 
C3 
C2 
CO 


TOP> 
Operand 
0 
0 
0 
TOP < Operand 
0 
0 
1 
TOP = Operand 
1 
0 
0 
Unordered 
1 
1 
1 


C3 
C2 
C1 
CO 
Value at TOP 


0 
0 
0 
0 
+ Unsupported 
0 
0 
0 
1 
+ NaN 
0 
0 
1 
0 
- 
Unsupported 
0 
0 
1 
1 
- 
NaN 
0 
1 
0 
0 
+ Normal 
0 
1 
0 
1 
+ Infinity 
0 
1 
1 
0 
- 
Normal 
0 
1 
1 
1 
- 
Infinity 
1 
0 
0 
0 
+0 
1 
0 
0 
1 
+ Empty 
1 
0 
1 
0 
-0 
1 
0 
1 
1 
- 
Empty 
1 
1 
0 
0 
+ Denormal 
1 
1 
1 
0 
- 
Denormal 
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Bit 7 is the error summary (ES) status bit. The ES bit 
is set if any unmasked exception bit (bits 0-5 in the 
status word) is set; ES is clear otherwise. The 
FERR# (floating point error) signal is asserted when 
ES is set. 


Bit 6 is the stack flag (SF). This bit is used to distin- 
guish invalid operations due to stack overflow or un- 
derflow. When SF is set, bit 9 (C1) distinguishes be- 
tween 
stack 
overflow 
(C1= 1) 
and 
underflow 


(C1=0). 


Table 2.9 shows the six exception flags in bits 0-5 
of the status word. Bits 0-5 are set to indicate that 
the FPU has detected an exception while executing 
an instruction. 


The six exception flags in the status word can be 
individually masked by mask bits in the FPU control 
word. Table 2.9 lists the exception conditions, and 
their causes in order of precedence. Table 2.9 also 
shows the action taken by the FPU if the corre- 
sponding exception flag is masked. 


An exception that is not masked by the control word 
will cause three things to happen: the corresponding 
exception flag in the status word will be set, the ES 
bit in the status word will be set and the FERR# 
output signal will be asserted. When the Intel486 OX 
microprocessor attempts to execute another floating 
point or WAIT instruction, exception 16 occurs or an 
external interrupt happens if the NE= 1 in control 


register O. The exception condition must be resolved 
via an interrupt service routine. The FPU saves the 
address of the floating point instruction that caused 
the exception and the address of any memory oper- 
and required by that instruction in the instruction and 
data pointers (see Section 2.1.3.4). 


Note that when a new value is loaded into the status 
word 
by 
the 
FLDENV 
(load 
environment) 
or 


FRSTOR (restore state) instruction, the value of ES 
(bit 7) and its reflection in the B bit (bit 15) are not 
derived from the values loaded from memory. The 
values of ES and B are dependent upon the values 
of the exception flags in the status word and their 
corresponding masks in the control word. If ES is set 
in such a case, the FERR# output of the Intel486 
OX microprocessor is activated immediately. 


2.1.3.4 Instruction 
and Data Pointers 


Because the FPU operates in parallel with the ALU 
(in the Intel486 OX and Intel486 microprocessors 
the arithmetic and logic unit (ALU) consists of the 
base architecture registers), any errors detected by 
the FPU may be reported after the ALU has execut- 
ed the floating point instruction that caused it. To 
allow identification of the failing numeric instruction, 
the Intel486 OXmicroprocessor contains two pointer 
registers that supply the address of the failing nu- 
meric instruction and the address of its numeric 
memory operand (if appropriate). 


Exception 
Cause 
Default Action 
(If exception Is masked) 


Invalid 
Operation on a signaling NaN, unsupported format, 
Result is a quiet NaN, integer 
Operation 
indeterminate form (0' 00, 0/0, (+ 00) + (- 
00), etc.), or 
indefinite, or BCD indefinite 
stack overflow/underflow (SF is also set). 


Denormalized 
At least one of the operands is denormalized, Le.,it has 
Normal processing 


Operand 
the smallest exponent but a nonzero significand. 
continues 


Zero Divisor 
The divisor is zero while the dividend is a noninfinite, 
Result is 00 
nonzero number. 


Overflow 
The result is too large in magnitude to fit in the specified 
Result is largest finite value 
format. 
or 00 


Underflow 
The true result is nonzero but too small to be 
Result is denormalized or 
represented in the specified format, and, if underflow 
zero 
exception is masked, denormalization causes loss of 
accuracy. 


Inexact 
The true result is not exactly representable in the 
Normal processing 
Result 
specified format (e.g., 1/3); the result is rounded 
continues 
(Precision) 
according to the rounding mode. 


The instruction and data pointers are provided for 
user-written error handlers. These registers are ac- 
cessed 
by 
the 
FLDENV 
(load 
environment), 
FSTENV (store environment), FSAVE (save state) 
and FRSTOR (restore state) instructions. Whenever 
the Intel486 DX microprocessor decodes a new 
floating point instruction, it saves the instruction (in- 
cluding any prefixes that may be present), the ad- 
dress of the operand (if present) and the opcode. 


The instruction and data pointers appear in one of 
four formats depending on the operating mode of 
the Intel486 DX microprocessor (protected mode or 
real-address mode) and depending on the operand- 


size attribute in effect (32-bit operand or 16-bit oper- 
and). When the Intel486 DX microprocessor is in the 
virtual-86 mode, the real address mode formats are 
used. The four formats are shown in Figures 2.10- 
2.13. 
The 
floating 
point 
instructions 
FLDENV, 


FSTENV, FSAVE and FRSTOR are used to transfer 
these values to and from memory. Note that the val- 
ue of the data pointer is undefined if the prior float· 
ing point instruction did not have a memory operand. 


NOTE: 
The operand size attribute is the D bit in a segment 
descriptor. 


32-BIT 
PROTECTED 
MODE FORMAT 
15 
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INSTRUCTION 
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15 ..0 


0000 
I 
INSTRUCTION 
POINTER 
31 ..16 
I 
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OP~ODE 
10 ..0 
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OPERAND 
P~INTER 
15 ..0 


0000 
I 
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POINTER 
31 ..16 
I 
0000 
00000000 
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16-BIT PROTECTEDMODE FORMAT 
15 
7 
0 
16-BIT REAL-ADDRESSMODE AND 
VIRTUAl-BOB6MODE FORMAT 
15 
7 
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CONTROL WORD 
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STATUS WORD 
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Figure 2.12. Protected Mode FPU 
Instruction 
and Data Pointer 
Image In Memory, 16-Blt Format 


Figure 2.13. Real Mode FPU 
Instruction 
and Data Pointer 
Image In Memory, 16-Blt Format 


The FPU provides several processing options that are selected by loading a control word from memory into 
the control register. Figure 2.14 shows the format and encoding of fields in the control word. 


RESERVED 


RESERVED" 


ROUNDING 
CONTROL 


PRECISION 
CONTROL 


EXCEPTION 
MASKS: 


PRECISION 


UNDERFLOW 


OVERFLOW 


ZERO 
DIVIDE 


DENORMALIZED 
OPERAND 


INVALID 
OPERATION 


" "0" AFTER 
RESET 
OR FlNIT; 


CHANGEABLE 
UPON 
LOADING 
THE 
CONTROL 
WORD 
(CW). 
PROGRAMS 


MUST 
IGNORE 
THIS 
BIT. 


Precision Control 
00-24 
bits (single precision) 
01-(reserved) 
10-53 
bits (double precision) 
11-64 
bits (extended precision) 


Rounding Control 
00-Round 
to nearest or even 
01-Round 
down (toward -00) 
10-Round 
up (toward + 00) 
ll-Chop 
(truncate toward zero) 


The low-order byte of the FPU control word config- 
ures the FPU error and exception masking. Bits 0-5 
of the control word contain individual masks for each 
of the six exceptions that the FPU recognizes. 


The high-order byte of the control word configures 
the FPU operating mode, including precision and 
rounding. 
RC (Rounding Control, bits 10-11) 


The RC bits provide for directed rounding and 
true chop, as well as the unbiased round to 
nearest even mode specified in the IEEE stan- 
dard. Rounding control affects only those in- 
structions that perform rounding at the end of 
the operation (and thus can generate a preci- 
sion exception); namely, FST, FSTP, FIST, all 
arithmetic 
instructions 
(except 
FPREM, 
FPREM1, FXTRACT, FABS and FCHS), and all 
transcendental instructions. 
PC (Precision Control, bits 8-9) 
The PC bits can be used to set the FPU internal 
operating precision of the significand at less 
than the default of 64 bits (extended precision). 
This can be useful in providing compatibility with 
early generation arithmetic processors of small- 
er precision. PC affects only the instructions 
ADD, SUB, DIV, MUL, and SORT. For all other 
instructions, either the precision is determined 
by the opcode or extended precision is used. 


2.1.4.1 
Debug 
Registers 


The six programmer accessible debug registers, Fig- 
ure 2.15, provide on-chip support for debugging. De- 
bug registers DRO-3 specify the four linear break- 
points. The Debug control register DR?, is used to 
set the breakpoints and the Debug Status Register, 
DR6, displays the current state of the breakpoints. 
The use of the Debug registers is described in Sec- 
tion 9. 
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2.1.4.2 
Test Registers 


The Intel486 OX microprocessor contains five test 
registers. The test registers are shown in Figure 
2.15. TR6 and TR? are used to control the testing of 
the translation lookaside buffer. TR3, TR4 and TR5 
are used for testing the on-chip cache. The use of 
the test registers is discussed in Section 8. 


There are a few differences regarding the accessibil- 
ity of the registers in Real and Protected Mode. Ta- 
ble 2.10 summarizes these differences. See Section 
4, Protected Mode Architecture, for further details. 
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Use in 
Use In 
Use in 


Register 
Real Mode 
Protected Mode 
Virtual 8086 Mode 


Load 
Store 
Load 
Store 
Load 
Store 


General Registers 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


Segment Register 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


Flag Register 
Yes 
Yes 
Yes 
Yes 
IOPL 
IOPL" 


Control Registers 
Yes 
Yes 
PL = 0 
PL = 0 
No 
Yes 


GDTR 
Yes 
Yes 
PL = 0 
Yes 
No 
Yes 


IDTR 
Yes 
Yes 
PL = 0 
Yes 
No 
Yes 


LDTR 
No 
No 
PL = 0 
Yes 
No 
No 


TR 
No 
No 
PL = 0 
Yes 
No 
No 


FPU Data Registers 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


FPU Control Registers 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


FPU Status Registers 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


FPU Instruction Pointer 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


FPU Data Pointer 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


Debug Registers 
Yes 
Yes 
PL = 0 
PL = 0 
No 
No 


Test Registers 
Yes 
Yes 
PL = 0 
PL = 0 
No 
No 


NOTES: 
PL = 0: The registers 
can be accessed 
only when 
the current 
privilege 
level is zero. 


'IOPL: 
The PUSHF 
and POPF instructions 
are made 
I/O 
Privilege 
Level sensitive 
in Virtual 
86 Mode. 


2.1.6 COMPATIBILITY 


VERY IMPORTANT NOTE: 
COMPATIBILITY WITH FU'tURE PROCESSORS 


In the preceding register descriptions, 
note cer- 
tain 
Intel486 
microprocessor 
register 
bits are 
Intel reserved. 
When reserved 
bits are called 
out, treat them as fully undefined. This is essen- 
tial for your software 
compatibility 
with future 
processorsl 
Follow the guidelines below: 
1) Do not depend on the states of any unde- 
fined bits when testing the values of defined 
register bits. Mask them out when testing. 
2) Do not depend 
on the states of any unde- 
fined bits when storing 
them to memory or 
another register. 


3) Do not depend on the ability to retain Infor- 
mation written into any undefined bits. 
4) When loading registers always load the unde- 


fined bits as zeros. 
5) However, 
registers 
which 
have been previ- 
ously stored may be reloaded without 
mask- 
ing. 


Depending upon the values of undefined 
regis- 
ter bits will make your software dependent upon 
the 
unspecified 
Intel486 
OX 
microprocessor 
handling of these bits. Depending on undefined 
values risks making your software 
Incompatible 
with future 
processors 
that define 
usages for 
the 
Intel486 
microprocessor-undefined 
bits. 
AVOID ANY SOFTWARE DEPENDENCE UPON 
THE STATE OF UNDEFINED Intel486 
MICRO- 


PROCESSOR REGISTER BITS. 


The Intel486 DX microprocessor instruction set can 
be divided into 11 categories of operations: 


Data Transfer 
Arithmetic 
Shift/Rotate 
String Manipulation 
Bit Manipulation 
Control Transfer 
High Level Language Support 
Operating System Support 
Processor Control 
Floating Point 
Floating Point Control 


The Intel486 DX microprocessor instructions are list- 
ed in Section 10. Note that all floating point unit in- 
struction mnemonics begin with an F. 


All Intel486 DX microprocessor instructions operate 
on either 0, 1, 2 or 3 operands; where an operand 
resides in a register, in the instruction itself or in 
memory. Most zero operand instructions (e.g., CLI, 
STI) take only one byte. One operand instructions 
generally are two bytes long. The average instruc- 
tion is 3.2 bytes long. Since the Intel486 DX micro- 
processor has a 32-byte instruction queue, an aver- 
age of 10 instructions will be prefetched. The use of 
two operands permits the following types of com- 
mon instructions: 


Register to Register 
Memory to Register 
Memory to Memory 
Immediate to Register 
Register to Memory 
Immediate to Memory 


The operands can be either 8, 16, or 32 bits long. As 
a general rule, when executing code written for the 
Intel486 DX, Intel486 or Intel386 microprocessors 
(32-bit code), operands are 8 or 32 bits; when exe- 
cuting existing 80286 or 8086 code (16-bit code), 
operands are 8 or 16 bits. Prefixes can be added to 
all instructions which override the default length of 
the operands (Le., use 32-bit operands for 16-bit 
code, or 16·bit operands for 32-bit code). 


Memory on the Intel486 DX microprocessor is divid- 
ed up into 8-bit quantities (bytes), 16-bit quantities 
(words), and 32-bit quantities (dwords). Words are 
stored in two consecutive bytes in memory with the 
low-order byte at the lowest address, the high order 


byte at the high address. Dwords are stored in four 
consecutive bytes in memory with the low-order byte 
at the lowest address, the high-order byte at the 
highest address. The address of a word or dword is 
the byte address of the low-order byte. 


In addition to these basic data types, the Intel486 
DX microprocessor supports two larger units of 
memory: pages and segments. Memory can be di- 
vided up into one or more variable length segments, 
which can be swapped to disk or shared between 
programs. Memory can also be organized into one 
or more 4 Kbyte pages. Finally, both segmentation 
and paging can be combined, gaining the advan- 
tages of both systems. The Intel486 DX microproc- 
essor supports both pages and segments in order to 
provide maximum flexibility to the system designer. 
Segmentation and paging are complementary. Seg- 2 
mentation is useful for organizing memory in logical 
modules, and as such is a tool for the application 
programmer, while pages are useful for the system 
programmer for managing the physical memory of a 
system. 


The Intel486 DX microprocessor has three distinct 
address spaces: logical, 
linear, 
and physical. 
A 


logical address (also known as a virtual address) 
consists of a selector and an offset. A selector is the 
contents of a segment register. An offset is formed 
by summing all of the addressing components 
(BASE, INDEX, DISPLACEMENT)discussed in Sec- 
tion 2.5.3 Memory 
Addressing 
Modes 
into an ef- 


fective address. Since each task on the Intel486 DX 
microprocessor has a maximum of 16K (214 -1) 
se- 


lectors, and offsets can be 4 gigabytes, (232 bits) 
this gives a total of 246 bits or 64 terabytes of logi- 
cal address space per task. The programmer sees 
this virtual address space. 


The segmentation unit translates the logical 
ad- 


dress space into a 32-bit linear address space. If the 
paging unit is not enabled then the 32·bit linear ad- 
dress corresponds to the physical 
address. The 


paging unit translates the linear address space into 
the physical address space. The physical 
address 


is what appears on the address pins. 


The primary difference between Real Mode and Pro- 
tected Mode is how the segmentation unit performs 
the translation of the logical address into the linear 
address. In Real Mode, the segmentation unit shifts 
the selector left four bits and adds the result to the 
offset to form the linear address. While in Protected 
Mode every selector has a linear base address as- 
sociated with it. The linear base address is stored in 
one of two operating system tables (Le., the Local 
Descriptor Table or Global Descriptor Table). The 
selector's linear base address is added to the offset 
to form the final linear address. 


EFFECTIVE 


ADDRESS 
LOGICAL 
OR 
VIRTUAL 
ADDRESS 
13 


DESCRIPTOR 
INDEX 
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32 


LINEAR 


ADDRESS 


32 


PHYSICAL 
ADDRESS 


Figure 2.16. Address 
Translation 


and create a system with a four gigabyte linear ad- 
dress space. This creates a system where the virtual 
address space is the same as the linear address 
space. Further details of segmentation are dis- 
cussed in Section 4.1. 


Figure 2.16 shows the relationship between the vari- 
ous address spaces. 


The main data structure used to organize memory is 
the segment. On the Intel486 OX microprocessor, 
segments are variable sized blocks of linear ad- 
dresses which have certain attributes associated 
with them. There are two main types of segments: 
code and data, the segments are of variable size 
and can be as small as 1 byte or as large as 4 giga- 
bytes (232 bytes). 


In order to provide compact instruction encoding, 
and increase processor performance, instructions 
do not need to explicitly specify which segment reg- 
ister is used. A default segment register is automati- 
cally chosen according to the rules of Table 2.11 
(Segment Register Selection Rules). In general, data 
references use the selector contained in the OSreg- 
ister; Stack references use the SS register and In- 
struction fetches use the CS register. The contents 
of the Instruction Pointer provide the offset. Special 
segment override prefixes allow the explicit use of a 
given segment register, and override the implicit 
rules listed in Table 2.11. The override prefixes also 
allow the use of the ES, FS and GS segment regis- 
ters. 


There are no restrictions regarding the overlapping 
of the base addresses of any segments. Thus, all 6 
segments could have the base address set to zero 


2.4 
I/O Space 


The Intel486 OX microprocessor has two distinct 
physical address spaces: Memory and I/O. General- 
ly, peripherals are placed in I/O space although the 
Intel486 OX microprocessor also supports memory- 
mapped peripherals. The I/O 
space consists of 
64 Kbytes, it can be divided into 64K 8-bit po~s, 3.2K 
16-bit ports, or 16K 32-bit ports, or any combination 
of ports which add up to less than 64 Kbytes. The 
64K I/O address space refers to physical memory 
rather than linear address since I/O instructions do 
not go through the segmentation or paging hard- 
ware. The M/IO# 
pin acts as an additional address 
line thus allowing the system designer to easily de- 
termine which address space the processor is ac- 
cessing. 


The I/O ports are accessed via the IN and OUT I/O 
instructions, with the port address supplied as an 
immediate 8-bit constant in the instruction or in the 
OXregister. All 8- and 16-bit port addresses are zero 
extended on the upper address lines. The I/O in- 
structions cause the M/IO# 
pin to be driven low. 


I/O port addresses OOF8Hthrough OOFFHare re- 
served for use by Intel. 
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Type of 
Implied (Default) 
Segment 
Override 
Memory 
Reference 
Segment 
Use 
Prefixes 
Possible 


Code Fetch 
CS 
None 


Destination 
of PUSH, PUSHF, INT, 
SS 
None 
CALL, PUSHA Instructions 


Source of POP, POPA, POPF, 
SS 
None 
IRET, RET instructions 


Destination 
of STOS, MOVS, REP 
ES 
None 
STOS, REP MOVS Instructions 
(01 is Base Register) 
. 


Other Data References, 
with 
Effective 
Address 
Using Base 
Register 
of: 
[EAX) 
OS 
[EBX) 
OS 
[ECX) 
OS 
[EDX) 
OS 
All 
[ESI) 
OS 
[EDI) 
OS 
[EBP) 
SS 
[ESP) 
SS 


The Intel486 
OX microprocessor 
provides 
a total of 
11 addressing 
modes 
for instructions 
to specify 
op- 
erands. 
The addressing 
modes 
are optimized 
to al- 
low the efficient 
execution 
of high level 
languages 
such as C and FORTRAN, 
and they cover the vast 
majority of data references 
needed 
by high-level 
lan- 
guages. 


Two 
of the 
addressing 
modes 
provide 
for instruc- 
tions 
that 
operate 
on register 
or immediate 
oper- 
ands: 


Register 
Operand 
Mode: The operand 
is located 
in 
one of the 8-, 16- or 32-bit general 
registers. 


Immediate 
Operand 
Mode: The operand 
is includ- 
ed in the instruction 
as part of the opcode. 


The 
remaining 
9 modes 
provide 
a mechanism 
for 


specifying 
the effective 
address 
of an operand. 
The 


linear address consists 
of two components: 
the seg- 


ment 
base address 
and an effective 
address. 
The 


effective 
address 
is calculated 
by using 
combina- 


tions of the following 
four address 
elements: 


DISPLACEMENT: 
An 8-, or 32-bit immediate 
value, 


following 
the instruction. 


BASE: The contents 
of any general 
purpose 
regis- 


ter. The base registers 
are generally 
used by compil- 


ers to point to the start of the local variable 
area. 


INDEX: The contents 
of any general 
purpose 
regis- 


ter except 
for ESP. The index registers 
are used to 
access the elements 
of an array, or a string of char- 
acters. 


SCALE: The index register's 
value can be multiplied 
by a scale 
factor, 
either 
1, 2, 4 or 8. Scaled 
index 


mode is especially useful for accessing arrays or 
structures. 


Combinations of these 4 components make up the 9 
additional addressing modes. There is no perform- 
ance penalty for using any of these addressing com- 
binations, since the effective address calculation is 
pipelined with the execution of other instructions. 
The one exception is the simultaneous use of Base 
and Index components which requires one addition- 
al clock. 


As shown in Figure 2.17, the effective address (EA) 
of an operand is calculated according to the follow- 
ing formula. 


Direct Mode: The operand's offset is contained as 
part of the instruction as an 8·, 16- or 32-bit dis- 
placement. 
EXAMPLE: 
INC Word 
PTR [500] 


Register Indirect Mode: A BASE register contains 
the address of the operand. 
EXAMPLE: 
MOV 
[ECX], 
EDX 


Based Mode: A BASE register's contents is added 
to a DISPLACEMENT to form the operand's offset. 
EXAMPLE: 
MOV ECX, [EAX + 24] 


Index Mode: An INDEX register's contents is added 
to a DISPLACEMENTto form the operand's offset. 
EXAMPLE: 
ADD EAX, TABLE[ESI] 


Scaled Index Mode: An INDEX register's contents is 
multiplied by a scaling factor which is added to a 
DISPLACEMENTto form the operand's offset. 
EXAMPLE: 
IMUL EBX, TABLE[ES'*4],7 


Based Index Mode: The contents of a BASE register 
is added to the contents of an INDEX register to 
form the effective address of an operand. 
EXAMPLE: 
MOV EAX, [ESI] 
[EBX] 


Based Scaled Index Mode: The contents of an IN· 
DEX register is multiplied by a SCALING factor and 
the result is added to the contents of a BASE regis- 
ter to obtain the operand's offset. 
EXAMPLE: 
MOV ECX, [EDX*S] 
[EAX] 


SS 
GS 
rs 


ES 


DS 
--cs 


SS 


GS 
rs 


ES 


DS 


ACCESS 
RIGHTS 
CS 


L1t.11T 


BASE 
ADDRESS 


SEGt.lENT 
L1t.11T 
/ 
'\ 
LINEAR 


ADDRESS 
• 
TARGET 
ADDRESS 


------~ 


./ 


SEGt.lENT 
BASE 
ADDRESS 


Based Index Mode with Displacement: The contents 
of an INDEX Register and a BASE register's con- 
tents and a DISPLACEMENT are all summed to- 
gether to form the operand offset. 
EXAMPLE: ADD EDX, [ESI] [EBP+OOFFFFFOH] 


Based Scaled Index Mode with Displacement: The 
contents of an INDEX register are multiplied by a 
SCALING factor, the result is added to the contents 
of a BASE register and a DISPLACEMENT to form 
the operand's offset. 
EXAMPLE: 
MOV 
EAX, 
LOCALTABLE[EDI*4] 
[EBP+80] 


2.5.4 DIFFERENCES BETWEEN 16- AND 32-BIT 
ADDRESSES 


In order to provide software compatibility with the 
80286 and the 8086, the Intel486 OX Microproces- 
sor can execute 16-bit instructions in Real and Pro- 
tected Modes. The processor determines the size of 
the instructions it is executing by examining the 0 bit 
in the CS segment Descriptor. If the 0 bit is 0 then 
all operand lengths and effective addresses are as- 
sumed to be 16 bits long. If the 0 bit is 1 then the 
default length for operands and addresses is 32 bits. 
In Real Mode the default size for operands and ad- 
dresses is 16 bits. 


Regardless of the default precision of the operands 
or addresses, the Intel486 OX Microprocessor is 
able to execute either 16- or 32-bit instructions. This 
is specified via the use of override prefixes. Two pre- 
fixes, the Operand Size Prefix and the Address 
Length Prefix, override the value of the 0 bit on an 
individual instruction basis. These prefixes are auto- 
matically added by Intel assemblers. 


Example: The processor is executing in Real Mode 
and the programmer needs to access the EAX regis- 
ters. The assembler code for this might be MOV 
EAX, 32-bit MEMORYOP, ASM486 Macro Assem- 
bler automatically determines that an Operand Size 
Prefix is needed and generates it. 


Example: The 0 bit is 0, and the programmer wishes 
to use Scaled Index addressing mode to access an 
array. The Address Length Prefix allows the use of 
MOV OX, TABLE[ESI*2]. The assembler uses an 


Address Length Prefix since, with 0 = 0, the default 
addressing mode is 16 bits. 


Example: The 0 bit is 1, and the program wants to 
store a 16-bit quantity. The Operand Length Prefix is 
used to specify only a 16-bit value; MOV MEM16, 
OX. 


The OPERAND LENGTH and Address Length Pre- 
fixes can be applied separately or in combination to 
any instruction. The Address Length Prefix does not 
allow addresses over 64 Kbytes to be accessed in 
Real Mode. A memory address which exceeds 
FFFFH will result in a General Protection Fault. An 
Address Length Prefix only allows the use of the ad- 
ditional 
Intel486 
OX Microprocessor 
addressing 
modes. 


When executing 32-bit code, the Intel486 OX Micro- 
processor uses either 8-, or 32-bit displacements, 
and any register can be used as base or index regis- 
ters. When executing 16-bit code, the displacements 
are either 8, or 16 bits, and the base and index regis- 
ter conform to the 80286 model. Table 2.12 illus- 
trates the differences. 


The Intel486 OX Microprocessor can support a wide 
variety of data types. In the following descriptions, 
the on-chip floating point unit (FPU) consists of the 
floating point registers. The central processing unit 
(CPU) consists of the base architecture registers. 


2.6.1.1 Unsigned Data Types 


The FPU does not support unsigned data types. Re- 
fer to Table 2.13. 
Byte: 
Unsigned 8-bit quantity 
Word: 
Unsigned 16-bit quantity 
Dword: Unsigned 32-bit quantity 


The least significant bit (LSB) in a byte is bit 0, and 
the most significant bit is 7. 


16·Bit Addressing 
32·Bit Addressing 


BASE REGISTER 
BX,BP 
Any 32-bit GP Register 
INDEX REGISTER 
SI,DI 
Any 32-bit GP Register 
Except ESP 
SCALE FACTOR 
none 
1,2,4,8 
DISPLACEMENT 
0,8,16 bits 
0,8,32 bits 


•• 


2.6.1.2 
Signed 
Data Types 


All signed data types assume 2's complement nota- 
tion. The signed data types contain two fields, a sign 
bit and a magnitude. The sign bit is the most signifi- 
cant bit (MSB). The number is negative if the sign bit 
is 1. If the sign bit is 0, the number is positive. The 
magnitude field consists of the remaining bits in the 
number. Refer to Table 2.13. 
8·bit Integer: 
Signed 8-bit quantity 
16-bit Integer: Signed 16·bit quantity 
32-bit Integer: Signed 32·bit quantity 
64-bit Integer: Signed 64·bit quantity 


The FPU only supports 16-, 32- and 64·bit integers. 
The CPU only supports 8-, 16- and 32-bit integers. 


2.6.1.3 
Floating 
Point Data Types 


Ftoating point data type in the Intel486 OX micro- 
processor contain three fields, sign, significand and 
exponent. The sign field is one bit and is the MSB of 
the floating point number. The number is negative if 
the sign bit is 1. If the sign bit is 0, the number is 
positive. The significand gives the significant bits of 
the number. The exponent field contains the power 
of 2 needed to scale the significand. Refer to Table 
2.13. 


Only the FPU supports floating point data types. 
Single Precision Real: 
23-bit significand and 8- 
bit exponent. 32 bits total. 


Double Precision Real: 
52-bit significand and 11- 
bit exponent. 64 bits total. 


Extended Precision Real: 64-bit significand and 15- 
bit exponent. 80 bits total. 
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2.6.1.4 
BCD Data Types 


The Intel486 OX microprocessor supports packed 
and unpacked binary coded decimal (BCD) data 
types. A packed BCD data type contains two digits 
per byte, the lower digit is in bits 0-3 and the upper 
digit in bits 4-7. An unpacked BCD data type con- 
tains 1 digit per byte stored in bits 0-3. 


The CPU supports 8-bit packed and unpacked BCD 
data types. The FPU only supports 80-bit packed 
BCD data types. Refer to Table 2.13. 


2.6.1.5 
String Data Types 


A string data type is a contiguous sequence of bits, 
bytes, words or dwords. A string may contain be- 
tween 1 byte and 4 Gbytes. Refer to Table 2.14. 


String data types are only supported by the CPU. 


Dword String: Contiguous sequence of dwords. 


Bit String: A set of contiguous bits. In the Intel486 
OXmicroprocessor bit strings can be up to 4 gigabits 
long. 


2.6.1.6 
ASCII 
Data Types 


The Intel486 OX microprocessor supports ASCII 
(American Standard Code for 
Information Inter- 
change) strings and can perform arithmetic opera- 
tions (such as addition and division) on ASCII data. 
Refer to Table 2.14. 


Table 2.13. Intel486™ OX Microprocessor 
Data Types 


Supported 
by 
Supported 
by 
sa•• Reglatera FPU 
Leeot 
Significant 
Byte 
J, J, 
J, 


DataFormat 
Range 
Precision 
7 


Byte 
X 
0-255 
8 bits 


15 


Word 
X 
0-64K 
16 bits 


31 


Dword 
X 
0-4G 
32 bits 


0• 


102 
Two's 


8-Bit Integer 
X 
8 bits 
Complement 


Sign Bit t 


15 


16-Bit Integer 
X X 
104 
16 bits 
Two's 
Complement 


Sign sn t 


31 


32·Bit Integer 
X X 
109 
32 bits 
Two's 
Complement 


Sign Bit t 


63 


64-Bit Integer 
X 
1019 
64 bits 
Two's 
Complement 


SignSit 
t 


8·Bit Unpacked 
BCD 
X 
0-9 
1 Digit 
One 
BCD Digit per Byte 


31 
23 


Biased 


Exp. 


63 
52 


Biased 


Exp. 
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A+N 
CJ 


A+l 
A 
J1TOI 
~ 


A+ 268.435,455 
J, 
A+3 
A+2 
Bit 
r--r-T}} 
String~(( 
7 
t 
+2,147,483,647 


A+2N+l 
A+2N 
A+3 
A+2 
A+l 
A 


115 
I 


01 
115 
0115 
01 


N 


A+7 
A+6 
A+5 
A+4 
A+3 
A+2 
A+1 
A 


131 
0131 
01 


A-266,435,456 


A+l 
A 
A-l 
A-2 
A-3 
J, 


0 llED t 
-2.147,483,648 


ASCII Data Types 
D 


A+4N+3 
A+4N+2 
A+4N+1 
A+4N 


Dwordl 
~ 
I 
String 31 
0 


2.6.1.7 Pointer Data Types 


A pointer data type contains a value that gives the 
address of a piece of data. The Intel486 OX micro- 
processor supports two types of pointers. Refer to 
Table 2.15. 


Table 2.15. Pointer Data Types 


Lout 
SIg Byte 
J, 


47 


Selector 


2.6.2 
LITTLE 
ENDIAN 
vs BIG ENDIAN 
DATA 
FORMATS 


The Intel486 DX microprocessor, as well as all other 
members of the 86 architecture use the "Iittle-endi- 
an" method for storing data types that are larger 
than one byte. Words are stored in two consecutive 
bytes in memory with the low-order byte at the low- 
est address and the high order byte at the high ad- 
dress. Dwords are stored in four consecutive bytes 
in memory with the low-order byte at the lowest ad- 
dress and the high order byte at the highest address. 
The address of a word or dword data item is the byte 
address of the low-order byte. 


Figure 2.18 illustrates the differences between the 
big-endian and Iittle-endian formats for dwords. The 
32 bits of data are shown with the low order bit num- 
bered bit 0 and the high order bit numbered 32. Big- 
endian data is stored with the high-order bits at the 
lowest addressed byte. Little-endian data is stored 
with the high-order bits in the highest addressed 
byte. 


The Intel486 DX microprocessor has two instruc- 
tions which can convert 16- or 32-bit data between 
the two byte orderings. BSWAP (byte swap) handles 
four byte values and XCHG (exchange) handles two 
byte values. 


Interrupts and exceptions alter the normal program 
flow, in order to handle external events, to report 
errors or exceptional conditions. The difference be- 
tween interrupts and exceptions is that interrupts are 
used to handle asynchronous external events while 
exceptions handle instruction faults. Although a pro- 
gram can generate a software interrupt via an INT N 
instruction, the processor treats software interrupts 
as exceptions. 


Hardware interrupts occur as the result of an exter- 
nal event and are classified into two types: maskable 
or non-maskable. Interrupts are serviced after the 
execution of the current instruction. After the inter- 
rupt handler is finished servicing the interrupt, exe- 
cution proceeds with the instruction immediately af· 
ter the interrupted instruction. Sections 2.7.3 and 
2.7.4 discuss the differences between Maskable and 
Non-Maskable interrupts. 


Exceptions are classified as faults, traps, or aborts 
depending on the way they are reported, and wheth- 
er or not restart of the instruction causing the excep- 
tion is supported. Faults are exceptions that are de- 
tected and serviced before 
the execution of the 
faulting instruction. A fault would occur in a virtual 
memory system, when the processor referenced a 
page or a segment which was not present. The oper- 
ating system would fetch the page or segment from 
disk, and then the Intel486 DX microprocessor 
would restart the instruction. Traps are exceptions 
that are reported immediately after the execution of 
the instruction which caused the problem. User de- 
fined interrupts are examples of traps. Aborts 
are 
exceptions which do not permit the precise location 
of the instruction causing the exception to be deter- 
mined. Aborts are used to report severe errors, such 
as a hardware error, or illegal values in system ta- 
bles. 


Thus, when an interrupt service routine has been 
completed, execution proceeds from the instruction 
immediately following the interrupted instruction. On 
the other hand, the return address from an excep- 
tion fault routine will always point at the instruction 
causing the exception and include any leading in- 
struction prefixes. Table 2.16 summarizes the possi- 
ble interrupts for the Intel486 DX microprocessor 
and shows where the return address points. 


The Intel486 DX microprocessor has the ability to 
handle up to 256 different interrupts/exceptions. In 
order to service the interrupts, a table with up to 256 
interrupt vectors must be defined. The interrupt vec- 
tors are simply pointers to the appropriate interrupt 
service routine. In Real Mode (see Section 3.1), the 
vectors are 4 byte quantities, a Code Segment plus 
a 16-bit offset; in Protected Mode, the interrupt vec- 
tors are 8 byte quantities, which are put in an Inter- 
rupt Descriptor Table (see Section 4.3.3.4). Of the 
256 possible interrupts, 32 are reserved for use by 
Intel, the remaining 224 are free to be used by the 
system designer. 


When an interrupt occurs the following actions hap- 
pen. First, the current program address and the 
Flags are saved on the stack to allow resumption of 
the interrupted program. Next, an 8-bit vector is sup- 


• 


plied to the Intel486 
DX microprocessor 
which 
iden- 
tifies the appropriate 
entry in the interrupt 
table. The 
table 
contains 
the starting 
address 
of the interrupt 
service 
routine. 
Then, 
the 
user 
supplied 
interrupt 
service 
routine 
is executed. 
Finally, 
when 
an IRET 
instruction 
is executed 
the old processor 
state is re- 
stored and program 
execution 
resumes 
at the appro- 
priate instruction. 


The 8-bit interrupt 
vector 
is supplied 
to the Intel486 
DX microprocessor 
in several 
different 
ways: excep- 
tions 
supply 
the interrupt 
vector 
internally; 
software 
INT instructions 
contain 
or imply the vector; 
maska- 
ble hardware 
interrupts 
supply 
the 8-bit vector 
via 
the 
interrupt 
acknowledge 
bus 
sequence. 
Non- 
Maskable 
hardware 
interrupts 
are assigned 
to inter- 
rupt vector 
2. 


Maskable 
interrupts 
are the most common 
way used 
by the 
Intel486 
DX microprocessor 
to respond 
to 
asynchronous 
external 
hardware 
events. 
A 
hard- 
ware interrupt 
occurs 
when the INTR is pulled 
high 
and the Interrupt 
Flag bit (IF) is enabled. 
The proc- 


essor 
only 
responds 
to interrupts 
between 
instruc- 
tions, (REPeat String instructions, 
have an "interrupt 
window", 
between 
memory 
moves, 
which 
allows 
in- 
terrupts 
during long string moves). When an interrupt 
occurs 
the processor 
reads an 8-bit vector 
supplied 
by the hardware 
which 
identifies 
the source 
of the 
interrupt, 
(one of 224 user defined 
interrupts). 
The 
exact nature of the interrupt 
sequence 
is discussed 
in Section 
7.2.10. 


Instruction 
Which 
Return 
Address 


Function 
Interrupt 
Can Cause 
Points 
to 
Type 
Number 
Exception 
Faulting 
Instruction 


Divide Error 
0 
DIV,IDIV 
YES 
FAULT 


Debug Exception 
1 
Any Instruction 
YES 
TRAp· 


NMllnterrupt 
2 
INT20rNMI 
NO 
NMI 


One Byte Interrupt 
3 
INT 
NO 
TRAP 


Interrupt 
on Overflow 
4 
INTO 
NO 
TRAP 


Array Bounds Check 
5 
BOUND 
YES 
FAULT 


Invalid OP-Code 
6 
Any Illegal Instruction 
YES 
FAULT 


Device Not Available 
7 
ESC, WAIT 
YES 
FAULT 


Double Fault 
8 
Any Instruction 
That Can 
ABORT 
Generate 
an Exception 


Intel Reserved 
9 


InvalidTSS 
10 
JMP, CALL, IRET, INT 
YES 
FAULT 


Segment 
Not Present 
11 
Segment 
Register 
Instructions 
YES 
FAULT 


Stack Fault 
12 
Stack References 
YES 
FAULT 


General 
Protection 
Fault 
13 
Any Memory 
Reference 
YES 
FAULT 


Page Fault 
14 
Any Memory Access 
or Code Fetch 
YES 
FAULT 


Intel Reserved 
15 


Floating 
Point Error 
16 
Floating 
Point, WAIT 
YES 
FAULT 


Alignment 
Check Interrupt 
17 
Unaligned 
Memory Access 
YES 
FAULT 


Intel Reserved 
18-31 


Two Byte Interrupt 
0-255 
INTn 
NO 
TRAP 
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The IF bit in the EFLAG registers is reset when an 
interrupt is being serviced. This effectively disables 
servicing additional interrupts during an interrupt 
service routine. However, the IF may be set explicitly 
by the interrupt handler, to allow the nesting of inter- 
rupts. When an IRET instruction is executed the 
original state of the IF is restored. 


Non-maskable interrupts provide a method of servic- 
ing very high priority interrupts. A common example 
of the use of a non-maskable interrupt (NMI) would 
be to activate a power failure routine. When the NMI 
input is pulled high it causes an interrupt with an 
internally supplied vector value of 2. Unlike a normal 
hardware interrupt, no interrupt acknowledgment se- 
quence is performed for an NMI. 


While executing the NMI servicing procedure, the In- 
tel486 OX microprocessor will not service further 
NMI requests until an interrupt return (IRET) instruc- 
tion is executed or the processor is reset. If NMI 
occurs while currently servicing an NMI, its presence 
will be saved for servicing after executing the first 
IRET instruction. The IF bit is cleared at the begin- 
ning of an NMI interrupt to inhibit further INTR inter- 
rupts. 


A third type of interrupt/exception for the Intel486 
OX microprocessor is the software interrupt. An INT 
n instruction causes the processor to execute the 
interrupt service routine pointed to by the nth vector 
in the interrupt table. 


A special case of the two byte software interrupt INT 
n is the one byte INT 3, or breakpoint interrupt. By 
inserting this one byte instruction in a program, the 
user can set breakpoints in his program as a debug- 
ging tool. 


A final type of software interrupt is the single step 
interrupt. It is discussed in Section 9.2. 


2.7.6 
INTERRUPT 
AND EXCEPTION 


PRIORITIES 


Interrupts are externally-generated events. Maska- 
ble Interrupts (on the INTR input) and Non-Maskable 
Interrupts (on the NMI input) are recognized at in- 
struction 
boundaries. When 
NMI and 
maskable 


INTR are both recognized at the same instruction 
boundary, the Intel486 OX microprocessor invokes 
the NMI service routine first. If, after the NMI service 
routine has been invoked, maskable interrupts are 
still enabled, then the Intel486 OX microprocessor 
will invoke the appropriate interrupt service routine. 


Table 2.17a.lnteI486TM 
DX Microprocessor 


Priority for Invoking 
Service 
Routines 
In 


~ 
_c__ 
as_e_O_f _S_lm_U_I_ta_n_e_O_U_S_E_x_t_e_rn_a_I_I_nt_e_r_ru_p_t_s~I. 
1. NMI 
2.INTR 


Exceptions are internally-generated events. Excep- 
tions are detected by the Intel486 OX microproces- 
sor if, in the course of executing an instruction, the 
Intel486 OX microprocessor detects a problematic 
condition. The Intel486 OX microprocessor then im- 
mediately invokes the appropriate exception service 
routine. The state of the Intel486 OXmicroprocessor 
is such that the instruction causing the exception 
can be restarted. If the exception service routine has 
taken care of the problematic condition, the instruc- 
tion will execute without causing the same excep- 
tion. 


It is possible for a single instruction to generate sev- 
eral exceptions (for example, transferring a single 
operand could generate two page faults if the oper- 
and location spans two "not present" pages). How- 
ever, only one exception is generated upon each at- 
tempt to execute the instruction. Each exception 
service routine should correct its corresponding ex- 
ception, and restart the instruction. In this manner, 
exceptions are serviced until the instruction exe- 
cutes successfully. 


As the Intel486 OXmicroprocessor executes instruc- 
tions, it follows a consistent cycle in checking for 
exceptions, as shown in Table 2.17b. This cycle is 
repeated as each instruction is executed, and oc- 
curs in parallel with instruction decoding and execu- 
tion. 


Consider 
the case of the Intel486 
DX microproc- 
essor 
having 
just 
completed 
an 
instruction. 
It 
then performs 
the following 
checks 
before reach- 
ing the point where 
the next instruction 
is com- 
pleted: 


1. Check for Exception 
1 Traps from the instruc- 
tion just completed 
(single-step 
via Trap Flag, 
or Data Breakpoints 
set in the Debug 
Regis- 
ters). 


2. Check 
for Exception 
1 Faults 
in the next in- 
struction 
(Instruction 
Execution 
Breakpoint 
set 
in the 
Debug 
Registers 
for the 
next 
in- 
struction). 


3. Check 
for external 
NMI and INTR. 


4. Check 
for Segmentation 
Faults that 
prevent- 
ed fetching 
the entire next instruction 
(excep- 
tions 
11 or 13). 


5. Check for Page Faults that prevented 
fetching 
the entire 
next instruction 
(exception 
14). 


6. Check for Faults decoding 
the next instruction 
(exception 
6 if illegal opcode; 
exception 
6 if in 
Real 
Mode 
or in Virtual 
8086 
Mode 
and at- 
tempting 
to execute 
an instruction 
for Protect- 
ed Mode 
only (see Section 
4.6.4); 
or excep- 
tion 13 if instruction 
is longer than 15 bytes, or 
privilege 
violation 
in Protected 
Mode (Le., not 
at IOPL or at CPL=O). 


7. 
If WAIT 
opcode, 
check 
if TS = 1 and MP = 1 
(exception 
7 if both are 1). 


8. 
If opcode 
for 
Floating 
Point 
Unit, 
check 
if 
EM = 1 or TS = 1 (exception 
7 if either are 1). 


9. 
If opcode 
for Floating 
Point Unit (FPU), check 
FPU error status 
(exception 
16 if error status 
is asserted). 


10. Check in the following 
order for each memo- 
ry reference 
required 
by the instruction: 


a. Check 
for Segmentation 
Faults that pre- 
vent transferring 
the entire memory 
quan- 
tity (exceptions 
11, 12, 13). 


b. Check for Page Faults that prevent 
trans- 
ferring 
the 
entire 
memory 
quantity 
(ex- 
ception 
14). 


NOTE: 
The 
order 
stated 
supports 
the 
concept 
of the 
paging 
mechanism 
being "underneath" 
the seg- 
mentation 
mechanism. 
Therefore, 
for any given 
code 
or data 
reference 
in memory, 
segmenta- 
tion exceptions 
are generated 
before 
paging 
ex- 
ceptions 
are generated. 
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The 
Intel486 
DX microprocessor 
fully 
supports 
re- 
starting 
all instructions 
after faults. 
If an exception 
is 
detected 
in the instruction 
to be executed 
(exception 
categories 
4 through 
10 in Table 2.17b), the Intel486 
DX microprocessor 
invokes 
the appropriate 
excep- 
tion service routine. The Intel486 
DX microprocessor 
is in a state that permits 
restart of the instruction, 
for 
all cases but those in Table 2.17c. Note that all such 
cases 
are easily 
avoided 
by proper 
design 
of the 
operating 
system. 


Table 
2.17c. Conditions 
Preventing 
Instruction 
Restart 


An 
instruction 
causes 
a task 
switch 
to a task 
whose 
Task 
State 
Segment 
is 
partially 
"not 
present". 
(An entirely 
"not 
present" 
TSS is re- 
startable.) 
Partially 
present 
TSS's 
can be avoid- 
ed either 
by keeping 
the 
TSS's 
of such 
tasks 
present 
in memory, 
or by aligning 
TSS segments 
to reside entirely within a single 4K page (for TSS 
segments 
of 4 Kbytes or less). 


NOTE: 
These 
conditions 
are avoided 
by using the oper- 
ating system 
designs 
mentioned 
in this table. 


A Double 
Fault (exception 
8) results when the proc- 
essor 
attempts 
to invoke 
an exception 
service 
rou- 
tine for the segment 
exceptions 
(10, 11, 12 or 13), 


but in the process 
of doing so, detects 
an exception 
other than a Page Fault (exception 
14). 


A Double 
Fault (exception 
8) will also be generated 
when 
the 
processor 
attempts 
to 
invoke 
the 
Page 
Fault (exception 
14) service 
routine, 
and detects 
an 
exception 
other 
than 
a second 
Page 
Fault. 
In any 
functional 
system, 
the entire Page Fault service 
rou- 
tine must remain 
"present" 
in memory. 


When a Double Fault occurs, the Intel486 
DX micro- 
processor 
invokes 
the exception 
service 
routine 
for 
exception 
8. 


Several 
interrupt 
vectors 
of the 
Intel486 
DX micro- 
processor 
are used to report exceptional 
conditions 
while 
executing 
numeric 
programs 
in either 
real or 
protected 
mode. Table 
2.18 shows 
these 
interrupts 
and their causes. 


Interrupt 
Cause of Interrupt 
Number 


7 
A Floating 
Point instruction 
was encountered 
when EM or TS of the Intel486 
OX processor 
control 
register zero (CRO) was set. EM = 1 indicates 
that software 
emulation 
of the 
instruction 
is required. When TS is set, either a Floating Point or WAIT instruction 
causes 
interrupt 
7. This indicates 
that the current FPU context 
may not belong to the current task. 


13 
The first word or doubleword 
of a numeric operand 
is not entirely within the limit of its 
segment. 
The return address 
pushed onto the stack of the exception 
handler points at the 
Floating Point instruction 
that caused the exception, 
including 
any prefixes. The FPU has 
not executed 
this instruction; 
the instruction 
pointer and data pointer register refer to a 
previous, 
correctly 
executed 
instruction. 


16 
The previous 
numerics 
instruction 
caused an unmasked 
exception. 
The address of the 
faulty instruction 
and the address of its operand 
are stored in the instruction 
pointer and 
data pointer registers. 
Only Floating 
Point and WAIT instructions 
can cause this interrupt. 


The Intel486TM OX processor 
return address 
pushed onto the stack of the exception 
handler points to a WAIT or Floating 
Point instruction 
(including 
prefixes). 
This instruction 
can be restarted 
after clearing the exception 
condition 
in the FPU. The FNINIT, FNCLEX. 
FNSTSW, 
FNSTENV, 
and FNSAVE instructions 
cannot cause this interrupt. • 
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When the processor is reset or powered up it is ini- 
tialized in Real Mode. Real Mode has the same base 
architecture as the 8086, but allows access to the 
32-bit register set of the Intel486 microprocessor 
family. The addressing mechanism, memory size, in- 
terrupt handling, are all identical to the Real Mode 
on the 80286. 


All of the Intel486 microprocessor instructions are 
available in Real Mode (except those instructions 
listed in Section 4.6.4). The default operand size in 
Real Mode is 16 bits, just like the 8086. In order to 
use the 32-bit registers and addressing modes, over- 
ride prefixes must be used. In addition, the segment 
size on the Intel486 microprocessor in Real Mode is 
64 Kbytes so 32-bit effective addresses must have a 
value less the OOOOFFFFH.The primary purpose of 
Real Mode is to set up the processor for Protected 
Mode Operation. 


The LOCK prefix on the Intel486 microprocessor, 
even in Real Mode, is more restrictive than on the 
80286. This is due to the addition of paging on the 
Intel486 microprocessor in Protected Mode and Vir- 
tual 8086 Mode. Paging makes it impossible to guar- 
antee that 
repeated 
string instructions 
can 
be 
LOCKed. The Intel486 microprocessor can't require 
that all pages holding the string be physically pres- 
ent in memory. Hence, a Page Fault (exception 14) 
might have to be taken during the repeated string 
instruction. Therefore the LOCK prefix can't be sup- 
ported during repeated string instructions. 


These are the only instruction forms where the 
LOCK prefix is legal on the Intel486 DX2 microproc- 
essor: 


Opcode 
Operands 
(Oest, Source) 


BITTestand 
Mem,Reg/immed 
SET/RESET/COMPLEMENT 
XCHG 
Reg,Mem 
XCHG 
Mem,Reg 
ADD,OR,ADC,SBB, 
Mem,Reg/immed 
AND,SUB,XOR 
NOT,NEG,INC,DEC 
Mem 
CMPXCHG.XADD 
Mem,Reg 


An exception 6 will be generated if a LOCK prefix is 
placed before any instruction form or opcode not 
listed above. The LOCK prefix allows indivisible 
read/modify/write 
operations on memory operands 
using the instructions above. For example, even the 
ADD Reg, Mem is not LOCKable, because the Mem 
operand is not the destination (and therefore no 
memory read/modify/operation is being performed). 


Since, on the Intel486 microprocessor, repeated 
string instructions are not LOCKable, it is not possi- 
ble to LOCK the bus for a long period of time. There- 
fore, the LOCK prefix is not IOPL-sensitive on the 
Intel486 microprocessor. The LOCK prefix can be 
used at any privilege level, but only on the instruc- 
tion forms listed above. 
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3.2 
Memory Addressing 


In Real Mode the maximum 
memory size is limited to 
1 megabyte. 
Thus, 
only address 
lines A2-A 
19 are 
active. 
(Exception, 
after RESET address 
lines A20- 
A31 are high during CS-relative 
memory 
cycles 
until 
an intersegment 
jump or call is executed 
(see Sec- 
tion 6.5)). 


Since paging is not allowed 
in Real Mode the linear 
addresses 
are 
the 
same 
as 
physical 
addresses. 
Physical 
addresses 
are 
formed 
in Real 
Mode 
by 
adding the contents 
of the appropriate 
segment 
reg- 
ister which 
is shifted 
left by four bits to an effective 
address. 
This addition 
results 
in a physical 
address 
from OOOOOOOOHto 0010FFEFH. 
This is compatible 
with 80286 
Real Mode. Since segment 
registers 
are 
shifted 
left by 4 bits, Real Mode 
segments 
always 
start on 16 byte boundaries. 


All segments 
in Real 
Mode 
are exactly 
64 Kbytes 
long, 
and 
may be read, 
written, 
or executed. 
The 
Intel486 
microprocessor 
will generate 
an exception 
13 if a data operand 
or instruction 
fetch occurs 
past 
the 
end 
of a segment 
(Le., if an operand 
has an 
offset 
greater 
than FFFFH, 
for example 
a word with 
a low byte at FFFFH 
and the high byte at OOOOH). 


Segments 
may be overlapped 
in Real Mode. Thus, if 
a particular 
segment 
does not use all 64 Kbytes an- 
other 
segment 
can be overlayed 
on top of the un- 
used 
portion 
of the previous 
segment. 
This allows 
the programmer 
to minimize 
the amount 
of physical 
memory 
needed 
for a program. 


There 
are two fixed areas in memory 
which 
are re- 
served 
in Real address 
mode: 
system 
initialization 
area and the interrupt 
table area. Locations 
OOOOOH 
through 
003FFH 
are reserved 
for interrupt 
vectors. 
Each one of the 256 possible 
interrupts 
has a 4-byte 
jump vector 
reserved 
for it. Locations 
FFFFFFFOH 
through 
FFFFFFFFH 
are reserved 
for system initiali- 
zation. 


3.4 
Interrupts 


Many 
of the 
exceptions 
shown 
in Table 
2.16 
and 
discussed 
in Section 
2.7 are not applicable 
to Real 
Mode operation, 
in particular 
exceptions 
10, 11, 14, 
17, will not happen 
in Real Mode. Other exceptions 
have slightly different 
meanings 
in Real Mode; Table 
3.1 identifies 
these exceptions. 


The 
HLT instruction 
stops 
program 
execution 
and 
prevents 
the processor 
from using the local bus until. 


restarted. 
Either 
NMI, INTR with interrupts 
enabled 
(IF = 1), or RESET will force the Intel486 
DX2 micro- 
processor 
out of halt. If interrupted, 
the saved CS:IP 
will point to the next instruction 
after the HL T. 


As in the case in protected 
mode, the shutdown 
will 
occur when a severe error is detected 
that prevents 
further 
processing. 
In Real Mode, shutdown 
can oc- 
cur under two conditions: 


An interrupt 
or an exception 
occur 
(exceptions 
8 or 
13) and the interrupt 
vector 
is larger than the Inter- 
rupt Descriptor 
Table 
(Le., there 
is not an interrupt 
handler 
for the interrupt). 


A CALL, 
INT or PUSH instruction 
attempts 
to wrap 
around the stack segment 
when SP is not even (Le., 


pushing 
a value on the stack when 
SP = 0001 re- 
sulting in a stack segment 
greater 
than FFFFH). 


An NMI input can bring the processor 
out of shut- 
down 
if the Interrupt 
Descriptor 
Table 
limit is large 
enough 
to contain 
the NMI interrupt 
vector 
(at least 
0017H) 
and the stack 
has enough 
room to contain 
the vector 
and flag 
information 
(Le., SP is greater 
than 
0005H). 
If these 
conditions 
are not 
met, 
the 
Intel486 
DX2 CPU is unable to execute 
the NMI and 
executes 
another 
shutdown 
cycle. 
In this case, the 
processor 
remains in the shutdown 
and can only exit 
via the RESET input. 


Function 
Interrupt 
Related 
Return 
Number 
Instructions 
Address 
Location 


Interrupt 
table limit too small 
8 
INT Vector is not 
Before 
within table limit 
Instruction 


CS, OS, ES, FS, GS 
13 
Word memory reference 
Before 
Segment 
overrun exception 
beyond offset 
= FFFFH. 
Instruction 
An attempt 
to execute 
past the end of CS segment. 


SS Segment 
overrun exception 
12 
Stack Reference 
Before 
beyond offset 
= FFFFH 
Instruction 


The complete capabilities of the Intel486 DX2 micro- 
processor are unlocked when the processor oper- 
ates in Protected Virtual Address Mode (Protected 
Mode). Protected Mode vastly increases the linear 
address space to four gigabytes (232 bytes) and al- 
lows the running of virtual memory programs of al- 
most unlimited size (64 terabytes or 246 bytes). In 
addition Protected Mode allows the Intel486 DX2 mi- 
croprocessor to run all of the existing 8086, 80286 
and Intel386 microprocessor software, while provid- 
ing a sophisticated memory management and a 
hardware-assisted protection mechanism. Protected 
Mode allows the use of additional instructions espe- 
cially optimized for supporting multitasking operating 
systems. The base architecture of the Intel486 mi- 
croprocessor remains the same, the registers, in- 
structions, and addressing modes described in the 
previous sections are retained. The main difference 
between Protected Mode, and Real Mode from a 
programmer's view is the increased address space, 
and a different addressing mechanism. 


ACCESS RIGHTS 


LIMIT 


BASE ADDRESS 


Like Real Mode, Protected Mode uses two compo- 
nents to form the logical address, a 16-bit selector is 
used to determine the linear base address of a seg- 
ment, the base address is added to a 32-bit effective 
address to form a 32-bit linear address. The linear 
address is then either used as the 32-bit physical 
address, or if paging is enabled the paging mecha- 
nism maps the 32-bit linear address into a 32-bit 
physical address. 


The difference between the two modes lies in calcu- 
lating the base address. In Protected Mode the se- 
lector is used to specify an index into an operating 
system defined table (see Figure 4.1). The table 
contains the 32-bit base address of a given seg- 
ment. The physical address is formed by adding the 
base address obtained from the table to the offset. 


Paging provides an additional memory management 
mechanism which operates only in Protected Mode. 
Paging provides a means of managing the very large 
segments of the Intel486 microprocessor family. As 
such, paging operates beneath segmentation. The 
paging mechanism translates the protected linear 
address which comes from the segmentation unit 
into a physical address. Figure 4.2 shows the com- 
plete Intel486 DX2 addressing mechanism with pag- 
ing enabled. 


Inlet486DX 
CPIJ 
PAGING 
llECHANISll 


PAGE FRAllE 


ADDRESS 


Segmentation is one method of memory manage- 
ment. Segmentation provides the basis for protec- 
tion. Segments are used to encapsulate regions of 
memory which have common attributes. For exam· 
pie, all of the code of a given program could be con- 
tained in a segment, or an operating system table 
may reside in a segment. All information about a 
segment is stored in an 8 byte data structure called 
a descriptor. All of the descriptors in a system are 
contained in tables recognized by hardware. 


The following terms are used throughout the discus- 
sion of descriptors, privilege levels and protection: 


PL: Privilege Level-one 
of the four hierarchical 
privilege levels. Level 0 is the most privileged level 
and level 3 is the least privileged. More privileged 
levels are numerically smaller than less privileged 
levels. 


RPL: Requestor Privilege Level-The 
privilege level 
of the original supplier of the selector. RPL is deter- 
mined by the least two significant bits of a selector. 


DPL: Descriptor Privilege Level-This 
is the least 
privileged level at which a task may access that de- 
scriptor (and the segment associated with that de- 
scriptor). Descriptor Privilege Level is determined by 
bits 6:5 in the Access Right Byte of a descriptor. 


CPL: Current Privilege Level-The 
privilege level at 
which a task is currently executing, which equals the 
privilege level of the code segment being executed. 
CPL can also be determined by examining the low- 
est 2 bits of the CS register, except for conforming 
code segments. 


EPL: Effective Privilege Level-The 
effective privi- 
lege level is the least privileged of the RPL and DPL. 
Since smaller privilege level values indicate greater 
privilege, EPL is the numerical maximum of RPL and 
DPL. 


Task: One instance of the execution of a program. 
Tasks are also referred to as processes. 


4.3.3.1 Descriptor Tables Introduction 


The descriptor tables define all of the segments 
which are used in a Intel486 microprocessor system. 
There are three types of tables on the Intel486 DX2 
microprocessor which hold descriptors: the Global 
Descriptor Table, Local Descriptor Table, and the In- 
terrupt Descriptor Table. All of the tables are vari- 
able length memory arrays. They can range in size 
between 8 bytes and 64 Kbytes. Each table can hold 
up to 8192 8-byte descriptors. The upper 13 bits of a 
selector are used as an index into the descriptor ta- 
ble. The tables have registers associated with them 
which hold the 32-bit linear base address, and the 
16-bit limit of each table. 


• 


Each of the tables has a register associated with it, 
the GDTR, LDTR, and the IDTR (see Figure 4.3). 
The LGDT, LLDT, and L1DT instructions, load the 
base and limit of the Global, Local, and Interrupt De- 
scriptor Tables, respectively, into the appropriate 
register. The SGDT, SLDT, and SIDT store the base 
and limit values. These tables are manipulated by 
the operating system. Therefore, the load descriptor 
table instructions are privileged instructions. 


0 
32 


lOT LIMIT 
PROGRAM INVISIBLE 
AUTOMAT1CALLY LOADED 
FROM LOT DESCRIPTOR 
IDTR 
._------------- 


31 
0 


15 
0 


GOT LIMIT 


4.3.3.2 Global Descriptor Table 


The Global Descriptor Table (GDT) contains de- 
scriptors which are possibly available to all of the 
tasks in a system. The GDT can contain any type of 
segment descriptor except for descriptors which are 
used for servicing interrupts (i.e., interrupt and trap 
descriptors). Every Intel486 microprocessor system 
contains a GDT. Generally the GDT contains code 
and data segments used by the operating systems 
and task state segments, and descriptors for the 
LDTs in a system. 


The first slot of the Global Descriptor Table corre- 
sponds to the null selector and is not used. The null 
selector defines a null pointer value. 


4.3.3.3 Local Descriptor Table 


LDTs contain descriptors which are associated with 
a given task. Generally, operating systems are de- 
signed so that each task has a separate LDT. The 
LDT may contain only code, data, stack, task gate, 
and call gate descriptors. LDTs provide a mecha- 


nism for isolating a given task's code and data seg- 
ments from the rest of the operating system, while 
the GDT contains descriptors for segments which 
are common to all tasks. A segment cannot be ac- 
cessed by a task if its segment descriptor does not 
exist in either the current LDT or the GDT. This pro- 
vides both isolation and protection for a task's seg- 
ments, while still allowing global data to be shared 
among tasks. 


Unlike the 6 byte GDT or IDT registers which contain 
a base address and limit, the visible portion of the 
LDT register contains only a 16-bit selector. This se- 
lector refers to a Local Descriptor Table descriptor in 
the GDT. 


4.3.3.4 Interrupt 
Descriptor Table 


The third table needed for Intel486 microprocessor 
systems is the Interrupt Descriptor Table. (See Fig- 
ure 4.4.) The IDT contains the descriptors which 
point to the location of up to 256 interrupt service 
routines. The IDT may contain only task gates, inter- 
rupt gates, and trap gates. The IDT should be at 
least 256 bytes in size in order to hold the descrip- 
tors for the 32 Intel Reserved Interrupts. Every inter- 
rupt used by a system must have an entry in the IDT. 
The IDT entries are referenced via INT instructions, 
external interrupt vectors, and exceptions. (See Sec- 
tion 2.7 Interrupts). 


QATEFOR 
INTERRUPT 
'n 


QATEFOR 
INTERRUPT 
'n-1 


INTERRUPT 
DESCRIPTOR 
TABLE 
(lOT) 
GATE FOR 
INTERRUPT 
'1 


QATEFOR 
INTERRUPT 
#0 lId 


Figure 4.4. Interrupt Descriptor 
Table Register Use 


4.3.4.1 Descriptor 
Attribute 
Bits 


The object to which the segment selector points to 
is called a descriptor. Descriptors are eight byte 
quantities which contain attributes about a given re- 
gion of linear address space (i.e., a segment). These 
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attributes 
include 
the 32-bit 
base 
linear 
address 
of 
the segment, 
the 20-bit length and granularity 
of the 
segment, 
the protection 
level, read, write or execute 
privileges, 
the default 
size of the operands 
(16-bit or 
32-bit), 
and the type of segment. 
All of the attribute 
information 
about a segment 
is contained 
in 12 bits 
in the segment 
descriptor. 
Figure 4.5 shows the gen- 
eral format 
of a descriptor. 
All segments 
on the In- 
tel486 
microprocessor 
have three 
attribute 
fields 
in 
common: 
the P bit, the DPL bit, and the 5 bit. The 
Present 
P bit is 1 if the segment 
is loaded in physical 
memory, 
if P =0 then 
any attempt 
to access 
this 
segment 
causes 
a not present 
exception 
(exception 
11). The Descriptor 
Privilege 
Level DPL is a two-bit 
field which specifies 
the protection 
level 0-3 
associ- 
ated with a segment. 


The Intel486 
DX microprocessor 
has two main cate- 
gories of segments: 
system 
segments 
and non-sys- 
tem segments 
(for code and data). The segment 5 
bit in the segment 
descriptor 
determines 
if a given 
segment 
is a system segment 
or a code or data seg- 
ment. 
If the S bit is 1 then the segment 
is either 
a 
code or data segment, 
if it is 0 then the segment 
is a 
system 
segment. 


4.3.4.2 
Intel486 
CPU Code, 
Data Descriptors 
(5= 1) 


Figure 4.6 shows 
the general 
format 
of a code and 
data descriptor 
and Table 4.1 illustrates 
how the bits 
in the Access 
Rights 
Byte are interpreted. 


o 
BYTE 
ADDRESS 
o 


BASE 


23 ... 
16 


BASE 
Base Address of the segment 
LIMIT 
The length of the segment 
P 
PresentBit 
1= Present 0= Not Present 
DPL 
DescriptorPrivilegeLevel 0-3 
S 
Segment Descriptor 0= System Descriptor 1= Code or Data SegmentDescriptor 
TYPE 
Type of Segment 
A 
Accessed Bit 
G 
GranularityBit 
1= Segmentlength is page granular 
0= Segmentlength is byte granular 
D 
Default Operation Size (recognizedin code segment descriptorsonly) 
1= 32-bit segment 0= 16-bitsegment 
o 
Bit must be zero (0) for compatibilitywith future processors 
AVL 
Availablefield for user or OS 


NOTE: 
In a maximum-sizesegment (i.e., a segment with G= 1 and segment limit 19...0= FFFFFH),the lowest 12 bits of the 
segment base should be zero (i.e.• segment base 11...000= OOOH). 


SEGMENT 
BASE 15 ... 
0 
SEGMENT 
LIMIT 15 ... 
0 


LIMIT 
ACCESS 
BASE 
BASE 31 ... 
24 
G 
D 
0 
AVL 
19 ... 
16 
RIGHTS 
23 ... 
16 
BYTE 


D/B 
1= Default InstructionAttributes are 32-Bits 
0= Default InstructionAttributes are 16-Bits 
AVL 
Availablefield for user or OS 
G 
GranularityBit 
1= Segment length is page granular 
0= Segment length is byte granular 
o 
Bit must be zero (0) for compatibilitywith future processors 


Bit 
Name 
Function 
Position 


7 
Present (P) 
P = 1 
Segment 
is mapped into physical 
memory. 
P=O 
No mapping to physical 
memory exits, base and limit are 
not used. 


6-5 
Descriptor 
Privilege 
Segment 
privilege 
attribute 
used in privilege tests. 
Level (DPL) 
4 
Segment 
Descrip- 
S = 1 
Code or Data (includes stacks) segment 
descriptor. 


tor (S) 
S=O 
System Segment 
Descriptor 
or Gate Descriptor. 


3 
Executable 
(E) 
E ~ 0 
De"ript", type • data••• ~"t 
} If 
2 
Expansion 
Direc- 
ED = 0 Expand up segment, 
offsets 
must be ~ limit. 
Data 
tion (ED) 
ED = 1 Expand down segment, 
offsets 
must be > limit. 
Segment 
1 
Writeable 
(W) 
W = 0 
Data segment 
may not be written into. 
(S = 1, 
W = 1 
Data segment 
may be written into. 
E = 0) 


3 
Executable 
(E) 
E = 
1 
Descriptor 
type is code segment: 
r 


2 
Conforming 
(C) 
C = 
1 
Code segment 
may only be executed 
Code 
when CPL ;;: DPL and CPL 
Segment 
remains unchanged. 
(S = 
1, 
1 
Aeadable 
(A) 
A=O 
Code segment 
may not be read. 
E = 1) 
A = 
1 
Code segment 
may be read. 


0 
Accessed 
(A) 
A=O 
Segment 
has not been accessed. 
A=1 
Segment 
selector 
has been loaded into segment 
register 
or used by selector 
test instructions. 


Type 
Field 
Definition 


Code 
and 
data 
segments 
have 
several 
descriptor 
fields in common. 
The accessed 
A bit is set whenev- 
er the processor 
accesses 
a descriptor. 
The A bit is 
used by operating 
systems 
to keep usage statistics 
on a given 
segment. 
The 
G bit, or granularity 
bit, 
specifies 
if a segment 
length 
is byte-granular 
or 
page-granular. 
Intel486 
DX 
microprocessor 
seg- 
ments can be one megabyte 
long with byte granular- 
ity (G = 0) or four 
gigabytes 
with 
page 
granularity 
(G = 1), (Le., 220 pages 
each 
page 
is 4 Kbytes 
in 
length). The granularity 
is totally unrelated 
to paging. 
A Intel486 
DX microprocessor 
system can consist 
of 
segments 
with byte granularity, 
and page granularity, 
whether 
or not paging is enabled. 


The executable 
E bit tells if a segment 
is a code or 
data segment. 
A code segment 
(E = 1, S = 1) may be 
execute-only 
or execute/read 
as determined 
by the 
Aead 
R bit. 
Code 
segments 
are 
execute 
only 
if 
A = 0, and 
execute/read 
if A = 1. COde segments 
may never be written 
into. 


NOTE: 
Code segments 
may be modified 
via aliases. 
Alias- 
es are writeable 
data 
segments 
which 
occupy 
the 
same 
range 
of linear 
address 
space 
as the 
code 
segment. 


The 0 bit indicates 
the default 
length 
for operands 
and effective 
addresses. 
If D = 1 then 
32-bit 
oper- 
ands and 32-bit addressing 
modes 
are assumed. 
If 
D = 0 then 
16-bit 
operands 
and 
16-bit 
addressing 
modes 
are assumed. 
Therefore 
all existing 
80286 
code segments 
will execute 
on the Intel486 
DX mi- 
croprocessor 
assuming 
the D bit is set O. 


Another 
attribute 
of code segments 
is determined 
by 
the conforming 
C bit. Conforming 
segments, 
C = 1, 
can be executed 
and shared 
by programs 
at differ- 
ent privilege 
levels. 
(See Section 
4.4 Protection.) 


Segments 
identified 
as data segments 
(E = 0, S = 1) 
are used for two types of Intel486 
DX microproces- 
sor segments: 
stack and data segments. 
The expan- 
sion 
direction 
(ED) 
bit specifies 
if a segment 
ex- 
pands downward 
(stack) 
or upward 
(data). 
If a seg- 
ment is a stack segment 
all offsets 
must be greater 
than the segment 
limit. On a data segment 
all off- 
sets must be less than or equal to the limit. In other 
words, 
stack 
segments 
start at the base 
linear ad- 
dress 
plus 
the 
maximum 
segment 
limit 
and 
grow 
down 
to the base 
linear address 
plus the limit. On 
the other hand, data segments 
start at the base lin- 
ear address 
and expand 
to the base linear address 
plus limit. 


The write W bit controls the ability to write into a 
segment. Data segments are read-only if W=O. The 
stack segment must have W= 1. 


The B bit controls the size of the stack pointer regis- 
ter. If B= 1, then PUSHes, POPs,and CALLs all use 
the 32-bit ESP register for stack references and as- 
sume an upper limit of FFFFFFFFH. If B=0, stack 
instructions all use the 16·bit SP register and as- 
sume an upper limit of FFFFH. 


4.3.4.3 System Descriptor 
Formats 


System segments describe information about oper· 
ating system tables, tasks, and gates. Figure 4.7 
shows the general format of system segment de- 
scriptors, and the various types of system segments. 
Intel486 OXmicroprocessor system descriptors con- 
tain a 32-bit base linear address and a 20-bit seg- 
ment limit. 80286 system descriptors have a 24-bit 
base address and a 16-bit segment limit. 80286 sys- 
tem descriptors are identified by the upper 16 bits 
being all zero. 


4.3.4.4 LOT Descriptors 
(S=0, TYPE=2) 


LOT descriptors (S= 0, TYPE= 2) contain informa- 
tion about Local Descriptor Tables. LOTs contain a 
table of segment descriptors, unique to a particular 
task. Since the instruction to load the LDTR is only 
available at privilege level 0, the DPL field is ignored. 
LOT descriptors are only allowed in the Global De- 
scriptor Table (GOD. 


4.3.4.5 TSS Descriptors 
(S= 0, 
TYPE = 1,3,9, 
B) 


A Task State Segment (TSS) descriptor contains in- 
formation about the location, size, and privilege level 
of a Task State Segment (TSS). A TSS in turn is a 
special fixed format segment which contains all the 
state information for a task and a linkage field to 
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SEGMENT BASE 15 ... 0 


LIMIT 
19 ... 16 


Type 
o 
1 
2 
3 
4 
5 
6 
7 


Dellnes 


Invalid 
Available 
80286 
TSS 
LOT 
Busy 
80286 
TSS 
80286 
Call Gale 
Task 
Gale 
(for 80286 
or Inle1486T" 
OX CPU Task) 
80286 
Inlerrupl 
Gale 
80286 
Trap 
Gate 


permit nesting tasks. The TYPE field is used to indi- 
cate whether the task is currently BUSY (i.e., on a 
chain of active tasks) or the TSS is available. The 
TYPE field also indicates if the segment contains a 
80286 or an Intel486 OX microprocessor TSS. The 
Task Register (TR) contains the selector which 
points to the current Task State Segment. 


4.3.4.6 Gate Descriptors 
(S= 0, 
TYPE=4-7, C, F) 


Gates are used to control access to entry points 
within the target code segment. The various types of 
gate descriptors are call gates, task gates, inter- 
rupt gates, and trap gates. Gates provide a level of 
indirection between the source and destination of 
the control transfer. This indirection allows the proc- 2 
essor to automatically perform protection checks. It 
also allows system designers to control entry points 
to the operating system. Call gates are used to 
change privilege levels (see Section 4.4 Protec- 
tion), task gates are used to perform a task switch, 
and interrupt and trap gates are used to specify in- 
terrupt service routines. 


Figure 4.8 shows the format of the four types of gate 
descriptors. Call gates are primarily used to transfer 
program control to a more privileged level. The call 
gate descriptor consists of three fields: the access 
byte, a long pointer (selector and offset) which 
points to the start of a routine and a word count 
which specifies how many parameters are to be cop- 
ied from the caller's stack to the stack of the called 
routine. The word count field is only used by call 
gates when there is a change in the privilege level, 
other types of gates ignore the word count field. 


Interrupt and trap gates use the destination selector 
and destination offset fields of the gate descriptor as 
a pointer to the start of the interrupt or trap handler 
routines. The difference between interrupt gates and 
trap gates is that the interrupt gate disables inter- 
rupts (resets the IF bit) while the trap gate does not. 


BASE 
23 ... 16 


Type 
8 
9 
A 
B 
C 
o 
E 
F 


Dellnes 


Invalid 
Available 
Inle1486T •• OX CPU TSS 
Undefined 
(Inlel 
Reserved) 
Busy 
Inle1486T" 
OX CPU TSS 
Inte1486T" 
OX CPU Call Gale 
Undefined 
(Intel 
Reserved) 
Inlel486TM 
OX CPU Interrupl 
Gale 
Inlel486TM 
OX CPU Trap 
Gale 


Task gates are used to switch tasks. Task gates 
may only refer to a task state segment (see Section 
4.4.6 Task Switching) therefore only the destination 
selector portion of a task gate descriptor is used, 
and the destination offset is ignored. 


Exception 13 is generated when a destination selec- 
tor does not refer to a correct descriptor type, i.e., a 
code segment for an interrupt, trap or call gate, a 
TSS for a task gate. 


The access byte format is the same for all gate de- 
scriptors. P= 1 indicates that the gate contents are 
valid. P= 0 indicates the contents are not valid and 
causes exception 11 if referenced. DPL is the de- 
scriptor privilege level and specifies when this de- 
scriptor may be used by a task (see Section 4.4 Pro- 
tection). The S field, bit 4 of the access rights byte, 
must be 0 to indicate a system control descriptor. 
The type field specifies the descriptor type as indi- 
cated in Figure 4.8. 


4.3.4.7 Differences 
Between Intel486 DX2 
Microprocessor 
and 80286 Descriptors 


In order to provide operating system compatibility 
between the 80286 and Intel486 DX2 microproces- 
sor, the Intel486 OX microprocessor supports all of 
the 80286 segment descriptors. Figure 4.9 shows 
the general format of an 80286 system segment de- 
scriptor. The only differences between 80286 and 
Intel486 OX microprocessor descriptor formats are 
that the values of the type fields, and the limit and 
base address fields have been expanded for the In- 
tel486 OX microprocessors. The 80286 system seg- 
ment descriptors contained a 24-bit base address 
and 16-bit limit, while the Intel486 OX microproces- 
sor system segment descriptors have a 32·bit base 
address, a 20-bit limit field, and a granularity bit. 


SELECTOR 
OFFSET 15 ... 0 


WORD 
OFFSET 31 ... 16 
P 
DPL 
0 
TYPE 
0 
0 
0 
COUNT 


I 
4 ... 0 


GateDescriptor Fields 
Name 
Value 
Description 
Type 
4 
80286 call gate 
5 
Task gate (for 80286 or Inte1486T'"DX CPUtask) 
6 
80286 interruptgate 
7 
80286 trap gate 
C 
Inte1486T'"DX CPUcall gate 
E 
Inte1486T'"DX CPUinterruptgate 
F 
Inte1486T'"DX CPUtrap gate 
P 
0 
Descriptorcontents are not valid 
1 
Descriptorcontents are valid 


DPL-Ieast privilegedlevelat whicha task mayaccessthe gate.WORDCOUNT0-31-the 
numberof parametersto copy from caller's stack 
to the called procedure'sstack. The parametersare 32·bit quantitiesfor Inte1486T'"DX CPUgates, and 16·bitquantitiesfor 80286 gates. 


DESTINATION 
16·bit 
Selectorto the target code segment 
SELECTOR 
selector 
or 
Selectorto the target task state segmentfor task gate 


offset 
Entry point within the target code segment 
16·bit80286 
32·bit Inte1486T'"DX CPU 
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SEGMENT BASE 15 ... 0 


Intel Reserved 
Set to 0 


BASE 
LIMIT 
P 


'Base Addressof the segment 
The length of the segment 
PresentBit 
1= Present 
0= Not Present 


o 


BASE 
+4 
23 ... 
16 


DescriptorPrivilegeLevel 0- 3 
SystemDescriptor 
0= System 
1= User 
Type of Segment 
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By supporting 80286 system segments the Intel486 
DX microprocessor is able to execute 80286 appli- 
cation programs on an Intel486 DX microprocessor 
operating system. This is possible because the proc- 
essor automatically understands which descriptors 
are 80286-style descriptors and which descriptors 
are Intel486 DX Microprocessor-style descriptors. In 
particular, if the upper word of a descriptor is zero, 
then that descriptor is a 80286-style descriptor. 


The only other differences between 80286-style de- 
scriptors and Intel486 DX microprocessor-style de- 
scriptors is the interpretation of the word count field 
of call gates and the B bit. The word count field 
specifies the number of 16-bit quantities to copy for 
80286 call gates and 32-bit quantities for Intel486 
DX microprocessor call gates. The B bit controls the 
size of PUSHes when using a call gate; if B=0 
PUSHes are 16 bits, if B= 1 PUSHes are 32 bits. 


4.3.4.8 
Selector 
Fields 


A selector in Protected Mode has three fields: Local 
or Global Descriptor Table Indicator (TI), Descriptor 


6 


5 


<4 


3:0ESCRipTOR" 


2 


LOCAL 


DESCRIPTOR 


TABLE 


Entry Index (Index), and Requestor (the selector's) 
Privilege Level (RPL) as shown in Figure 4.10. The 
TI bits select one of two memory-based tables of 
descriptors (the Global Descriptor Table or the Local 
Descriptor Table). The Index selects one of 8K de- 
scriptors in the appropriate descriptor table. The 
RPL bits allow high speed testing of the selector's 
privilege attriblJtes. 


4.3.4.9 
Segment 
Descriptor 
Cache 


In addition to the selector value, every segment reg- 
ister has a segment descriptor cache register asso- 
ciated with it. Whenever a segment register's con- 
tents are changed, the 8-byte descriptor associated 
with that selector is automatically loaded (cached) 
on the chip. Once loaded, all references to that seg- 
ment use the cached descriptor information instead 
of reaccessing the descriptor. The contents of the 
descriptor cache are not visible to the programmer. 
Since descriptor caches only change when a seg- 
ment register is changed, programs which modify 
the descriptor tables must reload the appropriate 
segment registers after changing a descriptor's val- 
ue. 


TABLE 


INDICATOR 


TI=1 


. 


NULL 


GLOBAL 


DESCRIPTOR 


TABLE 
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4.3.4.10 segment Descriptor 
Register settings 


The contents of the segment descriptor cache vary 
depending on the mode the Intel486 OX microproc- 
essor is operating in. When operating in Real Ad- 
dress Mode, the segment base, limit, and other attri- 
butes within the segment cache registers are de- 
fined as shown in Figure 4.11. For compatibility with 


the 8086 architecture, the base is set to sixteen 
times the current selector value, the limit is fixed at 
OOOOFFFFH,and the attributes are fixed so as to 
indicate the segment is present and fully usable. In 
Real Address Mode, the internal "privilege level" is 
always fixed to the highest level, level 0, so 1/0 and 
other privileged opcodes may be executed. 


32 - BIT BASE 


(UPDATED 
DURING 
SELECTOR 


LOAD 
INTO SEGIoIENT REGISTER) 


32 
- BIT LIIoIIT 


(fiXED) 


OTHER ATTRIBUTES 


(FIXED) 


CONrORloIlNG 
PRIVILEGE 


STACK 
SIZE 
EXECUTABLE 
WRITEABLE 


READABLE 


EXPANSION 
DIRECTION 
1 


GRANULARITY 
1 


~~ED 
1 
PRIVILEGE 
LEVEL 
1 


~R~S~:T 
!!.A~E 
:I~I~ 
} 
_-l_ _ _ __ 


CS 
16X 
CURRENT 
CS SELECTOR" 
oooorrrrH 
Y 
0 
Y 
B 
U 
Y 
Y 
Y - 
N 


SS 
16X 
CURRENT 
SS SELECTOR 
oooorrrrH 
Y 
0 
Y 
B 
U 
Y 
Y 
N 
W - 


OS 
16X 
CURRENT 
OS SELECTOR 
oooorrrrH 
Y 
0 
Y 
B 
U 
Y 
Y 
N - - 


ES 
16X 
CURRENT 
ES SELECTOR 
oooorrrrH 
Y 
0 
Y 
B 
U 
Y 
Y 
N - - 


rs 
16X 
CURRENT 
rs 
SELECTOR 
oooorrrrH 
Y 
0 
Y 
B 
U 
Y 
Y 
N - - 


GS 
16X 
CURRENT 
GS SELECTOR 
oooorrrrH 
Y 
0 
Y 
B 
U 
Y 
Y 
N - - 


"Except 
!he 
32·tJit 
CS 
base 
is initialized 
10 FFFFFOOOH 
after 
resel 
unlil 
firsl 
inlersegment 
control 
transfer 
(i.e .• inlersegmenl 
CALL, 
or 


intersegment 
JMP, or INT). 
(See 
Figure 
4.13 
Example.) 


Key: 
Y 
= yes 
N 
= no 
o 
= privilege 
level 0 
1 
= privilege 
level 
1 


2 
= privilege 
level 
2 
3 
= privilege 
level 
3 
U 
= expand 
up 


o 
= expand 
down 
B 
= byte granularity 
P 
= page 
granularity 
W 
= push/pop 
16-biI 
words 
F 
= push/pop 
32·bil 
dwords 
- 
= does 
nol apply 
10 that 
segment 
cache 
register 


Figure 4.11. segment Descriptor Caches for Real Address Mode 
(Segment Limit and Attributes are Fixed) 
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When operating in Protected Mode, the segment 
base, limit, and other attributes within the segment 
cache registers are defined as shown in Figure 4.12. 
In Protected Mode, each of these fields are defined 


according to the contents of the segment descriptor 
indexed by the selector value loaded into the seg- 
ment register. 


32 
- 
BIT 
L1t.11T 


(UPDATED 
DURING 


SELECTOR 
LOAD 
INTO 


SEGt.lENT 
REGISTER) 


OTHER 
ATTRIBUTES 


(UPDATED 
DURING 


SELECTOR 
LOAD 
INTO 


SEGt.lENT 
REGISTER) 


32 
- 
BIT 
BASE 


(UPDATED 
DURING 


SELECTOR 
LOAD 
INTO 
SEGt.lENT 
REGISTER) 


CONroRt.lING 
PRIVILEGE 


STACK 
SIZE 


EXECUTABLE 


WRITEABLE 


READABLE 


EXPANSION 
DIRECTION 
1 


GRANULARITY 
1 


ACCESSED 
1 
~:~Vi~:~~ 
~~V:L_~~s~ 
:I~I~ 
tt_______ 


CS 
BASE 
PER 
SEG 
DESCR 
L1t.11T PER 
SEG 
DESCR 
P 
d 
d 
d 
d 
d 
N 
Y 
- 
d 


SS 
BASE 
PER 
SEG 
DESCR 
Llt.lIT 
PER 
SEG 
DESCR 
P 
d 
d 
d 
d 
r 
w 
N 
d 
- 


OS 
BASE 
PER 
SEG 
DESCR 
LIt.lIT 
PER 
SEG 
DESCR 
P 
d 
d 
d 
d 
d 
d 
N - - 


ES 
BASE 
PER 
SEG 
DESCR 
L1t.11T PER 
SEG 
DESCR 
P 
d 
d 
d 
d 
d 
d 
N 
- 
- 


FS 
BASE 
PER 
SEG 
DESCR 
L1t.1lT PER 
SEG 
OESCR 
P 
d 
d 
d 
d 
d 
d 
N 
- 
- 


GS 
BASE 
PER 
SEG 
OESCR 
L1t.11T PER 
SEG 
OESCR 
P 
d 
d 
d 
d 
d 
d 
N - - 


Key: 
Y ~ li,ed yes 
N = li,ed no 
d ~ per segment descriptor 
p = per segment descriptor; descriptor must indicate "present" to avoid e,ception 11 
(e,ception 12 in case 01SS) 
r = per segment descriptor, but descriptor must indicate "readable" to avoid e,coption 13 
(special case lor SS) 
w = per segment descriptor, but descriptor must indicate "writable" to avoid e,coption 13 
(special case lor SS) 
- = does not apply to that segment cache register 
• 


When operating in a Virtual 8086 Mode within the 
Protected Mode, the.segment base, limit, and other 
attributes within the segment cache registers are de- 
fined as shown in Figure 4.13. For compatibility with 
the 8086 architecture, the base is set to sixteen 
times the current selector value, the limit is fixed at 


OOOOFFFFH,and the attributes are fixed so as to 
indicate the segment is present and fully usable. The 
virtual program executes at lowest privilege level, 
level 3, to allow trapping of all IOPL-sensitive in- 
structions and level-Q-onlyinstructions. 


32 - BIT BASE 
32 - BIT LIMIT 
OTHER ATTRIBUTES 


(UPDATED DURING SELECTOR 
(fiXED) 
(fiXED) 
LOAD INTO SEGMENT REGISTER) 


CONfORMING PRIVILEGE 
STACK SIZE 
EXECUTABLE 
WRITEABLE 
READABLE 


EXPANSION DIRECTION 
1 
GRANULARITY 
I 
ACCESSED 
1 


~:~~~:~~ 
~~V:L 
B~-~-E_-_-_ -_-_-_-_-_ -_-_-_-_-~-IM-_I-~-_-_-_-l-J 
__ 
_ 
__ 


CS 
16X CURRENT CS SELECTOR 
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Figure 4.13. Segment Descriptor Caches for Virtual 8086 Mode within Protected Mode 
(Segment LImit and Attributes 
are Fixed) 
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Figure 4.14. Four-Level Hierarchical Protection 


The Intel486 OX Microprocessor has four levels of 
protection which are optimized to support the needs 
of a multi-tasking operating system to isolate and 
protect user programs from each other and the op- 
erating system. The privilege levels control the use 
of privileged instructions, I/O instructions, and ac- 
cess to segments and segment descriptors. Unlike 
traditional 
microprocessor-based 
systems 
where 


this protection is achieved only through the use of 
complex external hardware and software the In- 
tel486 OX Microprocessor provides the protection 
as part of its integrated Memory Management Unit. 
The Intel486 OX Microprocessor offers an additional 
type of protection on a page basis, when paging is 
enabled (See Section 4.5.3 Page Level Protec- 
tion). 


The four-level hierarchical privilege system is illus- 
trated in Figure 4-14. It is an extension of the user/ 
supervisor privilege mode commonly used by mini- 
computers and, in fact, the user/supervisor mode is 
fully supported by the Intel486 OX Microprocessor 
paging mechanism. The privilege levels (PL) are 
numbered 0 through 3. Level 0 is the most privileged 
or trusted level. 


4.4.2 RULES OF PRIVILEGE 


The Intel486 OX Microprocessor controls access to 
both data and procedures between levels of a task, 
according to the following rules. 
• Data stored in a segment with privilege level p 


can be accessed only by code executing at a 
privilege level at least as privileged as p. 


• A code segment/procedure with privilege level p 


can only be called by a task executing at the 
same or a lesser privilege level than p. 


4.4.3.1 Task Privilege 


At any point in time, a task on the Intel486 OX Micro- 
processor always executes at one of the four privi- 
lege levels. The Current Privilege Level (CPL) speci- 
fies the task's privilege level. A task's CPL may only 
be changed by control transfers through gate de- 
scriptors to a code segment with a different privilege 
level. (See Section 4.4.4 Privilege Level Transfers) 
Thus, an application program running at PL = 3 may 
call an operating system routine at PL = 1 (via a 
gate) which would cause the task's CPL to be set to 
1 until the operating system routine was finished. 


4.4.3.2 Selector Privilege (RPL) 


The privilege level of a selector is specified by the 
RPL field. The RPL is the two least significant bits of 
the selector. The selector's RPL is only used to es- 
tablish a less trusted privilege level than the current 
privilege level for the use of a segment. This level is 
called the task's effective privilege level (EPL). The 
EPL is defined as being the least privileged (i.e. nu- 
merically larger) level of a task's CPL and a selec- 
tor's RPL. Thus, if selector's RPL = 0 then the CPL 
always specifies the privilege level for making an ac- 
cess using the selector. On the other hand if RPL = 
3 then a selector can only access segments at level 
3 regardless of the task's CPL. The RPL is most 
commonly used to verify that pointers passed to an 
operating system procedure do not access data that 
is of higher privilege than the procedure that origi- 
nated the pointer. Since the originator of a selector 
can specify any RPL value, the Adjust RPL (ARPL) 
instruction is provided to force the RPL bits to the 
originator's CPL. 


4.4.3.3 1/0 Privilege and 1/0 Permission Bltmap 


The I/O privilege level (IOPL, a 2-bit field in the 
EFLAG register) defines the least privileged level at 
which I/O instructions can be unconditionally per- 
formed. I/O instructions can be unconditionally per- 
formed when CPL !,; 10PL.(The I/O instructions are 
IN, OUT, INS, OUTS, REP INS, and REP OUTS.) 
When CPL > 10PL,and the current task is associat- 
ed with a 286 TSS, attempted I/O instructions cause 
an exception 13 fault. When CPL > 10PL, and the 
current task is associated with an Intel486 OX Micro- 
processor TSS, the I/O Permission Bitmap (part of 
an Intel486 OX Microprocessor TSS) is consulted on 
whether I/O to the port is allowed, or an exception 
13 fault is to be generated instead. For diagrams of 
the I/O Permission Bitmap, refer to Figures 4.15a 
and 4.15b. For further information on how the I/O 
Permission Bitmap is used in Protected Mode or in 
Virtual 8086 Mode, refer to Section 4.6.4 Protection 
and I/O Permission Bitmap. 
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Instruction 
Operands 
Function 


ARPL 
Selector, 
Adjust Requested Privi- 


Register 
lege Level: adjusts the 
RPL of the selector to the 
numeric maximum of 
current selector RPL value 
and the RPL value in the 
register. Set zero flag if 
selector RPL was 
changed. 


VERR 
Selector 
VERify for Read: sets the 
zero flag if the segment 
referred to by the selector 
can be read. 


VERW 
Selector 
VERify for Write: sets the 
zero flag if the segment 
referred to by the selector 
can be written. 


LSL 
Register, 
Load Segment Limit: reads 
Selector 
the segment limit into the 
register if privilege rules 
and descriptor type allow. 
Set zero flag if successful. 


LAR 
Register, 
Load Access Rights: reads 
Selector 
the descriptor access 
rights byte into the register 
if privilege rules allow. Set 
zero flag if successful. 


The I/O privilege level (IOPL) also affects whether 
several other instructions can be executed or cause 
an exception 13 fault instead. These instructions are 
called "IOPL-sensitive" 
instructions and they are 
CLI and STI. (Note that the LOCK prefix is not 10PL- 
sensitive on the Intel486 microprocessor.) 


The 10PL also affects whether the IF (interrupts en- 
able flag) bit can be changed by loading a value into 
the EFLAGS register. When CPL $; 10PL, then the 
IF bit can be changed by loading a new value into 
the EFLAGS register. When CPL > 10PL,the IF bit 
cannot be changed by a new value POP'ed into (or 
otherwise loaded into) the EFLAGS register; the IF 
bit merely remains unchanged and no exception is 
generated. 


4.4.3.4 
Privilege 
Validation 


The Intel486 OX microprocessor provides several in- 
structions to speed pointer testing and help maintain 
system integrity by verifying that the selector value 
refers to an appropriate segment. Table 4.2 summa- 
rizes the selector validation procedures available for 
the Intel486 OX microprocessor. 


This pointer verification prevents the common prob- 
lem of an application at PL = 3 calling a operating 
systems routine at PL = 0 and passing the operat- 
ing system routine a "bad" pointer which corrupts a 
data structure belonging to the operating system. If 
the operating system routine uses the ARPL instruc- 
tion to ensure that the RPL of the selector has no 
greater privilege than that of the caller, then this 
problem can be avoided. 


4.4.3.5 
Descriptor 
Access 


There are basically two types of segment accesses: 
those involving code segments such as control 
transfers, and those involving data accesses. Deter- 
mining the ability of a task to access a segment in- 
volves the type of segment to be accessed, the in- 
struction used, the type of descriptor used and CPL, 
RPL, and OPL as described above. 


Any time an instruction loads data segment registers 
(OS, ES, FS, GS) the Intel486 OX microprocessor 
makes protection validation checks. Selectors load- 
ed in the OS, ES, FS, GS registers must refer only to 
data segments or readable code segments. The 
data access rules are specified in Section 4.4.2 
Rules 
of Privilege. 
The only exception to those 


rules is readable conforming code segments which 
can be accessed at any privilege level. 


Finally the privilege validation checks are performed. 
The CPL is compared to the EPL and if the EPL is 
more privileged than the CPL an exception 13 (gen- 
eral protection fault) is generated. 


The rules regarding the stack segment are slightly 
different than those involving data segments. In- 
structions that load selectors into SS must refer to 
data segment descriptors for writeable data seg- 


ments. The OPL and RPL must equal the CPL. All 
other descriptor types or a privilege level violation 
will cause exception 13. A stack not present fault 
causes exception 12. Note that an exception 11 is 
used for a not-present code or data segment. 


Inter-segment control transfers occur when a selec- 
tor is loaded in the CS register. For a typical system 
most of these transfers are simply the result of a call 
or a jump to another routine. There are five types of 
control transfers which are summarized in Table 4.3. 
Many of these transfers result in a privilege level 
transfer. Changing privilege levels is done only via 
control transfers, by using gates, task switches, and 
interrupt or trap gates. 


Control transfers can only occur if the operation 
which loaded the selector references the correct de- 
scriptor type. Any violation of these descriptor usage 
rules will cause an exception 13 (e.g. JMP through a 
call gate, or IRET from a normal subroutine call). 


In order to provide further system security, all control 
transfers are also subject to the privilege rules. 


The privilege 
rules require 
that: 
- 
Privilege level transitions can only occur via 
gates. 


- 
JMPs can be made to a non-conforming code 
segment with the same privilege or to a conform- 
ing code segment with greater or equal privilege. 
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- 
CALLs can be made to a non-conforming code 
segment with the same privilege or via a gate to a 


. more privileged level. 


- 
Interrupts handled within the task obey the same 
privilege rules as CALLs. 
- 
Conforming Code segments are accessible by 
privilege levels which are the same or less privi- 
leged than the conforming-code segment's OPL. 


- 
Both the requested privilege level (RPL) in the 
selector pointing to the gate and the task's CPL 
must be of equal or greater privilege than the 
gate's OPL. 


- 
The code segment selected in the gate must be 
the same or more privileged than the task's CPL. 


- 
Return instructions that do not switch tasks can 
only return control to a code segment with same 
or less privilege. 
- 
Task switches can be performed by a CALL, 
JMP, or INT which references either a task gate 
or task state segment who's OPL is less privi- 
leged or the same privilege as the old task's CPL. 


Any control transfer that changes CPL within a task 
causes a change of stacks as a result of the privi- 
lege level change. The initial values of SS:ESP for 
privilege levels 0, 1, and 2 are retained in the task 
state segment (see Section 4.4.6 Task Switching). 
Ouring a JMP or CALL control transfer, the new 
stack pointer is loaded into the SS and ESP regis- 
ters and the previous stack pointer is pushed onto 
the new stack. 


Control Transfer 
Types 
Operation 
Types 
Descriptor 
Descriptor 
Referenced 
Table 


Intersegment within the same privilege level 
JMP, CALL, RET, IRET* 
Code Segment 
GOT/LOT 


Intersegment to the same or higher privilege level 
CALL 
Call Gate 
GOT/LOT 
Interrupt within task may change CPL 
Interrupt Instruction, 
Trap or 
lOT 
Exception, External 
Interrupt 
Interrupt 
Gate 


Intersegment to a lower privilege level 
RET,IRET* 
Code Segment 
GOT/LOT 
(changes task CPL) 


CALL, JMP 
Task State 
GOT 
Segment 


Task Switch 
CALL, JMP 
Task Gate 
GOT/LOT 


IRET** 
Task Gate 
lOT 
Interrupt Instruction, 
Exception, External 
Interrupt 


'NT (Nested Task bit of flag register) = 0 
"NT 
(Nested Task bit of flag register) = 1 
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When RETurning to the original privilege level, use 
of the lower-privileged stack is restored as part of 
the RET or IRET instruction operation. For subrou- 
tine calls that pass parameters on the stack and 
cross privilege levels, a fixed number of words (as 
specified in the gate's word count field) are copied 
from the previous stack to the current stack. The 
inter-segment RET instruction with a stack adjust- 
ment value will correctly restore the previous stack 
pointer upon return. 


4.4.5 
CALL 
GATES 


Gates provide protected, indirect CALLs. One of the 
major uses of gates is to provide a secure method of 
privilege transfers within a task. Since the operating 
system defines all of the gates in a system, it can 
ensure that all gates only allow entry into a few trust- 
ed procedures (such as those which allocate memo- 
ry, or perform I/O). 


Gate descriptors follow the data access rules of priv- 
ilege; that is, gates can be accessed by a task if the 
EPL, is equal to or more privileged than the gate 
descriptor's OPL. Gates follow the control transfer 
rules of privilege and therefore may only transfer 
control to a more privileged level. 


Call Gates are accessed via a CALL instruction and 
are syntactically identical to calling a normal subrou- 
tine. When an inter-level Intel486 OX call gate is ac- 
tivated, the following actions occur. 
1. Load CS:EIP from gate check for validity 
2. SS is pushed zero-extended to 32 bits 
3. ESP is pushed 
4. Copy Word Count 32-bit parameters from the 
old stack to the new stack 
5. Push Return address on stack 


The procedure is identical for 80286 Call.gates, ex- 
cept that 16-bit parameters are copied and 16-bit 
registers are pushed. 


Interrupt Gates and Trap gates work in a similar 
fashion as the call gates, except there is no copying 
of parameters. The only difference between Trap 
and Interrupt gates is that control transfers through 
an Interrupt gate disable further interrupts (Le.the IF 
bit is set to 0), and Trap gates leave the interrupt 
status unchanged. 


A very important attribute of any multi-tasking/multi- 
user operating systems is its ability to rapidly switch 


between tasks or processes. The Intel486 OX micro- 
processor directly supports this operation by provid- 
ing a task switch instruction in hardware. The In- 
tel486 OX microprocessor task switch operation 
saves the entire state of the machine (all of the reg- 
isters, address space, and a link to the previous 
task), loads a new execution state, performs protec- 
tion checks, and commences execution in the new 
task, in about 10 microseconds. Like transfer of con- 
trol via gates, the task switch operation is invoked by 
executing an inter-segment JMP or CALL instruction 
which refers to a Task State Segment (TSS), or a 
task gate descriptor in the GOT or LOT. An INT n 
instruction, exception, trap, or external interrupt may 
also invoke the task switch operation if there is a 
task gate descriptor in the associated lOT descriptor 
slot. 


The TSS descriptor points to a segment (see Figure 
4.15) containing the entire Intel486 OX microproces- 
sor execution state while a task gate descriptor con- 
tains a TSS selector. The Intel486 OX microproces- 
sor supports both 80286 and Intel486 OX microproc- 
essor style TSSs. Figure 4.16 shows a 80286 TSS. 
The limit of an Intel486 OX microprocessor TSS 
must be greater than 0064H (002BH for a 80286 
TSS), and can be as large as 4 Gigabytes. In the 
additional TSS space, the operating system is free 
to store additional information such as the reason 
the task is inactive, time the task has spent running, 
and open files belong to the task. 


Each task must have a TSS associated with it. The 
current TSS is identified by a special register in the 
Intel486 OX microprocessor called the Task State 
Segment Register (TR). This register contains a se- 
lector referring to the task state segment descriptor 
that defines the current TSS. A hidden base and limit 
register associated with TR are loaded whenever TR 
is loaded with a new selector. Returning from a task 
is accomplished by the IRET instruction. When IRET 
is executed, control is returned to the task which 
was interrupted. The current executing task's state 
is saved in the TSS and the old task state is restored 
from its TSS. 


Several bits in the flag register and machine status 
word (CRO) give information about the state of a 
task which are useful to the operating system. The 
Nested Task (NT) (bit 14 in EFLAGS) controls the 
function of the IRET instruction. If NT = 0, the IRET 
instruction performs the regular return; when NT = 
1, IRET performs a task switch operation back to the 
previous task. The NT bit is set or reset in the follow- 
ing fashion: 
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When a CALL or INT instruction initiates a task 
switch, the new TSS will be marked busy and the 
back link field of the new TSS set to the old TSS 
selector. The NT bit of the new task is set by CALL 
or INT'initiated task switches. An interrupt that does 
not cause a task switch will clear NT. (The NT bit will 
be restored after execution of the interrupt handler) 
NT may also be set or cleared by POPF or IRET 
instructions. 


The Intel486 OX Microprocessor Task State Seg- 
ment is marked busy by changing the descriptor type 
field from TYPE 9H to TYPE BH. An 80286 TSS is 
marked busy by changing the descriptor type field 
from TYPE 1 to TYPE 3. Use of a selector that refer- 
ences a busy task state segment causes an excep- 
tion 13. 


The Virtual Mode (VM) bit 17 is used to indicate if a 
task, is a virtual 8086 task. If VM = 1, then the tasks 
will use the Real Mode addressing mechanism. The 
virtual 8086 environment is only entered and exited 
via a task switch (see Section 4.6 Virtual Mode). 


The FPU's state is not automatically saved when a 
task switch occurs, because the incoming task may 
not use the FPU.The Task Switched (TS) Bit (bit 3 in 
the CRO)helps deal with the FPU's state in a multi- 
tasking environment. Whenever the Intel486 OX Mi- 
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croprocessor switches tasks, it sets the TS bit. The 
Intel486 OX Microprocessor detects the first use of a 
processor extension instruction after a task switch 
and causes the processor extension not available 
exception 7. The exception handler for exception 7 
may then decide whether to save the state of the 
FPU. A processor extension not present exception 
(7) will occur when attempting to execute a Floating 
Point or WAIT instruction if the Task Switched and 
Monitor coprocessor extension bits are both set (Le. 
TS = 1 and MP = 1). 


The T bit in the Intel486 OX Microprocessor TSS 
indicates that the processor should generate a de- 
bug exception when switching to a task. If T = 1 
then upon entry to a new task a debug exception 1 
will be generated. 


4.4.7 
INITIALIZATION 
AND TRANSITION 
TO 
PROTECTED 
MODE 


Since the Intel486 OX Microprocessor begins exe- 
cuting in Real Mode immediately after RESET it is 
necessary to initialize the system tables and regis- 
ters with the appropriate values. 


The GOT and lOT registers must refer to a valid GOT 
and lOT. The lOT should be at least 256 bytes long, 
and GOT must contain descriptors for the initial 
code, and data segments. Figure 4.17 shows the ta- 
bles and Figure 4.18 the descriptors needed for a 
simple Protected Mode Intel486 OX Microprocessor 
system. It has a single code and single data/stack 
segment each four gigabytes long and a single privi- 
lege level PL = O. 


The actual method of enabling Protected Mode is to 
load CROwith the PE bit set, via the MOV CRO,R/M 
instruction. This puts the Intel486 OX Microproces- 
sor in Protected Mode. 


After enabling Protected Mode, the next instruction 
should execute an intersegment JMP to load the CS 
register and flush the instruction decode queue. The 
final step is to load all of the data segment registers 
with the initial selector values. 


An alternate approach to entering Protected Mode 
which is especially appropriate for multi-tasking op- 
erating systems, is to use the built in task-switch to 
load all of the registers. In this case the GOT would 
contain two TSS descriptors in addition to the code 
and data descriptors needed for the first task. The 
first JMP instruction in Protected Mode would jump 
to the TSS causing a task switch and loading all of 
the registers with the values stored in the TSS. The 
Task State Segment Register should be initialized to 
point to a valid TSS descriptor since a task switch 
saves the state of the current task in a task state 
segment. 
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4.4.8 
TOOLS 
FOR BUILDING 
PROTECTED 
SYSTEMS 


In order to simplify the design of a protected multi- 
tasking system, Intel provides a tool which allows 
the system designer an easy method of constructing 
the data structures needed for a Protected Mode 
Intel486 OX microprocessor system. This tool is the 
builder BLD-386. BLD-386 lets the operating system 
writer specify all of the segment descriptors dis- 
cussed in the previous sections (LOTs, lOTs, GDTs, 
Gates, and TSSs) in a high-level language. 


Paging is another type of memory management 
useful for virtual memory multitasking operating sys- 
tems. Unlike segmentation which modularizes pro- 
grams and data into variable length segments, pag- 
ing divides programs into multiple uniform size 
pages. Pages bear no direct relation to the logical 


• 


structure 
of a program. 
While segment 
selectors 
can 
be 
considered 
the 
logical 
"name" 
of 
a program 
module 
or data structure, 
a page most likely corre- 
sponds 
to only a portion 
of a module 
or data struc- 
ture. 


By taking advantage 
of the locality 
of reference 
dis- 
played 
by most 
programs, 
only 
a small 
number 
of 
pages from each active task need be in memory 
at 
anyone 
moment. 


4.5.2.1 
Page Mechanism 


The Intel486 
OX Microprocessor 
uses two levels of 
tables 
to translate 
the linear address 
(from the seg- 
mentation 
unit) into a physical 
address. 
There 
are 
three 
components 
to the paging 
mechanism 
of the 
Intel486 
OX Microprocessor: 
the page directory, 
the 
page 
tables, 
and the 
page 
itself 
(page 
frame). 
All 
memory-resident 
elements 
of the Intel486 
OX Micro- 
processor 
paging 
mechanism 
are 
the 
same 
size, 
namely, 
4 Kbytes. 
A uniform 
size for all of the ele- 
ments simplifies 
memory 
allocation 
and reallocation 
schemes, 
since 
there 
is no problem 
with 
memory 
fragmentation. 
Figure 
4.19 
shows 
how 
the 
paging 
mechanism 
works. 
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4.5.2.2 
Page Descriptor 
Base Register 


CR2 
is the 
Page 
Fault 
Linear 
Address 
register. 
It 
holds the 32-bit linear address which caused the last 
page fault detected. 


CR3 is the 
Page Directory 
Physical 
Base 
Address 
Register. 
It contains 
the physical 
starting 
address 
of 
the 
Page 
Directory. 
The 
lower 
12 bits of CR3 
are 
always zero to ensure that the Page Directory 
is al- 


ways page aligned. 
Loading 
it via a MOV CR3, reg 
instruction 
causes the Page Table Entry cache to be 
flushed, 
as will a task switch 
through 
a TSS which 
changes 
the value of CRO. (See 4.5.5 Translation 
Lookaside 
Buffer). 


4.5.2.3 
Page Directory 


The Page Directory 
is 4 Kbytes long and allows up to 
1024 
Page 
Directory 
Entries. 
Each 
Page 
Directory 
Entry contains 
the address 
of the next level 
of ta- 
bles, 
the 
Page 
Tables 
and 
information 
about 
the 
page table. The contents 
of a Page Directory 
Entry 
are shown 
in Figure 4.20. The upper 
10 bits of the 
linear address 
(A22-A31) 
are used as an index to 
select 
the correct 
Page Directory 
Entry. 


CR1 


CR2 


CR3 


CONTROL 
REGISTERS 


OS 
P 
P 
U 
R 
PAGE TABLE ADDRESS 
31 ..12 
RESERVED 
0 
0 
0 
A 
C 
W - - 
P 
0 
T 
S 
W 


os 
P 
P 
U 
R 
PAGE FRAME ADDRESS 31..12 
RESERVED 
0 
0 
D 
A 
C 
W - - 
P 
D 
T 
S 
W 


Figure 4.21. Page Table Entry (Points to Page) 


The (User/Supervisor) U/S bit 2 and the (Read/ 
Write) R/W bit 1 are used to provide protection attri- 
butes for individual pages. 
Each Page Table is 4 Kbytes and holds up to 1024 
Page Table Entries. Page Table Entries contain the 
starting address of the page frame and statistical 
information about the page (see Figure 4.21). Ad- 
dress bits A12-A21 are used as an index to select 
one of the 1024 Page Table Entries. The 20 upper- 
bit page frame address is concatenated with the 
lower 12 bits of the linear address to form the physi- 
cal address. Page tables can be shared between 
tasks and swapped to disks. 


4.5.2.5 
Page Directory/Table 
Entries 


The lower 12 bits of the Page Table Entries and 
Page Directory Entries contain statistical information 
about pages and page tables respectively. The P 
(Present) bit 0 indicates if a Page Directory or Page 
Table entry can be used in address translation. If 
P = 1 the entry can be used for address translation 
if P = 0 the entry can not be used for translation, 
and all of the other bits are available for use by the 
software. For example the remaining 31 bits could 
be used to indicate where on the disk the page is 
stored. 


The A (Accessed) bit 5, is set by the Intel486 DX 
microprocessor for both types of entries before a 
read or write access occurs to an address covered 
by the entry. The D (Dirty) bit 6 is set to 1 before a 
write to an address covered by that page table entry 
occurs. The D bit is undefined for Page Directory 
Entries. When the p. A and D bits are updated by the 
Intel486 DX microprocessor, the processor gener- 
ates a Read-Modify-Write cycle which locks the bus 
and prevents conflicts with other processors or per- 
pherials. Software which modifies these bits should 
use the LOCK prefix to ensure the integrity of the 
page tables in multi-master systems. 


The 3 bits marked OS Reserved 
in Figure 4.20 and 
Figure 4.21 (bits 9-11) are software definable. OSs 
are free to use these bits for whatever purpose they 
wish. An example use of the OS 
Reserved 
bits 
would be to store information about page aging. By 
keeping track of how long a page has been in mem- 
ory since being accessed, an operating system can 
implement a page replacement algorithm like Least 
Recently Used. 


4.5.3 
PAGE LEVEL 
PROTECTION 
(R/W, U/S BITS) 


The Intel486 DX microprocessor provides a set of 
protection attributes for paging systems. The paging • 
mechanism distinguishes between two levels of pro- 
tection: User which corresponds to level 3 of the 
segmentation 
based 
protection, 
and 
supervisor 
which encompasses all of the other protection levels 
(0, 1,2). 


The R/W and U/S bits are used in conjunction with 
the WP bit in the flags register (EFLAGS). The In- 
tel386 microprocessor does not contain the WP bit. 
The WP bit has been added to the Intel486 DX mi- 
croprocessor to protect read-only pages from super- 
visor write accesses. The Intel386 microprocessor 
allows a read-only page to be written from protection 
levels 0, 1 or 2. WP= 0 is the Intel386 microproces- 
sor compatible mode. When WP= 0 the supervisor 
can write to a read-only page as defined by the U/S 
and R/W bits. When WP= 1 supervisor access to a 
read-only page (R/W = 0) will cause a page fault (ex- 
ception 14). 


Table 4.4 shows the affect of the WP, U/S and R/W 
bits on accessing memory. When WP=O. the super- 
visor can write to pages regardless of the state of 
the R/W bit. When WP= 1 and R/W = 0 the supervi- 
sor cannot write to a read-only page. A user attempt 
to access a supervisor only page (U/S = 0), or write 
to a read only page will cause a page fault (excep- 
tion 14). 


The R/W and U/S bits provide protection from user 
access on a page by page basis since the bits are 
contained in the Page Table Entry and the Page Di- 
rectory Table. The U/S and R/W bits in the first level 
Page Directory Table apply to all entries in the page 
table pointed to by that directory entry. The U/S and 
R/W bits in the second level Page Table Entry apply 
only to the page described by that entry. The most 
restrictive of the U/S and R/W bits from the Page 
Directory Table and the Page Table Entry are used 
to address a page. 


Example: If the U/S and R/W bits for the Page Di- 
rectory entry were 10 (user read/execute) and the 


U/S and R/W 
bits for the Page Table Entry were 01 
(no user 
access 
at all), the 
access 
rights 
for 
the 
page 
would 
be 01, the 
numerically 
smaller 
of the 
two. 


Note that a given segment 
can be easily made read- 
only for level 0, 1 or 2 via use of segmented 
protec- 
tion mechanisms. 
(Section 
4.4 Protection). 


4.5.4 
PAGE 
CACHEABILITY 
(PWT 
AND 
PCD BITS) 


PWT (page write through) 
and PCD (page cache dis- 
able) are two new bits defined 
in entries 
in both lev- 
els of the page table 
structure, 
the Page Directory 
Table and the Page Table Entry. PCD and PWT con- 
trol page cacheability 
and write policy. 


PWT controls 
write policy. 
PWT= 
1 defines 
a write- 
through 
policy for the current 
page. PWT = 0 allows 
the possibility 
of write-back. 
PWT is ignored 
internal- 
ly because 
the 
Intel486 
DX microprocessor 
has a 
write-through 
cache. 
PWT can 
be used 
to control 
the write policy of a second 
level cache. 


PCD controls 
cacheability. 
PCD = 0 enables 
caching 
in the on-chip 
cache. 
PCD alone 
does 
not enable 
caching, 
it must be conditioned 
by the KEN # (cache 
enable) 
input signal and the state of the CD (cache 
disable 
bit) and NW (no write-through) 
bits in control 
register 
0 (CRO). When PCD = 1, caching 
is disabled 
regardless 
of the state of KEN #, CD and NW. (See 
Section 
5.0, On-Chip 
Cache). 


The state of the PCD and PWT bits are driven out on 
the PCD and PWT pins during a memory 
access. 


The PWT and PCD bits for a bus cycle are obtained 
either from control 
register 
3 (CR3), the Page Direc- 
tory Entry or the Page Table Entry, depending 
on the 
type of cycle run. However, 
when paging is disabled 
(PG = 
0 in CRO) or for cycles which 
bypass 
paging 
(Le., I/O 
(input/output) 
references, 
INTR 
(interrupt 
request) 
and HALT cycles), 
the PCD and PWT bits 
of CR3 are ignored. 
The Intel486 
DX CPU assumes 
PCD = 0 and PWT = 0 and drives these values on 
the PCD and PWT pins. 
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When 
paging 
is enabled 
(PG = 1 in CRO), the 
bits 
from the page table entry are cached 
in the transla- 
tion lookaside 
buffer 
(TLB), and are driven 
any time 
the page 
mapped 
by the TLB 
entry 
is referenced. 
For normal 
memory 
cycles 
run with paging enabled, 
the PWT and PCD bits are taken from the Page Ta- 
ble Entry. During TLB refresh 
cycles when the Page 
Directory 
and Page Table entries 
are read, the PWT 
and PCD bits must be obtained 
elsewhere. 
The bits 
are taken from CR3 when a Page Directory 
Entry is 
being read. The bits are taken from the Page Direc- 
tory Entry when the Page Table Entry is being updat- 
ed. 


The PCD or PWT bits in CR3 are initialized 
to zero at 
reset, 
but can be set to any value 
by level 0 soft- 
ware. 


The Intel486 
DX Microprocessor 
paging hardware 
is 
designed 
to support 
demand 
paged 
virtual 
memory 
systems. 
However, 
performance 
would 
degrade 
substantially 
if the processor 
was required 
to access 
two levels of tables for every memory 
reference. 
To 
solve this problem, 
the Intel486 
DX Microprocessor 
keeps a cache of the most recently 
accessed 
pages, 
this cache is called the Translation 
Lookaside 
Buffer 
(TLB). The TLB is a four-way 
set associative 
32-en- 
try page table cache. It automatically 
keeps the most 
commonly 
used 
Page Table 
Entries 
in the proces- 
sor. The 32-entry 
TLB coupled 
with a 4K page size, 
results 
in coverage 
of 
128 Kbytes 
of memory 
ad- 
dresses. 
For many common 
multi-tasking 
systems, 
the 
TLB 
will 
have 
a hit rate 
of 
about 
98%. 
This 
means 
that the processor 
will only have to access 
the two-level 
page structure 
on 2% 
of all memory 
references. 
Figure 4.22 illustrates 
how the TLB com- 
plements 
the Intel486 
DX Microprocessor's 
paging 
mechanism. 


Reading 
a new entry into the TLB (TLB refresh) 
is a 
two step process 
handled 
by the Intel486 
DX micro- 
processor 
hardware. 
The sequence 
of data cycles to 
perform 
a TLB refresh 
are: 


U/S 
R/W 
WP 
User Access 
Supervisor 
Access 


0 
0 
0 
None 
Read/Write/Execute 
0 
1 
0 
None 
Read/Write/Execute 
1 
0 
0 
Read/Execute 
Read/Write/Execute 
1 
1 
0 
Read/Write/Execute 
Read/Write/Execute 
0 
0 
1 
None 
Read/Execute 
0 
1 
1 
None 
Read/Write/Execute 
1 
0 
1 
Read/Execute 
Read/Execute 
1 
1 
1 
Read/Write/Execute 
Read/Write/Execute 


1. 
Read the correct Page Directory Entry, as point- 
ed to by the page base register and the upper 
10 bits of the linear address. The page base 
register is in control register 3. 
1a. Optionally perform a locked read/write to set 
the accessed bit in the directory entry. The di- 
rectory entry will actually get read twice if the 
Intel486 OX microprocessor needs to set any of 
the bits in the entry. If the page directory entry 
changes between the first and second reads, 
the data returned for the second read will be 
used. 
2. 
Read the correct entry in the Page Table and 
place the entry in the TLB. 
2a. Optionally perform a locked read/write to set 
the accessed and/or dirty bit in the page table 
entry. Again, note that the page table entry will 
actually get read twice if the Intel486 OX micro- 
processor needs to set any of the bits in the 
entry. Like the directory entry, if the data chang- 
es between the first and second read the data 
returned for the second read will be used. 


Note that the directory entry must always be read 
into the processor, since directory entries are never 
placed in the paging TLB. Page faults can be sig- 
naled from either the page directory read or the 
page table read. Page directory and page table en- 
tries may be placed in the Intel486 OX on-chip 
cache just like normal data. 


TRANSLATION 
LOOKASIDE 
BUffER 


MISS 


31 
0 


PAGE 
TABLE 


.98% 
HIT RATE 


Figure 4.22. Translation 
Lookaslde 
Buffer 


The paging hardware operates in the following fash- 
ion. The paging unit hardware receives a 32-bit lin- 
ear address from the segmentation unit. The upper 
20 linear address bits are compared with all 32 en- 
tries in the TLB to determine if there is a match. If 
there is a match (i.e., a TLB hit), then the 32-bit 
physical address is calculated and will be placed on 
the address bus. 


However, if the page table entry is not in the TLB, 
the Intel486 OX Microprocessor will read the appro- 
priate Page Directory Entry. If P = 1 on the Page 
Directory Entry indicating that the page table is in 
memory, then the Intel486 OX Microprocessor will 
read the appropriate Page Table Entry and set the 
Access bit. If P = 1 on the Page Table Entry indicat- 
ing that the page is in memory, the Intel486 OX Mi- 
croprocessor will update the Access and Dirty bits 
as needed and fetch the operand. The upper 20 bits 
of the linear address, read from the page table, will 
be stored in the TLB for future accesses. However, if 
P = 0 for either the Page Directory Entry or the 
Page Table Entry, then the processor will generate a 
page fault, an Exception 14. 


The processor will also generate an exception 14 
page fault, if the memory reference violated the 
page protection attributes (Le., U/S or R/W) (e.g., 
trying to write to a read-only page). CR2 will hold the 
linear address which caused the page fault. If a sec- 
ond page fault occurs, while the processor is at- 
tempting to enter the service routine for the first, 
then the processor will invoke the page fault (excep- 
tion 14) handler a second time, rather than the dou- 
ble fault (exception 8) handler. Since Exception 14 is 
classified as a fault, CS: EIP will point to the instruc- 
tion causing the page fault. The 16-bit error code 
pushed as part of the page fault handler will contain 
status bits which indicate the cause of the page 
fault. 


The 16-bit error code is used by the operating sys- 
tem to determine how to handle the page fault. Fig- 
ure 4.23a shows the format of the page-fault error 
code and the interpretation of the bits. 


NOTE: 
Even though the bits in the error code (U/S, W/R, 
and P) have similar names as the bits in the Page 
Directory/Table Entries, the interpretation of the er- 
ror code bits is different. Figure 4.23b indicates 
what type of access caused the page fault. 


U 
U U U U U U U U U U U U U U 
W P 
S R 


Figure 4.23a. Page Fault Error Code Format 


U/S: The U/S bit indicates whether the access 
causing the fault occurred when the processor was 
executing in User Mode (U/S = 1) or in Supervisor 
mode (U/S = 0). 


W/R: The W/R bit indicates whether the access 
causing the fault was a Read (W/R = 0) or a Write 
(W/R = 1). 
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P: The P bit indicates whether a page fault was 
caused by a not-present page (P = 0), or by a page 
level protection violation (P = 1). 


U/S 
W/R 
Access Type 


0 
0 
Supervisor' Read 
0 
1 
Supervisor Write 
1 
0 
User Read 
1 
1 
User Write 


·Descnptor 
table 
access 
will fault with 
U/S 
~ 
0, even 
if the program 
is executing 
at level 
3. 
Figure 4.23b. Type of Access 
Causing Page Fault 


4.5.7 
OPERATING 
SYSTEM 
RESPONSIBILITIES 


The Intel486 OX Microprocessor takes care of the 
page address translation process, relieving the bur- 
den from an operating system in a demand-paged 
system. The operating system is responsible for set- 
ting up the initial page tables, and handling any page 
faults. The operating system also is required to inval- 
idate (Le., flush) the TLB when any changes are 
made to any of the page table entries. The operating 
system must reload CR3 to cause the TLB to be 
flushed. 


Setting up the tables is simply a matter of loading 
CR3 with the address of the Page Directory, and 
allocating space for the Page Directory and the 
Page Tables. The primary responsibility of the oper- 
ating system is to implement a swapping policy and 
handle all of the page faults. 


A final concern of the operating system is to ensure 
that the TLB cache matches the information in the 
paging tables. In particular, any time the operating 
system sets the P present bit of page table entry to 
zero, the TLB must be flushed. Operating systems 
may want to take advantage of the fact that CR3 is 
stored as part of a TSS, to give every task or group 
of tasks its own set of page tables. 


The Intel486 OX Microprocessor allows the execu- 
tion of 8086 application programs in both Real Mode 
and in the Virtual 8086 Mode (Virtual Mode). Of the 
two. methods, Virtual 8086 Mode offers the system 
deSigner the most flexibility. The Virtual 8086 Mode 
allows the execution of 8086 applications, while still 
allowing the system designer to take full advantage 
of the Intel486 OX Microprocessor protection mech- 


anism. In particular, the Intel486 OX Microprocessor 
allows the simultaneous execution of 8086 operating 
systems and its applications, and an Intel486 OX Mi- 
croprocessor operating system and both 80286 and 
Intel486 OX Microprocessor applications. Thus, in a 
multi-user Intel486 OX Microprocessor computer, 
one person could be running an MS·DOS spread- 
sheet, another person using MS-DOS, and a third 
person could be running multiple Unix utilities and 


. applications. Each person in this scenario would be- 
lieve that he had the computer completely to him- 
self. Figure 4.24 illustrates this concept. 


4.6.2 
VIRTUAL 
8086 MODE 
ADDRESSING 
MECHANISM 


One of the major differences between Intel486 OX 
Microprocessor Real and. Protected modes is how 
the segment selectors are interpreted. When the 
processor is executing in Virtual 8086 Mode the seg- 
ment registers are used in an identical fashion to 
Real Mode. The contents of the segmel'lt register is 
shifted left 4 bits and added to the offset to form the 
segment base linear address. 


The Intel486 OX Microprocessor allows the operat- 
ing system to specify which programs use the 8086 
style address mechanism, and which programs use 
Protected Mode addressing, on a per task basis. 
Through the use of paging, the one megabyte ad- 
dress space of the Virtual Mode task can be mapped 
to anywhere in the 4 gigabyte linear address space 
of the Intel486 OX Microprocessor. Like Real Mode, 
Virtual Mode effective addresses (Le., segment off- 
sets) that exceed 64 Kbyte will cause an exception 
13. However, these restrictions should not prove to 
be important, because most tasks running in Virtual 
8086 Mode will simply be existing 8086 application 
programs. 


The pa.ging~ardware allows the concurrent running 
of multiple Virtual Mode tasks, and provides protec- 
tion and operating system isolation. Although it is 
not strictly necessary to have the paging hardware 
enabled to run Virtual Mode tasks, it is needed in 
order to run multiple Virtual Mode tasks or to relo- 
cate the address space of a Virtual Mode task to 
physical address space greater than one megabyte. 


The paging hardware allows the 20-bit linear ad- 
dress produced by a Virtual Mode program to be 
divided into up to 256 pages. Each one of the pages 
can be located anywhere within the maximum 4 gig- 
abyte physical address space of the Intel486 OX Mi- 
croprocessor. In addition, since CR3 (the Page Di- 
rectory. Base Register) is loaded by a task switch, 
each Virtual Mode task can use a different mapping 
scheme to map pages to different physical locations. 


Finally, the paging hardware allows the sharing of 
the 8086 operating system code between multiple 
8086 applications. Figure 4.24 shows how the In- 
tel486 OX Microprocessor paging hardware enables 
multiple 8086 programs to run under a virtual memo- 
ry demand paged system. 


4.6.4 
PROTECTION 
AND 1/0 
PERMISSION 
BITMAP 


All Virtual 8086 Mode programs execute at privilege 
level 3, the level of least privilege. As such, Virtual 
8086 Mode programs are subject to all of the protec- 
tion checks defined in Protected Mode. (This is dif- 
ferent from Real Mode which implicitly is executing 
at privilege level 0, the level of greatest privilege.) 
Thus, an attempt to execute a privileged instruction 
when in Virtual 8086 Mode will cause an exception 
13 fault. 


The following are privileged instructions, which may 
be executed only at Privilege Level O. Therefore, at- 
tempting to execute these instructions in Virtual 
8086 Mode (or anytime CPL > 0) causes an excep- 
tion 13 fault: 


LIDT; 
LGDT; 
LMSW; 
CLTS; 
HLT; 


MOV DRn,reg; 
MOV TRn,reg; 
MOV CRn,reg; 


MOV reg,DRn; 
MOV reg,TRn ; 
MOV reg,CRn. 


Several instructions, particularly those applying to 
the multitasking model and protection model, are 
available only in Protected Mode. Therefore, at· 
tempting to execute the following instructions in 
Real Mode or in Virtual 8086 Mode generates an 
exception 6 fault: 


STR; 
SLDT; 
VERR; 
VERW; 


The instructions which are IOPL-sensitive in Protect- 
ed Mode are: 


IN; 
STI; 
OUT; 
CLI 
INS; 
OUTS; 
REP INS; 
REP OUTS; 


• 
TASK 1 
.8086 
OS 
MEMORY 
MEMORY 


I?77J TASK 2 
~ 
Inlol386™ 
CPU OS 


I(.(Q 
MEMORY 
~ 
MEMORY 


LTR; 
LLDT; 
LAR; 
LSL; 
ARPL. 


In Virtual 8086 Mode, a slightly different set of in- 
structions are made 10PL-sensitive.The following in- 
structions are 10PL-sensitive in Virtual 8086 Mode: 
lNT n; 
5Tl; 


PU5HF; 
eLl; 


POPF; 
lRET 


The PUSHF, POPF, and IRET instructions are 10PL- 
sensitive in Virtual 8086 Mode only. This provision 
allows the IF flag (interrupt enable flag) to be virtual- 
ized to the Virtual 8086 Mode program. The INT n 
software interrupt instruction is also 10PL-sensitive 
in Virtual 8086 Mode. Note, however, that the INT 3 
(opcode OCCH),INTO, and BOUND instructions are 
not 10PL-sensitive in Virtual 8086 mode (they aren't 
10PL sensitive in Protected Mode either). 


Note that the I/O instructions (IN, OUT, INS, OUTS, 
REP INS, and REP OUTS) are not 10PL-sensitivein 
Virtual 8086 mode. Rather, the I/O instructions be- 
come automatically sensitive to the 1/0 Permission 
Bitmap contained in the Intel486 
OX Microproces- 
sor Task 
State 
Segment. 
The I/O Permission Bit- 
map, automatically used by the Intel486 OX micro- 
processor in Virtual 8086 Mode, is illustrated by Fig- 
ures 4.15a and 4.15b. 


The I/O Permission Bitmap can be viewed as a 0- 
64 Kbit bit string, which begins in memory at offset 
BiLMap_Offset 
in the current TSS. BiLMap_ 
Offset must be ::;;OFFFH so the entire bit map and 
the byte FFH which follows the bit map are all at 
offsets ::;; FFFFH from the TSS base. The 16-bit 
pointer BiLMap_Offset 
(15:0) is found in the word 
beginning at offset 66H (102 decimal) from the TSS 
base, as shown in Figure 4.15a. 


Each bit in the I/O Permission Bitmap corresponds 
to a single byte-wide I/O port, as illustrated in Figure 
4.15a. If a bit is 0, I/O to the corresponding byte- 
wide port can occur without generating an excep- 
tion. Otherwise the I/O instruction causes an excep- 
tion 13 fault. Since every byte-wide I/O port must be 
protectable, all bits corresponding to a word-wide or 
dword-wide port must be a for the word-wide or 
dword-wide I/O to be permitted. If all the referenced 
bits are 0, the I/O will be allowed. If any referenced 
bits are 1, the attempted I/O will cause an exception 
13 fault. 


Due to the use of a pointer to the base of the I/O 
Permission Bitmap, the bitmap may be located any- 
where within the TSS, or may be ignored completely 
by pointing the BiLMap_Offset 
(15:0) beyond the 
limit of the TSS segment. In the same manner, only 
a small portion of the 64K I/O space need have an 
associated map bit, by adjusting the TSS limit to 
truncate the bitmap. This eliminates the commitment 
of 8K of memory when a complete bitmap is not 


required, while allowing the fully general case if de- 
sired. 


EXAMPLE OF BITMAP FOR I/O PORTS 0-255: 
Setting the TSS limit to {biLMap_Offset 
+ 31 
+ 1"1 
[•• see note belowl will allow a 32-byte bit- 


map for the I/O ports #0-255, 
plus a terminator 
byte of all 1's [•• see note belowl. This allows the 
I/O bitmap to control I/O Permission to I/O port 0- 
255 while causing an exception 13 fault on attempt- 
ed I/O to any I/O port 80256 through 65,565. 


"IMPORTANT 
IMPLEMENTATION 
NOTE: Beyond 
the last byte of I/O mapping information in the I/O 
Permission Bitmap must be a byte containing all 1's. 
The byte of all 1's must be within the limit of the 
Intel486 microprocessor TSS segment (see Figure 
4.15a). 


In order to fully support the emulation of an 8086 
machine, interrupts in Virtual 8086 Mode are han- 
dled in a unique fashion. When running in Virtual 
Mode all interrupts and exceptions involve a privi- 
lege change back to the host Intel486 OX micro- 
processor operating system. The Intel486 OX micro- 
processor operating system determines if the inter- 
rupt comes from a Protected Mode application or 
from a Virtual Mode program by examining the VM 
bit in the EFLAGS image stored on the stack. 


When a Virtual Mode program is interrupted and ex- 
ecution passes to the interrupt routine at level 0, the 
VM bit is cleared. However, the VM bit is still set in 
the EFLAG image on the stack. 


The Intel486 OX operating system in turn handles 
the exception or interrupt and then returns control to 
the 8086 program. The Intel486 OX operating sys- 
tem may choose to let the 8086 operating system 
handle the interrupt or it may emulate the function of 
the interrupt handler. For example, many 8086 oper- 
ating system calls are accessed by PUSHing param- 
eters on the stack, and then executing an INT n in- 
struction. If the 10PLis set to a then all INT n instruc- 
tions will be intercepted by the Intel486 OX operat- 
ing system. The Intel486 OX operating system could 
emulate the 8086 operating system's call. Figure 
4.25 shows how the Intel486 OX operating system 
could intercept an 8086 operating system's call to 
"Open a File". 


A Intel486 OX operating system can provide a Virtu- 
al 8086 Environment which is totally transparent to 
the application software via intercepting and then 
emulating 8086 operating system's calls, and inter- 
cepting IN and OUT instructions. 


4.6.6 ENTERING AND LEAVING VIRTUAL 
8086 MODE 


Virtual 8086 mode is entered by executing an IRET 
instruction (at CPL= 0), or Task Switch (at any CPL) 
to a Intel486 DX task whose Intel486 DX TSS has a 
FLAGS image containing a 1 in the VM bit position 
while the processor is executing in Protected Mode. 
That is, one way to enter Virtual 8086 mode is to 
switch to a task with a Intel486 DX TSS that has a 1 
in the VM bit in the EFLAGS image. The other way is 
to execute a 32-bit IRET instruction at privilege level 
0, where the stack has a 1 in the VM bit in the 
EFLAGS image. POPF does not affect the VM bit, 
even if the processor is in Protected Mode or level 0, 
and so cannot be used to enter Virtual 8086 Mode. 
PUSHF always pushes a 0 in the VM bit, even if the 
processor is in Virtual 8086 Mode, so that a program 
cannot tell if it is executing in REAL mode, or in Vir· 
tual 8086 mode. 


The VM bit can be set by executing an IRET instruc- 
tion only at privilege level 0, or by any instruction or 
Interrupt which causes a task switch in Protected 
Mode (with VM= 1 in the new FLAGS image), and 
can be cleared only by an interrupt or exception in 
Virtual 8086 Mode. IRET and POPF instructions exe- 
cuted in REAL mode or Virtual 8086 mode will not 
change the value in the VM bit. 


The transition out of virtual 8086 mode to Intel486 
DX protected mode occurs only on receipt of an in- 
terrupt or exception (such as due to a sensitive in- 
struction). In Virtual 8086 mode, all interrupts and 
exceptions vector through the protected mode IDT, 
and enter an interrupt handler in protected Intel486 
DX mode. That is, as part of interrupt processing, 
the VM bit is cleared. 


Because the matching IRET must occur from level 0, 
if an Interrupt or Trap Gate is used to field an inter- 
rupt or exception out of Virtual 8086 mode, the Gate 
must perform an inter-level interrupt only to level O. 
Interrupt or Trap Gates through conforming seg- 
ments, or through segments with DPL> 0, will raise a 
GP fault with the CS selector as the error code. 


4.6.6.1 Task Switches To/From 
Virtual 
8086 Mode 


Tasks which can execute in virtual 8086 mode must 
be described by a TSS with the Intel486 DX Micro- 
processor format (TYPE 9 or 11 descriptor). 


A task switch out of virtual 8086 mode will operate 
exactly the same as any other task switch out of a 
task with an Intel486 DX TSS. All of the programmer 
visible state, including the FLAGS register with the 
VM bit set to 1, is stored in the TSS. 


The segment registers in the TSS will contain 8086 
segment base values rather than selectors. 


A task switch into a task described by a Intel486 DX 
TSS will have an additional check to determine if the 
incoming task should be resumed in virtual 8086 
mode. Tasks described by 80286 format TSSs can· 
not be resumed in virtual 8086 mode, so no check is 
required there (the FLAGS image in 80286 format 
TSS has only the low order 16 FLAGS bits). Before 
loading the segment register images from a Intel486 
DX TSS, the FLAGS image is loaded, so that the 
segment registers are loaded from the TSS image 
as 8086 segment base values. The task is now 
ready to resume in virtual 8086 execution mode. 


4.6.6.2 Transitions 
Through Trap and Interrupt 
Gates, and IRET 


A task switch is one way to enter or exit virtual 8086 
mode. The other method is to exit through a Trap or 
Interrupt gate, as part of handling an interrupt, and 
to enter as part of executing an IRET instruction. 
The transition out must use a Intel486 DX Micro- 
processor Trap Gate (Type 14), or Intel486 DX Inter- 
rupt Gate (Type 15), which must point to a non-con- 
forming level 0 segment (DPL= 0) in order to permit 
the trap handler to IRET back to the Virtual 8086 
program. The Gate must point to a non-conforming 
level 0 segment to perform a level switch to level 0 
so that the matching IRET can change the VM bit. 
Intel486 DX gates must be used, since 80286 gates 
save only the low 16 bits of the FLAGS register, so 
that the VM bit will not be saved on transitions 
through the 80286 gates. Also, the 16-bit IRET (pre- 
sumably) used to terminate the 80286 interrupt han- 
dier will pop only the lower 16 bits from FLAGS, and 
will not affect the VM bit. The action taken for a 
Intel486 DX Trap or Interrupt gate if an interrupt oc- 
curs while the task is executing in virtual 8086 mode 
is given by the following sequence. 
(1) Save the FLAGS register in a temp to push later. 


Turn off the VM and TF bits, and if the interrupt 
is serviced by an Interrupt Gate, turn off IF also. 


(2) Interrupt and Trap gates must perform a level 
switch from 3 (where the VM86 program exe- 
cutes) to level 0 (so IRET can return). This pro- 
cess involves a stack switch to the stack given 
in the TSS for privilege level O. Save the Virtual 
8086 Mode SS and ESP registers to push in a 
later step. The segment register load of SS will 
be done as a Protected Mode segment load, 
since the VM bit was turned off above. 
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(3) Push the 8086 segment register values onto the 
new stack, in the order: GS, FS, OS, ES. These 
are pushed as 32·bit quantities, with undefined 
values in the upper 16 bits. Then load these 4 
registers with null selectors (0). 
(4) Push the old 8086 stack pointer onto the new 
stack by pushing the SS register (as 32-bits, high 
bits undefined), then pushing the 32-bit ESP reg· 
ister saved above. 
(5) Push the 32·bit FLAGS register saved in step 1. 
(6) Push the old 8086 instruction pointer onto the 
new stack by pushing the CS register (as 32·bits, 
high bits undefined), then pushing the 32·bit EIP 
register. 


(7) Load up the new CS:EIPvalue from the interrupt 
gate, and begin execution of the interrupt routine 
in protected Intel486 OX Microprocessor mode. 


The transition out of virtual 8086 mode performs a 
level change and stack switch, in addition to chang· 
ing back to protected mode. In addition, all of the 
8086 segment register images are stored on the 
stack (behind the SS:ESP image), and then loaded 


with null (0) selectors before entering the interrupt 
handler. This will permit the handler to safely save 
and restore the OS, ES, FS, and GS registers as 
80286 selectors. This is needed so that interrupt 
handlers which don't care about the mode of the 
interrupted program can use the same prolog and 
epilog code for state saving (Le., push all registers in 
prolog, pop all in epilog) regardless of whether or not 
a "native" mode or Virtual 8086 mode program was 
interrupted. Restoring null selectors to these regis- 
ters before executing the IRET will not cause a trap 
in the interrupt handler. Interrupt routines which ex- 
pect values in the segment registers, or return val- 
ues in segment registers will have to obtain/return 
values from the 8086 register images pushed onto 
the new stack. They will need to know the mode of 
the interrupted program in order to know where to 
find/return segment registers. and also to know how 
to interpret segment register values. 


The IRET instruction will perform the inverse of the 
above sequence. Only the extended Intel486 OX 
IRET instruction (operand size= 32) can be used. 
and must be executed at level 0 to change the VM 
bit to 1. 


(1) If the NT bit in the FLAGs register is on, an inter· 


task return is performed. The current state is 
stored in the current TSS, and the link field in the 
current TSS is used to locate the TSS for the 
interrupted task which is to be resumed. 
Otherwise, 
continue 
with 
the 
following 
se· 
quence. 


(2) Read the FLAGS image from SS:8[ESP] into the 
FLAGS register. This will set VM to the value 
active in the interrupted routine. 
(3) Pop off the instruction pointer CS:EIP. EIP is 


popped first, then a 32-bit word is popped which 
contains the CS value in the lower 16 bits. If 
VM = 0, this CS load is done as a protected 
mode segment load. If VM = 1, this will be done 
as an 8086 segment load. 


(4) Increment the ESP register by 4 to bypass the 
FLAGS image which was "popped" in step 1. 


(5) If VM=1, load segment registers ES, DS, FS, 


and GS from memory locations SS:[ESP+8], 
SS:[ESP+ 12], 
SS:[ESP+ 16], 
and 


SS:[ESP+ 20], respectively, where the new val- 
ue of ESP stored in step 4 is used. Since VM = 1, 
these are done as 8086 segment register loads. 
Else if VM= 0, check that the selectors in ES, 
DS, FS, and GS are valid in the interrupted rou- 
tine. Null out invalid selectors to trap if an at- 
tempt is made to access through them. 


(6) If (RPL(CS) > 
CPL), pop the stack pointer 


SS:ESP from the stack. The ESP register is 
popped first, followed by 32-bits containing SS in 
the lower 16 bits. If VM= 0, SS is loaded as a 
protected mode segment register load. If VM = 1, 
an 8086 segment register load is used. 


(7) Resume execution of the interrupted routine. The 


VM bit in the FLAGS register (restored from the 
interrupt routine's stack image in step 1) deter- 
mines whether the processor resumes the inter- 
rupted routine in Protected mode of Virtual 8086 
mode. 


To meet its performance goals the Intel486 DX2 mi- 
croprocessor contains an eight Kbyte cache. The 
cache is software transparent to maintain binary 
compatibility with previous generations of the In- 
tel386™ /lntel486™ 
architecture. 


The on-chip cache has been designed for maximum 
flexibility and performance. The cache has several 
operating modes offering flexibility during program 
execution and debugging. Memory areas can be de- 
fined as non-cacheable by software and external 
hardware. Protocols for cache line invalidations and 
replacement are implemented in hardware, easing 
system design. 
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The on-chip cache is a unified code and data cache. 
The cache is used for both instruction and data ac- 
cesses and acts on physical addresses. 


The cache organization is 4-way set associative and 
each line is 16 bytes wide. The eight Kbytes of 
cache memory are logically organized as 128 sets, 
each containing four lines. 


The cache memory is physically split into four 
2-Kbyte blocks each containing 128 lines (see Fig- 
ure 5.1). Associated with each 2-Kbyte block are 
128 21-bit tags. There is a valid bit for each line in 
the cache. Each line in the cache is either valid or 
not valid. There are no provisions for partially valid 
lines. 
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The write strategy of on-chip cache is write-through. 
All writes will drive an external write bus cycle in 
addition to writing the in~ormation to the internal 
cache if the write was a cache hit. A write to an 
address not contained in the internal cache will only 
be written to external memory. Cache allocations 
are not made on write misses. 


Control of the cache is provided by the CD and NW 
bits in CRO.CD enables and disables the cache. NW 
controls memory write-through and invalidates. 


The CD and NW bits define four operating modes of 
the on-chip cache as given in Table 5.1. These 
modes provide flexibility in how the on-chip cache is 
used. 


CD 
NW 
Operating 
Mode 


1 
1 
Cache fills disabled, write-through and 
invalidates disabled 
1 
0 
Cache fills disabled, write-through and 
invalidates enabled 
0 
1 
INVALID. IF CROis loaded with this 
configuration of bits, a GP fault with 
error code of 0 is raised. 
0 
0 
Cache fills enabled, write-through and 
invalidates enabled 


CD=1, 
NW=1 
The cache is completely disabled by setting 
CD= 1 and NW= 1 and then flushing the 
cache. This mode may be useful for debug- 
ging programs where it is important to see 
all memory cycles at the pins. Writes which 
hit in the cache will not appear on the exter- 
nal bus. 
It is possible to use the on-chip cache as 
fast static RAM by "pre-loading" 
certain 
memory areas into the cache and then set- 
ting CD= 1 and NW= 1. Pre-loading can be 
done by careful choice of memory refer- 
ences with the cache turned on or by use of 
the testability functions (see Section 8.2). 
When the cache is turned off the memory 
mapped by the cache is "frozen" into the 
cache since fills and invalidates are dis- 
abled. 


CD=1, 
NW=O 
Cache fills are disabled but write-throughs 
and invalidates are enabled. This mode is 
the same as if the KENIF pin was strapped 
HIGH disabling cache fills. Write-throughs 
and invalidates may still occur to keep the 
cache valid. This mode is useful if the soft- 
ware must disable the cache for a short pe- 
riod of time, and then re-enable it without 
flushing the original contents. 
CD=O, NW=1 
INVALID. If CROis loaded with this bit con- 
figuration, a General Protection fault with 
error code of 0 is raised. Note that this 
mode would imply a non-transparent write- 
back cache. A future processor may define 
this combination of bits to implement a 
write-back cache. 
CD=O, NW=O 


This is the normal operating mode. 


Completely disabling the cache is a two step pro- 
cess. First CD and NW must be set to 1 and then the 
cache must be flushed. If the cache is not flushed, 
cache hits on reads will still occur and data will be 
read from the cache. 


Any area of memory can be cached in the Intel486 
DX 
microprocessor. 
Non-cacheable 
portions 
of 
memory can be defined by the external system or by 
software. The external system can inform the In- 
tel486 DX microprocessor that a memory address is 
non-cacheable by returning the KENIF pin inactive 
during a memory access (refer to Section 7.2.3). 
Software can prevent certain pages from being 
cached by setting the PCD bit in the page table en- 
try. 


A read request can be generated from program op- 
eration or by an instruction pre-fetch. The data will 
be supplied from the on-chip cache if a cache hit 
occurs on the read address. If the address is not in 
the cache, a read request for the data is generated 
on the external bus. 


If the read request is to a cacheable portion of mem- 
ory, the 
Intel486 DX microprocessor 
initiates a 
cache line fill. During a line fill a 16-byte line is read 
into the Intel486 DX microprocessor. 


Cache fills will only be generated for read misses. 
Write misses will never cause a line in the internal 
cache to be allocated. If a cache hit occurs on a 
write, the line will be updated. 


Cache line fills can be performed over 8- and 16-bit 
busses using the dynamic bus sizing feature. Refer 
to Section 7.1.3 for a description of dynamic 
bus 
sizing. 


Refer to Section 7.2.3 for further information on 
cacheable cycles. 


The Intel486 OX microprocessor contains both a 
hardware and software mechanism for invalidating 
lines in its internal cache. Cache line invalidations 
are needed to keep the Intel486 OX microproces- 
sor's cache contents consistent with external mem- 
ory. 


Refer to Section 7.2.8 for further information on 
cache line invalidations. 


When a line needs to be placed in its internal cache 
the Intel486 OX microprocessor first checks to see if 
there is a non-valid line in the set that can be re- 
placed. If all four lines in the set are valid, a pseudo 
least-recently-used mechanism is used to determine 
which line should be replaced. 


A valid bit is associated with each line in the cache. 
When a line needs to be placed in a set, the four 
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valid bits are checked to see if there is a non-valid 
line that can be replaced. If a non-valid line is found, 
that line is marked for replacement. 


The four lines in the set are labeled 10,11,12,and 13. 
The order in which the valid bits are checked during 
an invalidation is 10,11,12and 13.All valid bits are 
cleared when the processor is reset or when the 
cache is flushed. 


Replacement in the cache is handled by a pseudo 
least recently used (LRU) mechanism when all four 
lines in a set are valid. Three bits, 80, 81 and 82, 
are defined for each of the 128 sets in the cache. 
These bits are called the LRU bits. The LRU bits are 
updated for every hit or replace in the cache. 


If the most recent access to the set was to 10or 11, 
80 is set to 1. 80 is set to 0 if the most recent ac- 
cess was to 12or 13.If the most recent access to 
10:11was to 10,81 is set to 1, else 81 is set to O. If 
the most recent access to 12:13was to 12,82 is set to 
1, else 82 is set to O. 


The pseudo LRU mechanism works in the following 
manner. When a line must be replaced, the cache 
will first select which of 10:11and 12:13was least re- 
cently used. Then the cache will determine which of 
the two lines was least recently used and mark it for 
replacement. This decision tree is shown in Figure 
5.2. When the processor is reset or when the cache 
is flushed all 128 sets of three LRU bits are set to O. 
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recently 
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recently 
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5.6 
Page Cacheability 


Two bits for cache control, PWT and PCD, are de- 
fined in the page table and page directory entries. 
The state of these bits are driven out on the PWT 
and PCD pins during memory access cycles. 


The PWT bit controls write policy for second level 
caches used with the Intel486 DX microprocessor. 
Setting PWT= 1 defines a write-through policy for 
the current page while PWT= 0 allows the possibility 
of write-back. The state of PWT is ignored internally 
by the Intel486 DX microprocessor since the on-chip 
cache is write through. 


The PCD bit controls cacheability on a page by page 
basis. The PCD bit is internally ANDed with the 
KEN# signal to control cacheability on a cycle by 
cycle basis (see Figure 5.3). PCD=O enables cach- 
ing while PCD= 1 forbids it. Note that cache fills are 
enabled when PCD= 0 AND KEN# = O.This logical 
AND is implemented physically with a NOR gate. 


The state of the PCD bit in the page table entry is 
driven on the PCD pin when a page in external mem- 
ory is accessed. The state of the PCD pin informs 
the external system of the cacheability of the re- 
quested information. The external system then re- 
turns KEN# telling the Intel486 DX microprocessor 
if the area is cacheable. The Intel486 DX microproc- 
essor initiates a cache line fill if PCD and KEN# 
indicate that the requested information is cacheable. 
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The PCD bit is masked with the CD (cache disable) 
bit in control register 0 to determine the state of the 
PCD pin. If CD= 1 the Intel486 OX microprocessor 
forces the PCD pin HIGH. If CD= 0 the PCD pin is 
driven with the value for the page table entry/direc- 
tory. See Figure 5.3. 


The PWT and PCD bits for a bus cycle are obtained 
from either CR3, the page directory or page table 
entry. These bits are assumed to be zero during real 
mode, whenever paging is disabled, or for cycles 
that bypass paging, (I/O references, interrupt ac- 
knowledge and Halt cycles), the PWT and PCD bits 
are taken from CR3. These bits are initialized to 0 on 
reset, but can be set to any value by level 0 soft- 
ware. 


When paging is enabled, the bits from the page table 
entry are cached in the TLB, and are driven any time 
the page mapped by the TLB entry is referenced. 
For normal memory cycles, PWT and PCD are taken 
from the page table entry. During TLB refresh cycles 
where the page table and directory entries are read, 
the PWT and PCD bits must be obtained elsewhere. 
During page table updates the bits are obtained from 
the page directory. When the page directory is up- 
dated the bits are obtained from CR3. 


5.7 
Cache Flushing 


The on-chip cache can be flushed by external hard- 
ware or by software instructions. Flushing the cache 
clears all valid bits for all lines in the cache. The 
cache is flushed when external hardware asserts the 
FLUSH# pin. 


The flush pin needs to be asserted for one clock if 
driven synchronously or for two clocks if driven 
asynchronously. The flush input is asynchronous but 
setup and hold times must be met. The flush pin 
should be deasserted after the cache flush is com- 
plete. Failure to deassert the pin will cause execu- 
tion to stop as the processor will be repeatedly flush- 
ing the cache. If external hardware activates flush in 
response to an I/O write, flush must be asserted for 
at least two clocks prior to ready being returned for 
the I/O write. This ensures that the flush completes 
before the CPU begins execution of the instruction 
following the OUT instruction. 


Flush is recognized during HOLD just like EADS#. 


The instructions INVD and WBINVD cause the on- 
cache to be flushed. External caches connected to 
the Intel486 OX microprocessor are signalled to 
flush their contents when these instructions are exe- 
cuted. 


WBINVD will cause ari external write-back cache to 
write back dirty lines before flushing its contents. 
The external cache is signalled using the bus cycle 
definition pins and the byte enables (refer to Section 
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6.2.5 for the bus cycle definition pins and Section 
7.2.11 for special bus cycles). Refer to the Intel486 
OX microprocessor programmers reference manual 
for detailed instruction definitions. 


The results of the INVD and WBINVD instructions 
are identical for the operation of the Intel486 OX mi- 
croprocessor's on-chip cache since the cache is 
write-through. Note that the INVD and WBINVD in- 
structions are machine dependent. Future members 
of the 
Intel486 
OX microprocessor 
family 
may 
change the definition of this instruction. 


5.8 
Caching Translation 
Lookaside 
Buffer Entries 


The Intel486 OX microprocessor contains an inte- 
grated paging unit with a translation lookaside buffer 
(TLB). The TLB contains 32 entries. The TLB has 
been enhanced over the Intel386 microprocessor's 
TLB by upgrading the replacement strategy to a 
pseudo-LRU (least recently used) algorithm. The 
pseudo-LRU replacement algorithm is the same as 
that used in the on-chip cache. 


The paging TLB operation is automatic whenever 
paging is enabled. The TLB contains the most re- 
cently used page table entries. A page table entry 
translates the linear address pointing to a particular 
page to the physical address where the page is 
stored in memory (refer to Section 4.5, Paging). 


The paging unit will look up the linear address in the 
TLB in response to an internal bus request. The cor- 
responding physical address is passed on to the on- 
chip cache or the external bus (in the event of a 
cache miss) when the linear address is present in 
the TLB. 


The paging unit will access the page tables in exter- 
nal memory if the linear address is not in the TLB. 
The required page table entry will be read into the 
TLB and then the cache or bus cycle for the actual 
data will take place. The process of reading a new 
page table entry into the TLB is called a TLB refresh. 


A TLB refresh is a two step process. The paging unit 
must first read the page directory entry which points 
to the appropriate page table. The page table entry 
to be stored in the TLB is then read from the page 
table. Control register 3 (CR3) points to the base of 
the page directory table. 


The Intel486 OX microprocessor will allow page di- 
rectory and page table entries (returned during TLB 
refreshes) to be stored in the on-chip cache. Setting 
the PCD bits in CR3 and the page directory entry to 
1 will prevent the page directory and page table en- 
tries from being stored in the on-chip cache (see 
Section 5.6, Page Cacheabllity). 
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The Intel486 OX2 microprocessor bus has been de- 
signed to be identical with the Intel486 micropro- 
cessor bus. Several new features have been added 
to the Intel486 OX2 to increase performance and 
testability. New features include a speed doubler for 
the core logic, and IEEE 1149.1 boundary scan sup- 
port. 


The Intel486 OX2 is driven by what can be called a 
%x clock, as opposed to the 1x clock in the Intel486 
OX and the 2x clock in the Intel386 micropro- 
cessors. Thus a 50 MHz Intel486 OX2 is driven by a 
25 MHz clock, in contrast with 25 MHz processors 
like the Intel486 OX2 and the Intel386 requiring 
25 MHz and 50 MHz, respectively. Since the In- 
tel486 OX has a clock doubler driving its core, but 
not its bus interface, it provides a simpler system 
design for a given performance level than the In- 
tel486 OX. In reality, this permits dramatic increases 
in performance by just plugging in the Intel486 
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OX2 in a system that had already been designed for 
the Inte1486.This performance is supplied because 
the bus interface is identical to that of an Intel486 
OX and all of the core is running at twice the speed 
of the comparable Intel486 OX. This speedup in- 
cludes the internal cache memory, the floating point 
unit, the instruction decode unit, the ALU and every- 
thing except the bus interface. 


Like the Intel386 microprocessor, the Intel486 OX 
microprocessor has separate parallel busses for 
data and addresses. The bidirectonal data bus is 
32 bits in width. The address bus consists of two 
components: 30 address lines (A2-A31) and 4 byte 
enable lines (BEO#-BE3#). 
The address bus ad- 2 


dresses external memory in the same manner as the 
Intel386 microprocessor: The address lines form the 
upper 30 bits of the address and the byte enables 
select individual bytes within a 4 byte location. The 
address lines are bidirectional for use in cache line 
invalidations. 


A2-A31 l32-Bit 
Address 
Bu, 


Byte)~.. 


] 


Bu, Cycle 


Definition 
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Bu, 


Arbitration 
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Bur,\ 
Control 


} 
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Control 


The Intel486 DX microprocessor's burst bus mecha- 
nism enables high-speed cache fills from external 
memory. Burst cycles can strobe data into the proc- 
essor at a rate of one item every clock. Non-burst 
cycles have a maximum rate of one item every two 
clocks. Burst cycles are not limited to cache fills: all 
bus cycles requiring more than a single data cycle 
can be bursted. 


The Intel486 DX microprocessor has a bus hold fea- 
ture similar to that of the Intel386 microprocessor. 
During bus hold, the Intel486 DX microprocessor re- 
linquishes control of the local bus by floating its ad- 
dress, data and control busses. 


The Intel486 DX microprocessor has an address 
hold feature in addition to bus hold. During address 
hold only the address bus is floated, the data and 
control busses can remain active. Address hold is 
used for cache line invalidations. 


Ahead is a brief description of the Intel486 DX mi- 
croprocessor input and output signals arranged by 
functional groups. Before beginning the signal de- 
scriptions a few terms need to be defined. The # 
symbol at the end of a signal name indicates the 
active, or asserted, state occurs when the signal is 
at a low voltage. When a # is not present after the 
signal name, the signal is active at the high voltage 
level. The term "ready" is used to indicate that the 
cycle is terminated with RDY# or BRDY#. 


Section 6 and 7 will discuss bus cycles and data 
cycles. A bus cycle is at least two clocks long and 
begins with ADS# active in the first clock and ready 
active in the last clock. Data is transferred to or from 
the Intel486 DX microprocessor during a data cycle. 
A bus cycle contains one or more data cycles. 


ClK provides the fundamental timing and the inter- 
nal operating frequency for the Intel486 DX2 micro- 
processor. All external timing parameters are speci- 
fied with respect to the rising edge of ClK. 


The Intel486 DX2 microprocessor can operate over 
a wide frequency range but ClK's frequency cannot 
change rapidly while RESET is inactive. ClK's fre- 
quency must be stable for proper chip operation 
since a single edge of ClK is used internally to gen- 
erate four phases. ClK only needs TTl 
levels for 
proper operation. Figure 6.2 illustrates the ClK 
waveform. 


6.2.2 
Address Bus (A31-A2, BEO#-BE3#) 


A31-A2 
and BEO#-BE3# 
form the address bus 
and provide physical memory and I/O port address- 
es. The Intel486 DX microprocessor is capable of 
addressing 4 gigabytes of physical memory space 
(OOOOOOOOH 
through FFFFFFFFH), and 64 Kbytes 
of 
I/O 
address 
space 
(OOOOOOOOHthrough 
OOOOFFFFH).A31-A2 identify addresses to a 4-byte 
location. BEO#-BE3# 
identify which bytes within 
the 4-byte location are involved in the current trans- 
fer. 


Addresses are driven back into the Intel486 DX mi- 
croprocessor over A31-A4 during cache line invali- 
dations. The address lines are active HIGH. When 
used as inputs into the processor, A31-A4 
must 
meet the setup and hold times, t22 and t23. A31-A2 
are not driven during bus or address hold. 


tx = input setup times 
ty = input hold times, 
output 
float, valid and hold times 


The byte enable outputs, BEO#-BE3#, 
determine 
which bytes must be driven valid for read and write 
cycles to external memory. 


BE3# applies to 024-031 
BE2# applies to 016-023 
BE1# applies to 08-015 
BEO# applies to 00-07 


BEO#-BE3# 
can be decoded to generate AO, A1 
and BHE# 
signals used in 8- and 16-bit systems 
(see Table 7.5). BEO#-BE3# 
are active LOW and 
are not driven during bus hold. 


6.2.3 DATA LINES (031-00) 


The bidirectional lines, 031-00, 
form the data bus 
for the Intel486 OX microprocessor. 00-07 
define 
the least significant byte and 024-031 
the most sig- 
nificant byte. Data transfers to 8- or 16-bit devices is 
possible using the data bus sizing feature controlled 
by the BS8# or BS16# input pins. 


031-00 
are active HIGH. For reads, 031-00 
must 
meet the setup and hold times, t22 and t23' 031-00 
are not driven during read cycles and bus hold. 


6.2.4 PARITY 


Data Parity Input/Outputs (DPO-DP3) 


OPO-OP3 are the data parity pins for the processor. 
There is one pin for each byte of the data bus. Even 
parity is generated or checked by the parity genera- 
torsi checkers. Even parity means that there are an 
even number of HIGH inputs on the eight corre- 
sponding data bus pins and parity pin. 


Data parity is generated on all write data cycles with 
the same timing as the data driven by the Intel486 
OX microprocessor. Even parity information must be 
driven back to the Intel486 OX microprocessor on 
these pins with the same timing as read information 
to insure that the correct parity check status is indi- 
cated by the Intel486 OX microprocessor. 


The values read on these pins do not affect program 
execution. It is the responsibility of the system to 
take appropriate actions if a parity error occurs. 


Input signals on OPO-OP3 must meet setup and 
hold times t22 and t23 for proper operation. 


Parity status is driven on the PCHK# pin, and a pari- 
ty error is indicated by this pin being LOW. PCHK# 
is driven the clock after ready for read operations to 
indicate the parity status for the data sampled at the 


end of the previous clock. Parity is checked during 
code reads, memory reads and liD 
reads. Parity is 
not checked dUring interrupt acknowledge cycles. 
PCHK# only checks the parity status for enabled 
bytes as indicated by the byte enable and bus size 
signals. It is valid only in the clock immediately after 
read data is returned to the Intel486 OX microproc- 
essor. At all other times it is inactive (HIGH). 
PCHK# is never floated. 


Driving PCHK# is the only effect that bad input pari- 
ty has on the Intel486 OX microprocessor. The In- 
tel486 OX microprocessor will not vector to a bus 
error interrupt when bad data parity is returned. In 
systems that will not employ parity, PCHK# can be 
ignored. In systems not using parity, OPO-OP3 
should be connected to Vcc through a pullup resis- 
tor. 


6.2.5 BUS CYCLE DEFINITION 


MIlOI', 
DIC#, W/R# Outputs 


MilOI', 
O/C# 
and W/R# 
are the primary bus cycle 
definition signals. They are driven valid as the AOS# 
signal is asserted. MilOI' 
distinguishes between 
memory and liD 
cycles, O/C# 
distinguishes be- 


tween data and control cycles and W/R# 
distin- 
guishes between write and read cycles. 


Bus cycle definitions as a function of MilOI', 
O/C# 
and W/R# 
are given in Table 6.1. Note there is a 
difference between the Intel486 OX microprocessor 
and Intel386 microprocessor bus cycle definitions. 
The halt bus cycle type has been moved to location 
001 in the Intel486 OX microprocessor from location 
101 in the Intel386 microprocessor. Location 101 is 
now reserved and will never be generated by the 
Intel486 OX microprocessor. 


M/IO# 
D/C# 
W/R# 
Bus Cycle Initiated 


0 
0 
0 
InterruptAcknowledge 
0 
0 
1 
Halt/SpecialCycle 
0 
1 
0 
1/0 Read 
0 
1 
1 
1/0 Write 
1 
0 
0 
CodeRead 
1 
0 
1 
Reserved 
1 
1 
0 
MemoryRead 
1 
1 
1 
MemoryWrite 


LOCK# indicates that the Intel486 OX microproces- 
sor is running a read-modify-write cycle where the 
external bus must not be relinquished between the 


read and write cycles. Read-modify-write cycles are 
used to 
implement 
memory-based semaphores. 
Multiple reads or writes can be locked. 


When LOCKIF is asserted, the current bus cycle is 
locked and the Intel486 DX microprocessor should 
be allowed exclusive access to the system bus. 
LOCKIF 
goes active in the first clock of the first 
locked bus cycle and goes inactive after ready is 
returned indicating the last locked bus cycle. 


The Intel486 DX microprocessor will not acknowl- 
edge bus hold when LOCKIF is asserted (though it 
will allow an address hold). LOCKIF is active LOW 
and is floated during bus hold. Locked read cycles 
will not be transformed into cache fill cycles if KENIF 
is returned active. Refer to Section 7.2.6 for a de- 
tailed discussion of Locked bus cycles. 


The pseudo-lock feature allows atomic reads and 
writes of memory operands greater than 32 bits. 
These operands require more than one cycle to 
transfer. The Intel486 DX microprocessor asserts 
PLOCKIF during floating point long reads and writes 
(64 bits), segment table descriptor reads (64 bits) 
and cache line fills (128 bits}. 


When PLOCKIF is asserted no other master will be 
given control of the bus between cycles. A bus hold 
request (HOLD) is not acknowledged during pseudo- 
locked reads and writes, with one exception. During 
non-cacheable non-bursted code prefetches, HOLD 
is recognized on memory cycle boundaries even 
though PLOCKIF is asserted. The Intel486 DX mi- 
croprocessor will drive PLOCKIF active until the ad- 
dresses for the last bus cycle of the transaction 
have been driven regardless of whether BRDYIF or 
RDYIF are returned. 


A pseudo-locked transfer is meaningful only if the 
memory operand is aligned and if its completely con- 
tained within a single cache line. A 64-bit floating 
point number must be aligned to an 8-byte boundary 
to guarantee an atomic access. 


Normally PLOCKIF 
and BLASTIF 
are inverse of 
each other. However during the first cycle of a 64-bit 
floating point write, both PLOCKIF and BLASTIF will 
be asserted. 


Since PLOCKIF is a function of the bus size and 
KENIF inputs, PLOCKIF should be sampled only in 
the clock ready is returned. This piri is active LOW 
and is not driven during bus hold. Refer to Section 
7.2.7 for a detailed discussion of pseudo-locked bus 
cycles. 
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The bus control signals allow the processor to indi- 
cate when a bus cycle has begun, and allow other 
system hardware to control burst cycles, data bus 
width and bus cycle termination. 


The ADS IF output indicates that the address and 
bus cycle definition signals are valid. This signal will 
go active in the first clock of a bus cycle and go 
inactive in the second and subsequent clocks of the 
cycle. ADS IF is also inactive when the bus is idle. 


ADSIF is used by external bus circuitry as the indica- 
tion that the processor has started a bus cycle. The 
external circuit must sample the bus cycle definition 
pins on the next rising edge of the clock after ADS IF 
is driven active. 


ADS IF is active LOW and is not driven during bus 
hold. 


RDYIF indicates that the current bus cycle is com- 
plete. In response to a read, RDYIF indicates that 
the external system has presented valid data on the 
data pins. In response to a write request, RDYIF indi- 
cates that the external system has accepted the In- 
tel486 DX microprocessor data. RDYIF is ignored 
when the bus is idle and at the end of the first clock 
of the bus cycle. Since RDYIF is sampled during ad- 
dress hold, data can be returned to the processor 
when AHOLD is active. 


RDYIF is active LOW, and is not provided with an 
internal pullup resistor. This input must satisfy setup 
and hold times t16 and t17 for proper chip operation. 


6.2.7 
BURST 
CONTROL 


Burst Ready 
Input (BRDY IF) 


BRDYIF performs the same function during a burst 
cycle that RDYIF performs during a non-burst cycle. 
BRDYIF indicates that the external system has pre- 
sented valid data on the data pins in response to a 
read or that the external system has accepted the 
Intel486 DX microprocessor data in response to a 
write. BRDYIF is ignored when the bus is idle and at 
the end of the first clock in a bus cycle. 


During a burst cycle, BRDYIF will be sampled each 
clock, and if active, the data presented on the data 
bus pins will be strobed into the Intel486 DX micro- 
processor. ADSIF is negated during the second 
through last data cycles in the burst, but address 
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lines A2-A3 and byte enables will change to reflect 
the next data item expected by the Intel486 OX mi- 
croprocessor. 


If ROYII is returned simultaneously with BROYII, 
BROYII is ignored and the burst cycle is premature- 
ly aborted. An additional complete bus cycle will be 
initiated after an aborted burst cycle if the cache line 
fill was not complete. BROYII is treated as a normal 
ready for the last data cycle in a burst transfer or for 
non-burstable cycles. Refer to Section 7.2.2 for 
burst cycle timing. 


BROYII is active lOW and is provided with a small 
internal pullup resistor. BROYII must satisfy the set- 
up and hold times t16 and t17. 


Burst Last Output (BLAST II) 


BLASTII indicates that the next time BROYII is re- 
turned it will be treated as a normal ROYII, terminat- 
ing the line fill or other multiple-data-cycle transfer. 
BLASTII is active for all bus cycles regardless of 
whether they are cacheable or not. This pin is active 
lOW and is not driven during bus hold. 


6.2.8 INTERRUPT SIGNALS (RESET, INTR, 
NMI) 


The interrupt signals can interrupt or suspend exe- 
cution of the processor's current instruction stream. 


Reset Input (RESET) 


RESET forces the Intel486 OX2 microprocessor to 
begin execution at a known state. For a power-up 
(cold start) reset, Vcc and ClK must reach their 
proper 
DC and AC specifications 
for 
at least 
1 ms before the Intel486 OX2 microprocessor be- 
gins instruction execution. The RESET pin should 
remain active during this ti.me to ensure proper In- 
tel486 OX2 microprocessor operation. However, for 
a warm boot-up 
case, RESET Is required to re- 
main active for a minimum of 15 clocks. The test- 
ability operating modes are programmed by the fail- 
ing (inactive going) edge of RESET. (Refer to Sec- 
tion 8.0 for a description of the test modes during 
reset.) 


Maskable Interrupt 
Request Input (INTR) 


INTR indicates that an external interrupt has been 
generated. Interrupt processing is initiated if the IF 
flag is active in the EFLAGS register. 


The Intel486 OX microprocessor will generate two 
locked 
interrupt acknowledge bus cycles in re- 
sponse to asserting the INTR pin. An 8-bit interrupt 
number will be latched from an external interrupt 
controller at the end of the second interrupt ac- 
knowledge cycle. INTR must remain active until the 
interrupt acknowledges have been performed to as- 


sure program interruption. Refer to Section 7.2.10 
for a detailed discussion of interrupt acknowledge 
cycles. 


The INTR pin is active HIGH and is not provided with 
an internal pulldown resistor. INTR is asynchronous, 
but the INTR setup and hold times, t20 and t21, must 
be met to assure recognition on any specific clock. 


NMI is the non-maskable interrupt request signal. 
Asserting NMI causes an interrupt with an internally 
supplied vector value of 2. External interrupt ac- 
knowledge cycles are not generated since the NMI 
interrupt vector is internally generated. When NMI 
processing begins, the NMI signal will be masked 
internally until the IRET instruction is executed. 


NMI is rising edge sensitive after internal synchroni- 
zation. NMI must be held lOW for at least four elK 
periods before this rising edge for proper operation. 
NMI is not provided with an internal pulldown resis- 
tor. NMI is asynchronous but setup and hold times, 
t20and t21must be met to assure recognition on any 
specific clock. 


This section describes the mechanism by which the 
processor relinquishes control of its local bus when 
requested by another bus master. 


Bus Request Output (BREQ) 


The Intel486 OX microprocessor asserts BREQ 
whenever a bus cycle is pending internally. Thus, 
BREQ is always asserted in the first clock of a bus 
cycle, along with ADSII. Furthermore, if the Intel486 
OX microprocessor is currently not driving the bus 
(due to HOLD, AHOLD, or BOFFII), BREQ is assert- 
ed in the same clock that ADSII would have been 
asserted if the processor were driving the bus. After 
the first clock of the bus cycle, BREQ may change 
state. It will be asserted if additional cycles are nec- 
essary to complete a transfer (via BS8II, BS16II , 
KENII), 
or if more cycles are pending internally. 
However, if no additional cycles are necessary to 
complete the current transfer, BREQ can be negat- 
ed before ready comes back for the current cycle. 
External logic can use the BREQ signal to arbitrate 
among multiple processors. This pin is driven re- 
gardless of the state of bus hold or address hold. 
BREQ is active HIGH and is never floated. During a 
hold state, internal events may cause BREQ to be 
deasserted prior to any bus cycles. 


HOLD allows another bus master complete control 
of the Intel486 OX microprocessor bus. The Intel486 


DX microprocessor will respond to an active HOLD 
signal by asserting HLDA and placing most of its 
output and input/output pins in a high impedance 
state (floated) after completing its current bus cycle, 
burst cycle, or sequence of locked cycles. In addi- 
tion, if the Intel486 DX CPU receives a HOLD re- 
quest while performing a code fetch, and that cycle 
is backed off (BOFF#), the Intel486 DX CPU will 
recognize HOLD before restarting the cycle. The 
BREQ, HLDA, PCHK# and FERR# 
pins are not 
floated during bus hold. The Intel486 DX microproc- 
essor will maintain its bus in this state until the 
HOLD is deasserted. Refer to Section 7.2.9 for tim- 
ing diagrams for bus hold cycles and HOLD request 
acknowledge during BOFF#. 


Unlike the Intel386 microprocessor, the Intel486 
OX 
microprocessor 
will recognize 
HOLD 
during 
re- 
set. Pullup resistors are not provided for the outputs 
that are floated in response to HOLD. HOLD is ac- 
tive HIGH and is not provided with an internal pull- 
down resistor. HOLD must satisfy setup and hold 
times t18 and t19 for proper chip operation. 


HLDA indicates that the Intel486 DX microprocessor 
has given the bus to another local bus master. HLDA 
goes active in response to a hold request presented 
on the HOLD pin. HLDA is driven active in the same 
bus clock cycle that the Intel486 DX microprocessor 
floats its bus. 


HLDA will be driven inactive when leaving bus hold 
and the Intel486 DX microprocessor will resume 
driving the bus. The Intel486 DX microprocessor will 
not cease internal activity during bus hold since the 
internal cache will satisfy the majority of bus re- 
quests. HLDA is active HIGH and remains driven 
during bus hold. 


Backoff 
Input (BOFF #) 


Asserting the BOFF# input forces the Intel486 DX 
microprocessor to release control of its bus in the 
next clock. The pins floated are exactly the same as 
in response to HOLD. The response to BOFF# dif- 
fers from the response to HOLD in two ways: First, 
the 
bus is floated 
immediately in response to 
BOFF# while the Intel486 DX microprocessor com- 
pletes the current bus cycle before floating its bus in 
response to HOLD. Second the Intel486 DX does 
not assert HLDA in response to BOFF#. 


The processor remains in bus hold until BOFF# is 
negated. Upon negation, the Intel486 DX microproc- 
essor restarts the bus cycle aborted when BOFF# 
was asserted. To the internal execution engine the 
effect of BOFF# is the same as inserting a few wait 
states to the original cycle. Refer to Section 7.2.12 
for a description of bus cycle restart. 
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Any data returned to the processor while BOFF# is 
asserted is ignored. BOFF# has higher priority than 
RDY# or BRDY#. If both BOFF# and ready are 
returned in the same clock, BOFF# takes effect. If 
BOFF# is asserted while the bus is idle, the Intel486 
DX microprocessor will float its bus in the next bus 
clock. BOFF# is active LOW and must meet setup 
and hold times t18 and t19for proper chip operation. 


6.2.10 
CACHE 
INVALIDATION 


The AHOLD and EADS# 
inputs are used during 
cache invalidation cycles. AHOLD conditions the In- 
tel486 DX microprocessors address lines, A4-A31, 
to accept an address input. EADS# indicates that 
an external address is actually valid on the address 
inputs. Activating EADS# will cause the Intel486 DX 
microprocessor to read the external address bus 
and perform an internal cache invalidation cycle to 
the address indicated. Refer to Section 7.2.8 for 
cache invalidation cycle timing. 


Address 
Hold Request 
Input (AHOLD) 


AHOLD is the address hold request. It allows anoth- 
er bus master access to the Intel486 DX microproc- 
essor address bus for performing an internal cache 
invalidation cycle. Asserting AHOLD will force the 
Intel486 DX microprocessor to stop driVing its ad- 
dress bus in the next clock. While AHOLD is active 
only the address bus will be floated, the remainder 
of the bus can remain active. For example, data can 
be returned for a previously specified bus cycle 
when AHOLD is active. The Intel486 DX microproc- 
essor will not initiate another bus cycle during ad- 
dress hold. Since the Intel486 DX microprocessor 
floats its bus immediately in response to AHOLD, an 
address hold acknowledge is not required. If AHOLD 
is asserted while a bus cycle is in progress, and no 
readies are returned during the time AHOLD is as- 
serted, the Intel486 DX will redrive the same ad- 
dress (that it originally sent out) once AHOLD is neg- 
ated. 


AHOLD is recognized during reset. Since the entire 
cache is invalidated by reset, any invalidation cycles 
run during reset will be unnecessary. AHOLD is ac- 
tive HIGH and is provided with a small internal pull- 
down resistor. It must satisfy the setup and hold 
times t18 and t19 for proper chip operation. This pin 
determines whether or not the built in self test fea- 
tures of the Intel486 DX microprocessor will be exer- 
cised on assertion of RESET. 


EADS# indicates that a valid external address has 
been driven onto the Intel486 DX address pins. This 
address will be used to perform an internal cache 
invalidation cycle. The external address will be 
checked with the current cache contents. If the ad- 
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dress specified matches any areas in the cache, that 
area will immediately be invalidated. 


An invalidation cycle may be run by asserting 
EADS# regardless of the state of AHOLD, HOLD 
and BOFF#. EADS# is active LOW and is provided 
with an internal pullup resistor. EADS# must satisfy 
the setup and hold times t12 and t13 for proper chip 
operation. 


6.2.11 CACHE CONTROL 


Cache Enable Input (KEN# ) 


KEN# is the cache enable pin. KEN# is used to 
determine whether the data being returned by the 
current cycle is cacheable. When KEN# is active 
and the Intel486 OX microprocessor generates a cy- 
cle that can be cached (most any memory read cy- 
cle), the cycle will be transformed into a cache line 
fill cycle. 


A cache line is 16 bytes long. Duringthe first cycle of 
a cache line fill the byte-enable pins should be ig- 
nored and data should be returned as if all four byte 
enables were asserted. The Intel486 OX microproc- 
essor will run between 4 and 16 contiguous bus cy- 
cles to fill the line depending on the bus data width 
selected by SS8# and BS16#. Refer to Section 
7.2.3 for a description of cache line fill cycles. 


The KEN# input is active LOW and is provided with 
a small internal pullup resistor. It must satisfy the 
setup and hold times t14 and t15 for proper chip op- 
eration. 


Cache Flush Input (FLUSH#) 


The FLUSH# input forces the Intel486 OX micro- 
processor to flush its entire internal cache. FLUSH# 
is active LOW and need only be asserted for one 
clock. FLUSH# is asynchronous but setup and hold 
times t20 and t21 must be met for recognition on any 
specific clock. 


FLUSH# also determines whether or not the 3-state 
test mode of the Intel486 OX microprocessor will be 
invoked on assertion of RESET. 


6.2.12 PAGE CACHEABILITY (PWT, PCO) 


The PWT and PCD output signals correspond to two 
user attribute bits in the page table entry. When pag- 
ing is enabled, PWT and PCD correspond to bits 3 
and 4 of the page table entry respectively. For cy- 
cles that are not paged when paging is enabled (for 
example I/O cycles) PWT and PCD correspond to 
bits 3 and 4 in control register 3. When paging is 
disabled, the Intel486 OX CPU ignores the PCD and 
PWT bits and assumes they are zero for the purpose 
of caching and driving PCD and PWT. 


PCD is masked by the CD (cache disable) bit in con- 
trol register 0 (CRO).When CD= 1 (cache line fills 
disabled) the Intel486 OX microprocessor forces 
PCD HIGH. When CD=O, PCD is driven with the 
value of the page table entry/directory. 


The purpose of PCD is to provide a cacheable/non- 
cacheable indication on a page by page basis. The 
Intel486 OXwill not perform a cache fill to any page 
in which bit 4 of the page table entry is set. PWT 
corresponds to the write-back bit and can be used 
by an external cache to provide this functionality. 
PCD and PWT bits are assigned to be zero during 
real mode or whenever paging is disabled. Refer to 
Sections 4.5.4 and 5.6 for a discussion of non- 
cacheable pages. 


PCD and PWT have the same timing as the cycle 
definition pins (M/IO#, 
D/C#, 
W/R#). 
PCD and 
PWT are active HIGH and are not driven during bus 
hold. 


6.2.13 NUMERIC ERROR REPORTING 
(FERR#,IGNNH) 


To allow PC-type floating point error reporting, the 
Intel486 
OX microprocessor 
provides 
two 
pins, 


FERR# and IGNNE#. 


The Intel486 OX microprocessor asserts FERR# 
whenever an unmasked floating point error is en- 
countered. FERR# is similar to the ERROR# pin on 
the Intel387 math coprocessor. FERR# 
can be 
used by external logic for PC-typefloating point error 
reporting in Intel486 OX microprocessor systems. 
FERR# is active LOW, and is not floated during bus 
hold. 


In some cases, FERR# is asserted when the next 
floating point instruction is encountered and in other 
cases it is asserted before the next floating point 
instruction is encountered depending upon the exe- 
cution state of the instruction causing the exception. 


The following class of floating point exceptions drive 
FERR# at the time the exception occurs (i.e., before 
encountering the next floating point instruction). 
1. The stack fault, invalid operation, and denormal 
exceptions on all transcendental instructions, in- 
teger arithmetic instructions, FSQRT, FSCALE, 
FPREM(1), FXTRACT, FBLD, and FBSTP. 
2. Any exceptions on store instructions (including 
integer store instructions). 


The following class of floating point exceptions drive 
FERR# only after encountering the next floating 
point instruction. 


1. Exceptions other than on all transcendental in- 
structions, 
integer 
arithmetic 
instructions, 


FSQRT, FSCALE, FPREM(1), FXTRACT, F8LO, 
and F8STP. 
2. Any exception on all basic arithmetic, load, com- 
pare, and control instructions (Le., all other in- 
structions). 


Ignore Numeric Error Input (IGNNE#) 


The Intel486 OXmicroprocessor will ignore a numer- 
ic error and continue executing non-control floating 
point instructions when IGNNE# 
is asserted, but 
FERR# will still be activated. When deasserted, the 
Intel486 OX microprocessor will freeze on a non- 
control floating point instruction if a previous instruc- 
tion caused an error. IGNNE# has no effect when 
the NE bit in control register 0 is set. 


The IGNNE# input is active LOW and is provided 
with a small internal pullup resistor. This input is 
asynchronous, but must meet setup and hold times 
t20 and t21 to insure recognition on any specific 
clock. 


6.2.14 BUS SIZE CONTROL (BS16#, B58#) 


The 8S16# and BS8# inputs allow external 16- and 
8-bit busses to be supported with a small number of 
external components. The Intel486 OX CPU sam- 
ples these pins every clock. The value sampled in 
the clock before ready determines the bus size. 
When asserting 8S16# or 8S8# only 16 or 8 bits of 
the data bus need be valid. If both 8S16# 
and 
8S8# are asserted, an 8-bit bus width is selected. 


When 8S16# or 8S8# are asserted the Intel486 OX 
microprocessor will convert a larger data request to 
the appropriate number of smaller transfers. The 
byte enables will also be modified appropriately for 
the bus size selected. 


8S16# and 8S8# are active LOW and are provided 
with small internal pullup resistors. BS16# 
and 
8S8# must satisfy the setup and hold times t14 and 
t15 for proper chip operation. 


6.2.15 ADDRESS BIT 20 MASK (A20M #) 


Asserting the A20M# input causes the Intel486 OX 
microprocessor to mask physical address bit 20 be- 
fore performing a lookup in the internal cache and 
before driving a memory cycle to the outside world. 
When A20M # is asserted, the Intel486 OX micro- 
processor emulates the 1 Mbyte address wrap- 
around that occurs on the 8086. A20M# is active 
LOW and must be asserted only when the processor 
is in real mode. The A20M# is not defined in Pro- 
tected Mode. A20M# is asynchronous but should 
meet setup and hold times t20 and t21 for recogni- 
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tion in any specific clock. For correct operation of 
the chip, A20M# should be sampled high 2 clocks 
before and 2 clocks after RESET goes low. 


Test Clock (TCK) 


TCK is an input to the Intel486 OX2 CPU and pro- 
vides the clocking function required by the JTAG 
boundary scan feature. TCK is used to clock state 
information and data into and out of the component. 
State select information and data are clocked into 
the component on the rising edge of TCK on TMS 
and TOI, respectively. Data is clocked out of the part 
on the falling edge of TCK on TOO. 


In addition to using TCK as a free running clock, it 
may be stopped in a low, 0, state, indefinitely as 
described in IEEE 1149.1. While TCK is stopped in 
the low state, the boundary scan latches retain their 
state. 


When boundary scan is not used, TCK should be 
tied high or left as a NC (This is important during 
power up to avoid the possibility of glitches on the 
TCK which could prematurely initiate boundary scan 
operations). TCK is supplied with an internal pullup 
resistor. 


TCK is a clock signal and is used as a reference for 
sampling other JTAG signals. On the rising edge of 
TCK, TMS and TOI are sampled. On the falling edge 
of TCK, TOO is driven. 


Test Mode Select (TMS) 


TMS is decoded by the JTAG TAP (Tap Access 
Port) to select the operation of the test logic, as de- 
scribed in Section 8.5.4. 


To guarantee deterministic behavior of the TAP con- 
troller, TMS is provided with an internal pull-up resis- 
tor. If boundary scan is not used, TMS may be tied 
high or left unconnected. TMS is sampled on the 
rising edge of TCK. TMS is used to select the inter- 
nal TAP states required to load boundary scan in- 
structions to data on TO!. For proper initialization of 
the JTAG logic, TMS should be driven high, "1", for 
at least four TCK cycles following the rising edge of 
RESET. 


Test Data Input (TOI) 


TOI is the serial input used to shift JTAG instructions 
and data into the component. The shifting of instruc- 
tions and data occurs during the SHIFT-IR and 
SHIFT-DR controller 
states, 
respectively. These 
states are selected using the TMS signal as de- 
scribed in Section 8.5.4. 


An internal pull-up resistor is provided on TOI to en- 
sure a known logic state if an open circuit occurs on 
the TOI path. Note that when "1" 
is continuously 
shifted into the instruction register, the BYPASS 
instruction 
is selected. TOI is sampled on the 
rising edge of TCK, during the SHIFT-IR and the 
SHIFT-DR states. During all other TAP controller 
states, TOI is a "don't care". 


Test Data Output 
(TOO) 


TOO is the serial output used to shift JTAG instruc- 
tions and data out of the component. The shifting of 
instructions and data occurs during the SHIFT-IR 
and SHIFT-DR TAP controller states, respectively. 
These states are selected using the TMS signal as 
described in Section 8.5.4. When not in SHIFT-IR or 
SHIFT-DR state, TOO is driven to a high impedance 
state to allow connecting TOO of different devices in 
parallel. 


TOO is driven on the falling edge of TCK during the 
SHIFT-IR and SHIFT-DR TAP controller states. At 
all other times TOO is driven to the high impedance 
state. 


The Intel486 OX2 microprocessor contains four 
write buffers to enhance the performance of consec- 
utive writes to memory. The buffers can be filled at a 
rate of one write per clock until all four buffers are 
filled. 


When all four buffers are empty and the bus is idle, a 
write request will propagate directly to the external 
bus bypassing the write buffers. If the bus is not 
available at the time the write is generated internally, 
the write will be placed in the write buffers and prop- 
agate to the bus as soon as the bus becomes avail- 
able. The write is stored in the on-chip cache imme- 
diately if the write is a cache hit. 


Writes will be driven onto the external bus in the 
same order in which they are received by the write 
buffers. Under certain conditions a memory read will 
go onto th'e external bus before the memory writes 
pending in the buffer even though the writes oc- 
curred earlier in the program execution. 


A memory read will only be reordered in front of all 
writes in the buffers under the following conditions: If 
all writes pending in the buffers are cache hits and 
the read is a cache miss. Under these conditions the 
Intel486 OX microprocessor will not read from an 
external memory location that needs to be updated 
by one of the pending writes. 


Reordering of a read with the writes pending in the 
buffers can only occur once before all the buffers 
are emptied. Reordering read once only maintains 
cache consistency. Consider the following example: 
The CPU writes to location X. Location X is in the 
internal cache, so it is updated there immediately. 
However, the bus is busy so the write out to main 
memory is buffered (see Figure 6.3(a». At this point, 
any reads to location X would be cache hits and 
most up-to-date data would be read. 


Main Memory 


~~atax 
Y 
data y 
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The next instruction causes a read to location Y. 
Location Y is not in the cache (a cache miss). Since 
the write in the write buffer is a cache hit, the read is 
reordered. When location Y is read, it is put into the 
cache. The possibility exists that location Y will re- 
place location X in the cache. If this is true, location 
X would no longer be cached (see Figure 6.3(b». 


Main MemoryG 


Cache consistency has been maintained up to this 
point. If a subsequent read is to location X (now a 
cache miss) and it was reordered in front of the buff- 
ered write to location X, stale data would be read. 
This is why only 1 read is allowed to be reordered. 
Once a read is reordered, all the writes in the write 
buffer are flagged as cache misses to ensure that no 
more reads are reordered. Since one of the condi- 
tions to reorder a read is that all writes in the write 
buffer must be cache hits, no more reordering is al- 
lowed until all of those flagged writes propogate to 
the bus. Similarly, if an invalidation cycle is run all 
entries in the write buffer are flagged as cache 
misses. 


For multiple processor systems and/or systems us- 
ing OMA techniques, such as bus snooping, locked 
semaphores should be used to maintain cache con- 
sistency. 
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6.3.1 
WRITE 
BUFFERS 
AND 1/0 CYCLES 


Input/Output (1/0) cycles must be handled in a dif- 
ferent manner by the write buffers. 


110 reads are never reordered in front of buffered 
memory writes. This insures that the Intel486 DX mi- 
croprocessor will update all memory locations be- 
fore reading status from an 110 device. 


The Intel486 DX microprocessor never buffers sin- 
gle 1/0 writes. When processing an OUT instruction, 
internal execution stops until the 1/0 write actually 
completes on the external bus. This allows time for 
the external system to drive an invalidate into the 
Intel486 DX microprocessor or to mask interrupts 
before the processor progresses to the instruction 
following OUT. REP OUTS instructions will be buff- 
ered. 


1/0 device recovery time must be handled slightly 
differently by the Intel486 DX microprocessor than 
with the Intel386 microprocessor. 1/0 device back- 
to-back write recovery times could be guaranteed by 
the Intel386 microprocessor by inserting a jump to 
the next instruction in the code that writes to the 
device. The jump forces the Intel386 microprocessor 
to generate a prefetch bus cycle which can't begin 
until the 110 write completes. 


Inserting a jump to the next write will not work with 
the Intel486 DX microprocessor because the pre- 
fetch could be satisfied by the on-chip cache. A read 
cycle must be explicitly generated to a non-cache- 
able location in memory to guarantee that a read 
bus cycle is performed. This read will not be allowed 
to proceed to the bus until after the 1/0 write has 
completed because 1/0 writes are not buffered. The 
1/0 device will have time to recover to accept anoth- 
er write during the read cycle. 


6.3.2 
WRITE 
BUFFERS 
IMPLICATIONS 
ON 
LOCKED 
BUS CYCLES 


Locked bus cycles are used for read-modify-write 
accesses to memory. During a read-modify-write ac- 
cess, a memory base variable is read, modified and 
then written back to the same memory location. It is 
important that no other bus cycles, generated by 
other bus masters or by the Intel486 DX microproc- 
essor itself, be allowed on the external bus between 
the read and write portion of the locked sequence. 


During a locked read cycle the Intel486 DX micro- 
processor will always access external memory, it will 
never look for the location in the on-Chipcache, but 
for write cycles, data is written in the internal cache 
(if cache hit) and in the external memory. All data 
pending in the Intel486 DX microprocessor's write 
buffers will be written to memory before a locked 
cycle is allowed to proceed to the external bus. 


The Intel486 DX microprocessor will assert the 
LOCK# pin after the write buffers are emptied dur- 
ing a locked bus cycle. With the LOCK# pin assert- 
ed, the microprocessor will read the data, operate 
on the data and place the results in a write buffer. 
The contents of the write buffer will then be written 
to external memory. LOCK# will become inactive 
after the write part of the locked cycle. 


6.4 
Interrupt and Non-Maskable 
Interrupt Interface 


The Intel486 DX microprocessor provides two asyn- 
chronous interrupt inputs, INTR (interrupt request) 
and NMI (non-maskable interrupt input). This section 
describes the hardware interface between the in- 
struction execution unit and the pins. For a descrip- 
tion of the algorithmic response to interrupts refer to 
Section 2.7. For interrupt timings refer to Section 
7.2.10. 


The Intel486 DX microprocessor contains a two- 
clock synchronizer on the interrupt line. An interrupt 
request will reach the internal instruction execution 
unit two clocks after the INTR pin is asserted, if 
proper setup is provided to the first stage of the syn- 
chronizer. 


There is no special logic in the interrupt path other 
than the synchronizer. The INTR signal is level sen- 
sitive and must remain active for the instruction exe- 
cution unit to recognize it. The interrupt will not be 
serviced by the Intel486 DX microprocessor if the 
INTR signal does not remain active. 


The instruction execution unit will look at the state of 
the synchronized interrupt signal at specific clocks 
during the execution of instructions (if interrupts are 
enabled). These specific clocks are at instruction 
boundaries, or iteration boundaries in the case of 
string move instructions. Interrupts will only be ac- 
cepted at these boundaries. 


An interrupt must be presented to the Intel486 DX 
microprocessor INTR pin three clocks before the 
end of an instruction for the interrupt to be acknowl- 
edged. Presenting the interrupt 3 clocks before the 
end of an instruction allows the interrupt to pass 
through the two clock synchronizer leaving one 
clock to prevent the initiation of the next sequential 
instruction and to begin interrupt service. If the inter- 
rupt is not received in time to prevent the next in- 
struction, it will be accepted at the end of next in- 
struction, assuming INTR is still held active. The in- 
terrupt service microcode will start after two dead 
clocks. 


The longest 
latency 
between 
when 
an interrupt 
re- 
quest 
is presented 
on the 
INTR pin and when 
the 
interrupt 
service 
begins 
is: longest 
instruction 
used 
+ the two clocks 
for synchronization 
+ one clock 
required 
to vector 
into the 
interrupt 
service 
micro- 
code. 


The NMI pin has a synchronizer 
like that used on the 
INTR line. Other than the synchronizer, 
the NMllog- 
ic is different 
from that of the maskable 
interrupt. 


NMI is edge triggered 
as opposed 
to the level trig- 
gered INTR signal. The rising edge of the NMI signal 
is used to generate 
the interrupt 
request. 
The NMI 
input need not remain active until the interrupt 
is ac- 
tually 
serviced. 
The 
NMI pin only 
needs 
to remain 
active 
for a single 
clock 
if the 
required 
setup 
and 
hold times are met. NMI will operate 
properly 
if it is 
held active for an arbitrary 
number 
of clocks. 


The NMI input must be held inactive 
for at least four 
clocks 
after it is asserted 
to reset the edge triggered 
logic. A subsequent 
NMI may not be generated 
if the 
NMI is not held inactive 
for at least two clocks 
after 
being asserted. 


The 
NMI 
input 
is internally 
masked 
whenever 
the 
NMI 
routine 
is entered. 
The 
NMI input 
will remain 
masked 
until an IRET (return from interrupt) 
instruc- 
tion 
is executed. 
Masking 
the 
NMI signal 
prevents 
recursive 
NMI calls. If another 
NMI occurs 
while the 
NMI is masked off, the pending 
NMI will be executed 
after the current 
NMI is done. Only one NMI can be 
pending 
while NMI is masked. 


The Intel486 
OX2 microprocessor 
has a built in self 
test (BIST) that can be run during reset. The BIST is 
invoked 
if the AHOLO 
pin is asserted 
for 2 clocks 
before and 2 clocks 
after RESET is deasserted. 
RE- 
SET 
must 
be active 
for 
15 clocks 
with 
or without 
BIST enabled. 
Refer 
to Section 
8.0 for information 
on Intel486 
OX2 microprocessor 
testability. 
To en- 
sure proper 
results, 
FLUSH # must not be asserted 
while 
BIST is executing. 


The Intel486 
OX microprocessor 
registers 
have the 
values 
shown 
in Table 
1.5 
after 
RESET 
is 
per- 
formed. 
The 
EAX register 
contains 
information 
on 
the success 
or failure 
of the BIST if the self test is 
executed. 
The OX register always contains 
a compo- 
nent identifier 
at the conclusion 
of RESET. 
The up- 
per byte of OX (OH) will contain 
04 and the lower 
byte (OL) will contain 
a stepping 
identifier 
(see Table 
1.5). The floating 
point 
registers 
are initialized 
as if 
the 
FINIT /FNINIT 
(initialize 
processor) 
instruction 


was executed 
if the BIST was performed. 
If the BIST 
is not executed, 
the floating 
point registers 
are un- 
changed. 
Table 
6.2. Register 
Values 
after 
Reset 


Register 
InitialValue 
InitialValue 
(BIST) 
(No Blst) 


EAX 
Zero (Pass) 
Undefined 


ECX 
Undefined 
Undefined 


EOX 
0400 + Revision10 
0400 + Revision10 
EBX 
Undefined 
Undefined 


ESP 
Undefined 
Undefined 
EBP 
Undefined 
Undefined 


ESI 
Undefined 
Undefined 


EOI 
Undefined 
Undefined 


EFLAGS 
0OOOOOO2h 
00000002h 


EIP 
OFFFOh 
OFFFOh 
ES 
OOOOh 
OOOOh 
CS 
FOOOho 
FOOOho 


SS 
OOOOh 
OooOh 


OS 
OOooh 
OOOOh 
FS 
OOOOh 
ooOOh 
GS 
ooOOh 
OOOOh 
10TR 
Base = 0,Limit=3FFh 
Base = 0,Limit=3FFh 
CRO 
60000010h 
6000oo10h 
OR7 
ooooooOOh 
OOOOOOOOh 


CW 
037Fh 
Unchanged 


SW 
OOOOh 
Unchanged 
TW 
FFFFh 
Unchanged 
FIP 
OOOOOOOOh 
Unchanged 
FEA 
OOOOOOOOh 
Unchanged 
FCS 
OOOOh 
Unchanged 
FOS 
OOOOh 
Unchanged 
FOP 
OOOh 
Unchanged 
FSTACK 
Undefined 
Unchanged 


Intel486 
OX2 CPU 
Component 
Revision 
Stepping 
Name 
10 
10 


A 
04h 
32h 


B 
04h 
33h 


The Intel486 
OX microprocessor 
will start executing 
instructions 
at location 
FFFFFFFOH 
after 
RESET. 


When the first InterSegment 
Jump or Call is execut- 
ed, address lines A20-A31 
will drop LOW for CS-rel- 


ative 
memory 
cycles, 
and 
the 
Intel486 
OX micro- 


processor 
will only execute 
instructions 
in the lower 
one Mbyte of physical 
memory. 
This allows the sys- 


tem designer 
to use a ROM at the top of physical 
memory 
to 
initialize 
the 
system 
and 
take 
care 
of 
RESETs. 


RESET forces the Intel486 OX microprocessor to 
terminate all execution and local bus activity. No in- 
struction or bus activity will occur as long as RESET 
is active. 


The Intel486 DX2 microprocessor recognizes and 
can respond to HOLD, AHOLD, and BOFFII 
re- 
quests regardless of the state of RESET.Thus, even 
though the processor is in reset, it can still float its 
bus in response to any of these requests. 


While in reset, the Intel486 OX microprocessor bus 
is in the state shown in Figure 6.4 if the HOLD, 
AHOLD and BOFFII requests are inactive. Note that 
the address (A31-A2, BE311-BEOII) and cycle defi- 
nition (M/IOII, 
D/CII, 
W/RII) 
pins are undefined 
from the time reset is asserted up to the start of the 
first bus cycle. All undefined pins (except FERRII) 
assume known values at the beginning of the first 
bus cycle. The first bus cycle is always a code fetch 
to address FFFFFFFOH. FERRII reflects the state 
of the ES (error summary status) bit in the floating 
point unit status word. The ES bit is initialized when- 
ever the floating point unit state is initialized. The 
floating point unit's status word register can be ini- 
tialized by BIST or by executing FINITIFNI NIT in- 
struction. Thus, after reset and before executing the 
first FINIT or FNINIT instruction, the values of the 
FERRII and the numeric status word register bits 
0-7 depends on whether or not BIST is performed. 
Table 6-4 shows the state of FERRII signal after 
reset and before the execution of the FINITIFNINIT 
instruction. 


BIST 
FERRII 
FPU Status 


Performed 
Pin 
Word Register 
Bits 0-7 


YES 
Inactive 
Inactive 
(High) 
(Low) 


NO 
Undefined 
Undefined 
(Low or High) 
(Low or High) 
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After the first FINIT or FNINIT instruction, FERRII 
pin and the FPU status word register bits (0-7) will 
be inactive irrespective of the Built-In Self-Test 
(BIST). 


Power 
Down Mode (Upgrade 
Processor 
Support) 


The Power Down Mode on the Intel486 DX2 micro- 
processor, when initiated by the upgrade processor, 
reduces the power consumption of the Intel486 DX2 
CPU (see Table 14-2 D.C. Specifications), as well as 
forces all of its output signals to be 3-stated. The 
UPII 
pin on the Intel486 DX2 microprocessor is 
used for enabling the Power Down Mode. 


Once the UPII pin is driven active by the upgrade 
processor upon power-up, the Intel486 DX2 micro- 
processor's bus is floated immediately. The Intel486 
DX2 CPU enters the Power Down Mode when the 
UPII pin is sampled asserted in the clock before the 
falling edge of RESET. The UPII pin has no effect 
on the power down status, except during this edge. 
The Intel486 DX2 CPU then remains in the Power 
Down Mode until the next time the RESET signal is 
activated. For warm resets, with the upgrade proces- 
sor in the system, the Intel486 DX2 CPU will remain 
3-stated and re-enter the Power Down Mode once 
RESET is de-asserted. Similarly for power-up resets, 
if the upgrade processor is not taken out of the sys- 
tem, the Intel486 DX2 CPU will 3-state its outputs 
upon sensing the UPII pin active and enter the Pow- 
er Down Mode after the falling edge of RESET. 


- 
~ 
ClK 
~ 
liiiiJ 
IF' 
RESET 
<D 
<:::I 
~ 
'>. 
<:::I 
AHOlD 
~ 
FlUSH* 
~ 
<-,1K) 
~ 
FlUSH* 
~ 
(.')'M) 


A20W* 


(.yne) 


A20W' 
(.'me) 


"Tlis 
ADS* 
eIi 
CJl 
BREO 
:. 


"0 
A31-Ao4' 
WIO*, 
BLAST 
SO 
tn 
BEO-BE3*, 
PWT. PCD 
.. 
A3• "'2' PLOCK. 
II.. 
D/C*, 
W/R* 
CD 
ClI 
PCHK*, 
Q. 


LOCK- 
e~ 
~,-Do· 
::I 
co 
DPO-3 
::D 
@ 
m 
HlDA 
tnm~ 


_________ 
/_77_/ 


--------->->->-->}---------------------------------------------------_. 


NOTES: 
1. RESET is an asynchronous 
input. t20 must be met only to guarantee 
recognition 
on a specific 
clock 
edge. 


2a. When A20M # is driven synchronously, 
it must be driven high (inactive) 
for the ClK 
edge prior to the falling edge of RESET to ensure 
proper 
operation. 
A20M # 
setup and hold times 
must be met. 


2b. When A20M# 
is driven asynchronously, 
it must be driven 
high (inactive) 
for two ClKs 
prior to and two ClKs 
after the falling 
edge of RESET 
to ensure 
proper 
operation. 
3a. When 
FlUSH# 
is driven 
synchronously, 
it should 
be driven 
low (active) 
for the ClK 
edge prior to the falling 
edge of RESET 
to invoke 
the 3-state 
Output 
Test 
Mode. All outputs 
are guaranteed 
3-stated 
within 
10 ClKs 
of RESET being deasserted. 
FlUSH# 
setup and hold times 
must be met. 


3b. When FlUSH# 
is driven asynchronously, 
it must be driven low (active) for two ClKs 
prior to and two ClKs 
after the falling edge of RESET to invoke the 3-state 
Output Test Mode. All outputs 
are guaranteed 
3-stated 
within 
10 ClKs 
of RESET being deasserted. 
4. AHOlD 
should 
be driven 
high (active) 
for the ClK 
edge prior to the falling 
edge of RESET to invoke 
the Suilt-In-Self-Test 
(SIST). AHOlD 
setup and hold times 


must be met. 
5. Hold is recognized 
normally 
during RESET. 
On power-up 
HlDA 
is indeterminate 
until RESET 
is recognized 
by the CPU. 
6. 15 ClKs 
RESET 
pulse width for warm resets. 
Power-up 
resets 
require 
RESET to be asserted 
for at least 1 ms after Vcc 
and ClK 
are stable. 
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All data transfers occur as a result of one or more 
bus cycles. logical data operands of byte, word and 
dword lengths may be transferred without restric- 
tions on physical address alignment. Oata may be 
accessed at any byte boundary but two or three cy- 
cles may be required for unaligned data transfers. 
See Section 7.1.3 Oynamic Bus Sizing and 7.1.6 Op- 
erand Alignment. 


The Intel486 OXmicroprocessor address signals are 
split into two components. High-order address bits 
are provided by the address lines, A2-A31. The byte 
enables, BEO#-BE3#, 
form the low-order address 
and provide linear selects for the four bytes of the 
32-bit address bus. 


The byte enable outputs are asserted when their as- 
sociated data bus bytes are involved with the pres- 
ent bus cycle, as listed in Table 7.1. Byte enable 
patterns which have a negated byte enable separat- 
ing two or three asserted byte enables will never 
occur (see Table 7.5). All other byte enable patterns 
are possible. 


Table 7.1. Byte Enables and Associated 
Data and Operand Bytes 


Byte 
Enable 
Associated Data Bus Signals 
Signal 


BEO# 
00-07 
(byte O-Ieast significant) 


BE1# 
08-015 
(byte 1) 


BE2# 
016-023 
(byte 2) 


BE3# 
024-031 
(byte 3-most 
significant) 


Address bits AO and A1 of the physical operand's 
base address can be created when necessary. Use 
of the byte enables to create AOand A1 is shown in 
Table 7.2. The byte enables can also be decoded to 
generate BlE # (byte low enable) and BHE# (byte 
high enable). These signals are needed to address 
16-bit memory systems (see Section 7.1.4 Inter- 
facing with 8- and 16-bit memories). 


Table 7.2. Generating AO-A31 from 
BEO#-BE3# 
andA2-A31 


Intel486™ 
OX CPU Address 
Signals 


A31 
......... 
A2 
BE3*' 
BE2# 
BE1# 
BEO# 


Physical 
Base 


Address 


A31 
......... 
A2 
A1 
AO 


A31 
......... 
A2 
0 
0 
X 
X 
X 
Low 


A31 
......... 
A2 
0 
1 
X 
X 
Low 
High 


A31 
......... 
A2 
1 
0 
X 
Low 
High 
High 


A31 
......... A2 
1 
1 
Low 
High 
High 
High 


Bus cycles may access physical memory space or 
I/O space. Peripheral devices in the system may ei- 
ther be memory-mapped, or I/O-mapped, or both. 
Physical memory addresses range from OOOOOOOOH 
to FFFFFFFFH (4 gigabytes). I/O addresses range 
from OOOOOOOOH 
to OOOOFFFFH(64 Kbytes) for pro- 
grammed I/O. See Figure 7.1. 
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The Intel486 OX microprocessor data path to memo- 
ry and input/output (110) spaces can be 32-, 16- or 
8-bits wide. The byte enable signals, 8EO#-8E3#, 
allow byte granularity when addressing any memory 
or I/O structure whether 8, 16 or 32 bits wide. 


The Intel486 OX microprocessor includes bus con- 
trol pins, 8516# 
and 858#, which allow direct con- 
nection to 16- and 8-bit memories and I/O devices. 
Cycles to 32-, 16- and 8-bit may occur in any se· 
quence, since the 858# 
a"d 8516# 
signals are 
sampled during each bus cycle. 


32-bit wide memory and I/O spaces are organized 
as arrays of physical 4-byte words. Each memory or 
I/O 4-byte word has four individually addressable 
bytes at consecutive byte addresses (see Figure 
7.2). The lowest addressed byte is associated with 
data signals 00-07; 
the highest-addressed byte 
with 024-031. 
Physical 4-byte words begin at ad- 
dresses divisible by four. 


16·Blt Wide Organization 


FFFFFFFFHrnFFFFFFFEH 


00000001 
H '-. 
• 
.....-' OOOOOOOOH 


Figure 7.2. Physical Memory 
and 1/0 Space Organization 
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16-bit memories are organized as arrays of physical 
2-byte words. Physical 2-byte words begin at ad- 
dresses divisible by two. The byte enables BEO#- 
BE3#, must be decoded to A1, BLE# and BHE# to 
address 16-bit memories (see 5ection 7.1.4). 


To address 8-bit memories, the two low order ad- 
dress bits AOand A1, must be decoded from BEO#- 
BE3#. The same logic can be used for 8- and 16-bit 
memories since the decoding logic for BLE# and AO 
are the same (see 5ection 7.1.4). 


Dynamic data bus sizing is a feature allowing proc- 
essor connection to 32-, 16- or 8-bit buses for mem- 
ory or 110. A processor may connect to all three bus 
sizes. Transfers to or from 32-, 16- or 8-bit devices 
are supported by dynamically determining the bus 
width during each bus cycle. Address decoding cir- 
cuitry may assert B516# 
for 16-bit devices, or 
B58# for 8-bit devices during each bus cycle. B58# 
and B516# must be negated when addressing 32- 
bit devices. An 8-bit bus width is selected if both 
B516# and B58# are asserted. 


B516# and B58# force the Intel486 OX microproc- 
essor to run additional bus cycles to complete re- 
quests larger than 16- or 8 bits. A 32-bit transfer will 
be converted into two 16-bit transfers (or 3 transfers 
if the data is misaligned) when B516# is asserted. 
Asserting B58# 
will convert a 32-bit transfer into 
four 8-bit transfers. 


Extra cycles forced by B516# or B58# should be 
viewed as independent bus cycles. B516# or B58# 
must be driven active during each of the extra cycles 
unless the addressed device has the ability to 
change the number of bytes it can return between 
cycles. 
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The Intel486 OX microprocessor will drive the byte 
enables appropriately during extra cycles forced by 
B58# 
and B516#. A2-A31 
will not change if ac- 
cesses are to a 32-bit aligned area. Table 7.3 shows 
the set of byte enables that will be generated on the 
next cycle for each of the valid possibilities of the 
byte enables on the current cycle. 


The dynamic bus sizing feature of the Intel486 OX 
microprocessor is significantly different than that of 
the Intel386 microprocessor. Unlike the Intel386 mi- 
croprocessor, the Intel486 OX microprocessor re- 
quires that data bytes be driven on the addressed 
data pins. The simplest example of this function is a 
32-bit aligned, B516# read. When the Intel486 OX 
microprocessor reads the two high order bytes, they 
must be driven on the data bus pins 016-031. 
The 
Intel486 OX microprocessor expects the two low or- 
der bytes on 00-015. 
The Intel386 microprocessor 
expects both the high and low order bytes on 00- 
015. The Intel386 microprocessor always reads or 
writes data on the lower 16 bits of the data bus when 
B516# is asserted. 


The external system must contain buffers to enable 
the Intel486 OX microprocessor to read and write 
data on the appropriate data bus pins. Table 7.4 
shows the data bus lines where the Intel486 OX mi- 
croprocessor expects data to be returned for each 
valid combination of byte enables and bus sizing op- 
tions. 


Valid data will only be driven onto data bus pins cor- 
responding to active byte enables during write cy- 
cles. Other pins in the data bus will be driven but 
they will not contain valid data. Unlike the Intel386 
microprocessor, the Intel486 OX microprocessor will 
not duplicate write data onto parts of the data bus 
for which the corresponding byte enable is negated. 
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BE3# 
BE2# 
BEH 
BEO# 
wlo Bs8# IBs16# 
wBs8# 
WBs16# 
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7.1.4 INTERFACING WITH 8·, 16- AND 32-BIT 
MEMORIES 


In 32-bit physical memories such as Figure 7.3, each 
4-byte word begins at a byte address that is a multi- 
ple of four. A2-A31 are used as a 4-byte word se- 
lect. BEO#-BE3# 
select individual bytes within the 
4-byte word. B88# and B816# are negated for all 
bus cycles involving the 32-bit array. 


Figure 7.3. Intel486TM OX Microprocessor 
with 32-Blt Memory 


16- and 8-bit memories require external byte swap- 
ping logic for routing data to the appropriate data • 
lines and logic for generating BHE#, BLE# and A1. 
In systems where mixed memory widths are used, 
extra address decoding logic is necessary to assert 
B816# or B88#. 


Figure 7.4 shows the Intel486 OX microprocessor 
address bus interface to 32·, 16- and 8·bit memo- 
ries. To address 16·bit memories the byte enables 
must be decoded to produce A1, BHE# and BLE# 
(AO).For 8-bit wide memories the byte enables must 
be decoded to produce AO and A1. The same byte 
select logic can be used in 16- and 8-bit systems 
since BLE# is exactly the same as AO (see Table 
7.5). 


BEO#-BE3# 
can be decoded as shown in Table 
7.5 to generate A1, BHE# and BLE#. The byte se- 
lect logic necessary to generate BHE# and BLE# is 
shown in Figure 7.5. 


Intel486 
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Bus (A31-A2 
BEOIO-BE31O) 
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Microprocessor 
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5810 1 
1BS1610 
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16-Bit 


Decode 
BHEIO, BlElO, 
A 1 
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BEOIO-BE31O 
Byt. 
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AO(BlEIO). 
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8-Bit 


A31-A2 
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Intel486™ 
OX CPU Signals 
8, 16-Blt Bus Signals 
Comments 
BE3# 
BE2# 
BE1# 
BEO# 
A1 
BHE# 
BLE# 
(AO) 
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BLE# asserted when 00-07 
of 16-bit bus is active. 


BHE# asserted when 08-015 
of 16-bit bus is active. 


A1 low for all even words; A1 high for all odd words. 


Key: 
x = don't care 
H = high voltage level 
L = low voltage level 
* = a non-occurring pattern of Byte Enables; either none are asserted, 
or the pattern has Byte Enables asserted for non-contiguous bytes 


~ 
~ 
241245-38 


BE1# 
~ 
~ 
241245-39 


Combinations of BEO#-BE3# 
which never occur 
are those in which two or three asserted byte en- 
ables are separated by one or more negated byte 
enables. These combinations are "don't care" con- 
ditions in the decoder. A decoder can use the non- 
occurring BEO#-BE3# 
combinations to its best ad- 
vantage. 


Figure 7.6 shows a Intel486 OXmicroprocessor data 
bus interface to 16- and 8-bit wide memories. Exter- 
nal byte swapping logic is needed on the data lines 
so that data is supplied to, and received from the 
Intel486 OXmicroprocessor on the correct data pins 
(see Table 7.4). 
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7.1.5 DYNAMIC BUS SIZING DURING CACHE 
LINE FILLS 


BS8# and B816# can be driven during cache line 
fills. The Intel486 OX microprocessor will generate 
enough 8- or 16-bit cycles to fill the cache line. This 
can be up to 16 8-bit cycles. 


The external system should assume that all byte en- 
ables are active for the first cycle of a cache line fill. 
The Intel486 OX microprocessor will generate prop- 
er byte enables for subsequent cycles in the line fill. 
Table 7.6 shows the appropriate AO(BLE#), A1 and 
BHE# for the various combinations of the Intel486 
OX microprocessor byte enables on both the first 
and subsequent cycles of the cache line fill. The ••••• 
marks all combinations of byte enables that will be 
generated by the Intel486 OX microprocessor during 
a cache line fill. 


Physical 4-byte words begin at addresses that are 
multiples of four. It is possible to transfer a logical 
operand that spans more than one physical 4-byte 
word of memory or I/O at the expense of extra cy- 
cles. Examples are 4-byte operands beginning at ad- 
dresses that are not evenly divisible by 4, or 2-byte 
words split between two physical 4-byte words. 
These are referred to as unaligned transfers. 


Operand alignment and data bus size dictate when 
multiple bus cycles are required. Table 7.7 describes 
the transfer cycles generated for all combinations of 
logical operand lengths, alignment, and data bus siz- 
ing. When multiple cycles are required to transfer a 
multi-byte logical operand, the highest-order bytes 
are transferred first. For example, when the proces- 
sor does a 4-byte unaligned read beginning at loca- 
tion x11 in the 4-byte aligned space, the three high 
order bytes are read in the first bus cycle. The low 
byte is read in a subsequent bus cycle. 


BE3# 
BE2# 
BE1# 
BEO# 
First Cache Fill Cycle 
Any Other Cycle 
AO 
A1 
BHE# 
AO 
A1 
BHE# 


1 
1 
1 
0 
0 
0 
0 
0 
0 
1 
1 
1 
0 
0 
0 
0 
0 
0 
0 
0 
1 
0 
0 
0 
0 
0 
0 
0 
0 
0 
·0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
1 
1 
0 
1 
0 
0 
0 
1 
0 
0 
1 
0 
0 
1 
0 
0 
0 
1 
0 
0 
·0 
0 
0 
1 
0 
0 
0 
1 
0 
0 
1 
0 
1 
1 
0 
0 
0 
0 
1 
1 
·0 
0 
1 
1 
0 
0 
0 
0 
1 
0 
·0 
1 
1 
1 
0 
0 
0 
1 
1 
0 


•• 


intel~ 


Byte-Length of Logical Operand 


1 
2 
4 


Physical Byte Address in 
xx 
00 
01 
10 
11 
00 
01 
10 
11 
Memory (Low Order Bits) 


Transfer Cycles 
b 
hb 
d 
hb 
hw 
h3 
over 32-Bit Bus 
w 
w 
w 
Ib 
13 
Iw 
Ib 


Transfer Cycles over 
Ib 
hb 
Iw 
hb 
hw 
mw 
16-Bit Data Bus 
b 
w 
hb 
w 
Ib 
hw 
Ib 
Iw 
hb 
= BS16# Asserted 
mw 
Ib 


Transfer Cycles over 
Ib 
hb 
mhb 
mlb 
8-Bit Data Bus 
b 
Ib 
Ib 
Ib 
hb 
mlb 
Ib 
hb 
mhb 
= BS8# Asserted 
hb 
hb 
hb 
Ib 
mhb 
mlb 
Ib 
hb 


hb 
mhb 
mlb 
Ib 


KEY: 
b = byte transfer 
w = 2-byte transfer 
3 = 3-byte transfer 
d = 4-byte transfer 


h = high-order portion 
I = low-order portion 
m = mid-order portion 


The function of unaligned transfers with dynamic 
bus sizing is not obvious. When the external systems 
asserts BS16# or BS8# forcing extra cycles, low- 
order bytes or words are transferred first (opposite 
to the example above). When the Intel486 DX micro- 
processor requests a 4-byte read and the external 
system asserts BS16#. the lower 2 bytes are read 
first followed by the upper 2 bytes. 


In the unaligned transfer described above, the proc- 
essor requested three bytes on the first cycle. If the 
external system asserted BS16# during this 3-byte 
transfer, the lower word is transferred first followed 
by the upper byte. In the final cycle the lower byte of 
the 4-byte operand is transferred as in the 32-bit ex- 
ample above. 


The Intel486 DX microprocessor supports a wide va- 
riety of bus transfers to meet the needs of high per- 
formance systems. Bus transfers can be single cycle 
or multiple cycle, burst or non-burst, cacheable or 
non-cacheable, 8-, 16- or 32-bit, and pseudo-locked. 
To support multiprocessing systems there are cache 
invalidation cycles and locked cycles. 


4-ByteOperand 
~ 
t 
t 
byte with 
byte with 
lowest 
highest 
address 
address 


This section begins with basic non-cacheable non- 
burst single cycle transfers. It moves on to multiple 
cycle transfers and introduces the burst mode. 
Cacheability is introduced in Section 7.2.3. The re- 
maining sections describe locked, pseudo-locked, 
invalidate, bus hold and interrupt cycles. 


Bus cycles and data cycles are discussed in this 
section. A bus cycle is at least two clocks long and 
begins with ADS# active in the first clock and ready 
active in the last clock. Data is transferred to or from 
the Intel486 DX microprocessor during a data cycle. 
A bus cycle contains one or more data cycles. 


Refer to Section 7.2.13 for a description of the bus 
states shown in the timing diagrams. 


7.2.1 NON-CACHEABLE NON-BURST SINGLE 
CYCLE 


7.2.1.1 No Wait States 


The fastest non-burst bus cycle that the Intel486 DX 
microprocessor supports is two clocks long. These 
cycles are called 2-2 cycles because reads and 
writes take two cycles each. The first 2 refers to 


reads and the second to writes. For example, if a 
wait state needs to be added to a write, the cycle 
would be called 2-3. 


Basic two clock read and write cycles are shown in 
Figure 7.7. The Intel486 OX microprocessor initiates 
a cycle by asserting the address status signal 
(AOS#) at the rising edge of the first clock. The 
AOS# output indicates that a valid bus cycle defini- 
tion and address is available on the cycle definition 
lines and address bus. 


The non-burst ready input (ROY#) is returned by the 
external system in the second clock. ROY# indi- 
cates that the external system has presented valid 
data on the data pins in response to a read or the 
external system has accepted data in response to a 
write. 


The Intel486 OX microprocessor samples ROY# at 
the end of the second clock. The cycle is complete if 
ROY# is active (LOW) when sampled. Note that 
ROY# is ignored at the end of the first clock of the 
bus cycle. 


The burst last signal (BLAST#) is asserted (LOW) 
by the Intel486 OX microprocessor during the sec- 
ond clock of the first cycle in all bus transfers illus- 
trated in Figure 7.7. This indicates that each transfer 
is complete after a single cycle. The Intel486 OX 
microprocessor asserts BLAST# in the last cycle of 
a bus transfer. 


The timing of the parity check output (PCHK#) is 
shown in Figure 7.7. The Intel486 OX microproces- 
sor drives the PCHK# output one clock after ready 
terminates a read cycle. PCHK# indicates the parity 
status for the data sampled at the end of the previ- 
ous clock. The PCHK# signal can be used by the 
external system. The Intel486 OX microprocessor 
does nothing in response to the PCHK# output. 


7.2.1.2 
Inserting 
Walt States 


The external system can insert wait states into the 
basic 2-2 cycle by driving ROY# inactive at the end 
of the second clock. ROY# must be driven inactive 
to insert a wait state. Figure 7.8 illustrates a simple 
non-burst, non-cacheable signal with one wait state 
added. Any number of wait states can be added to 
an Intel486 OX microprocessor bus cycle by main- 
taining ROY# inactive. 


The burst ready input (BROY#) must be driven inac· 
tive on all clock edges where ROY# is driven inac- 
tive for proper operation of these simple non-burst 
cycles. 


7.2.2 
MULTIPLE 
AND BURST 
CYCLE 
BUS 
TRANSFERS 


Multiple cycle bus transfers can be caused by inter- 
nal requests from the Intel486 OX microprocessor or 
by the external memory system. An internal request 
for a 64-bit floating point load or a 128·bit pre-fetch 
must take more than one cycle. Internal requests for 
unaligned data may also require multiple bus cycles. 
A cache line fill requires multiple cycles to complete. 
The external system can cause a multiple cycle 
transfer when it can only supply 8 or 16 bits per 
cycle. 


Only multiple cycle transfers caused by internal re- 
quests are considered in this section. Cacheable cy- 
cles and 8- and 16-bit transfers are covered in Sec- 
tions 7.2.3 and 7.2.5. 


7.2.2.1 
Burst Cycles 


The Intel486 OX microprocessor can accept burst 
cycles for any bus requests that require more than a 
single data cycle. Ouring burst cycles, a new data 
item is strobed into the Intel486 OX microprocessor 
every clock rather than every other clock as in non- 
burst cycles. The fastest burst cycle requires 2 
clocks for the first data item with subsequent data 
items returned every clock. 


The Intel486 OX microprocessor is capable of burst- 
ing a maximum of 32 bits during a write. Burst writes 
can only occur if BS8# or BS16# is asserted. For 
example, the Intel486 OX microprocessor can burst 
write four 8-bit operands or two 16-bit operands in a 
single burst cycle. But the Intel486 OX microproces- 
sor cannot burst multiple 32·bit writes in a single 
burst cycle. 


Burst cycles begin with the Intel486 OX microproc- 
essor driving out an address and asserting AOS# in 
the same manner as non-burst cycles. The Intel486 
OX microprocessor indicates that it is willing to per- 
form a burst cycle by holding the burst last signal 
(BLAST#) inactive in the second clock of the cycle. 
The external system indicates its willingness to do a 
burst cycle by returning the burst ready signal 
(BROY#) active. 


The addresses of the data items in a burst cycle will 
all fall within the same 16·byte aligned area (corre- 
sponding to an internal Intel486 OX microprocessor 
cache line). A 16-byte aligned area begins at loca- 
tion XXXXXXXOand ends at location XXXXXXXF. 
Ouring a burst cycle, only BEO-3#, A2' and A3 may 
change. A4-A31, MilOI', 
O/C#, 
and W/R# 
will reo 


main stable throughout a burst. Given the first ad- 
dress in a burst, external hardware can easily calcu- 
late the address of subsequent transfers in advance. 
An external memory system can be designed to 
quickly fill the Intel486 OX microprocessor internal 
cache lines. 


• 


Intel486TM DX2 MICROPROCESSOR 
intel~ 


Ti 
T1 
T2 
Tl 
T2 
T1 
T2 
T1 
T2 
Ti 


ClK 


ADS# 
\ 
/ 
\ 
/ 
\ 
/ 
\ 
/ 


A2-A31 
Iot/IO# 
X 
X 
X 
X 
D/C# 
BEO-3# 


W/R# 
\ 
/ 
\ 
/ 


RDY# 


BlAST# 
X 
\ 
/ 
\ 
/ 
\ 
/ 
\ 
C 


DATA 
TO 
FROM CPU 
TO 
FROM CPU 
CPU 
CPU 
, 
I 
I 
, 


PCHK# 
I 
I 
, 


I Q 


I OJ 
I 
I 


READ 
WRITE 
READ 
WRITE 


241245-42 


Figure 7.7. Basic 2·2 Bus Cycle 
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Figure 7.8. Basic 3-3 Bus Cycle 
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Burst cycles are not limited to cache line fills. Any 
multiple cycle read request by the Intel486 OX mi- 
croprocessor can be converted into a burst cycle. 
The Intel486 OX microprocessor will only burst the 
number of bytes needed to complete a transfer. For 
example, eight bytes will be bursted in for a 64-bit 
floating point non-cacheable read. 


The external system converts a multiple cycle re- 
quest into a burst cycle by returning BROY# active 
rather than ROY# (non-burst ready) in the first cycle 
of a transfer. For cycles that cannot be bursted such 
as interrupt acknowledge and halt, BROY# has the 
same effect as ROY#. BROY# is ignored if both 
BROY# and ROY# are returned in the same clock. 
Memory areas and peripheral devices that cannot 
perform bursting must terminate cycles with ROY#. 


7.2.2.2 
Terminating 
Multiple 
and 
Burst Cycle Transfers 


The Intel486 OX microprocessor drives BLAST# in- 
active for all but the last cycle in a multiple cycle 
transfer. BLAST# is driven inactive in the first cycle 
to inform the external system that the transfer could 
take additional cycles. BLAST# is driven active in 
the last cycle of the transfer indicating that the next 
time BROY# or ROY# is returned the transfer is 
complete. 


BLAST# is not valid in the first clock of a bus cycle. 
It should be sampled only in the second and subse- 
quent clocks when ROY# or BROY# is returned. 


The number of cycles in a transfer is a function of 
several factors including the number of bytes the mi- 
croprocessor needs to complete an internal request 
(1, 2, 4, 8, or 16), the state of the bus size inputs 
(BS8# and BS16#), the state of the cache enable 
input (KEN#) and alignment of the data to be trans- 
ferred. 


When the Intel486 OX microprocessor initiates a re- 
quest it knows how many bytes will be transferred 
and if the data is aligned. The external system must 
tell the microprocessor whether the data is cache- 
able (if the transfer is a read) and the width of the 
bus by returning the state of the KEN#, BS8# and 
BS16# inputs one clock before ROY# or BROY# is 
returned. The Intel486 OX microprocessor deter- 
mines how many cycles a transfer will take based on 
its internal information and inputs from the external 
system. 


BLAST# is not valid in the first clock of a bus cycle 
because the Intel486 OX microprocessor cannot de- 
termine the number of cycles a transfer will take until 


the external system returns KEN# , BS8# 
and 
BS16#. 
BLAST# 
should only be sampled in the 
second and subsequent clocks of a cycle when the 
external system returns ROY# or BROY#. 


The system may terminate a burst cycle by returning 
ROY# 
instead of BROY#. 
BLAST# 
will remain 
deasserted until the last transfer. However, any 
transfers required to complete a cache line fill will 
follow the burst order, e.g., if burst order was 4, 0, C, 
8 and ROY# was returned at after 0, the next trans- 
fers will be from C and 8. 


7.2.2.3 
Non-Cacheable, 
Non-Burst, 
Multiple 
Cycle Transfers 


Figure 7.9 illustrates a 2 cycle non-burst, non-cache- 2 
able multiple cycle read. This transfer is simply a 
sequence of two single cycle transfers. The Intel486 
OX microprocessor indicates to the external system 
that this is a multiple cycle transfer by driving 
BLAST# inactive during the second clock of the first 
cycle. The external system returns ROY# active in- 
dicating that it will not burst the data. The external 
system also indicates that the data is not cacheable 
by returning KEN# inactive one clock before it re- 
turns ROY# active. When the Intel486 OX micro- 
processor samples ROY# active it ignores BROY#. 


Each cycle in the transfer begins when AOS# is 
driven active and the cycle is complete when the 
external system returns ROY# active. 


The Intel486 OX microprocessor indicates the last 
cycle of the transfer by driving BLAST# active. The 
next ROY# returned by the external system termi- 
nates the transfer. 


7.2.2.4 
Non-Cacheable 
Burst Cycles 


The external system converts a multiple cycle re- 
quest into a burst cycle by returning BROY# active 
rather than ROY# in the first cycle of the transfer. 
This is illustrated in Figure 7.10. 


There are several features to note in the burst read. 
AOS# is only driven active during the first cycle of 
the transfer. ROY# must be driven inactive when 
BROY# is returned active. 


BLAST# behaves exactly as it does in the non-burst 
read. BLAST# is driven inactive in the second clock 
of the first cycle of the transfer indicating more cy- 
cles to follow. In the last cycle·,BLAST# is driven 
active telling the external memory system to end the 
burst after returning the next BROY#. 
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Any memory read can become a cache fill operation. 
The external memory system can allow a read re- 
quest to fill a cache line by returning KEN# active 
one clock before ROY# or BROY# during the first 
cycle of the transfer on the external bus. Once 
KEN# is asserted and the remaining three require- 
ments described below are met, the Intel486 OX mi- 
croprocessor will fetch an entire cache line regard- 
less of the state of KEN#. KEN# must be returned 
active in the last cycle of the transfer for the data to 
be written into the internal cache. The Intel486 OX 
microprocessor will only convert memory reads or 
prefetches into a cache fill. 


KEN# is ignored during write or I/O cycles. Memory 
writes will only be stored in the on-chip cache if 
there is a cache hit. lID space is never cached in 
the' internal cache. 


To transform a read or a prefetch into a cache line 
fill the following conditions must be met: 
1. The KEN# pin must be asserted one clock pri· 
or to ROY# or BROY# being returned for the 
first data cycle. 
2. The cycle must be of the type that can be inter- 
nally cached. (Locked reads, I/O reads, and 
interrupt acknowledge cycles are never cach- 
ed). 
3. The page table entry must have the page 
cache disable bit (PCO) set to O. To cache a 
page table entry, the page directory must have 
PCO=O. To cache reads or prefetches when 
paging is disabled, or to cache the page direc- 
tory entry, control register 3 (CR3) must have 
PCO=O. 


4. The cache disable (CD) bit in control register 0 
(CRO)must be clear. 


External hardware can determine when the Intel486 
OX microprocessor has transformed a read or pre- 
fetch into a cache fill by examining the KEN# , 
MIIO#, 
O/C#, 
W/R#, 
LOCK#, 
and PCO pins. 
These pins convey to the system the outcome of 
conditions 1-3 in the above list. In addition, the In- 
tel486 OX drives PCO high whenever the CD bit in 
CRO is set, so that external hardware can evaluate 
condition 4. 


7.2.3.1 Byte Enables during a Cache Line Fill 


For the first cycle in the line fill, the state of the byte 
enables should be ignored. In a non-cacheable 
memory read, the byte enables indicate the bytes 
actually required by the memory or code fetch. 


The Intel486 OX microprocessor expects to receive 
valid data on its entire bus (32 bits) in the first cycle 
of a cache line fill. Data should be returned with the 
assumption that all the byte enable pins are driven 
active. However if BS8# is asserted only one byte 
need be returned on data lines 00-07. 
Similarly if 
BS16# is asserted two bytes should be returned on 
00-015. 


The Intel486 OX microprocessor will generate the 
addresses and byte enables for all subsequent cy- 
cles in the line fill. The order in which data is read 
during a line fill depends on the address of the first 
item read. Byte ordering is discussed in Section 
7.2.4. 


7.2.3.2 Non-Burst Cacheable Cycles 


Figure 7.11 shows a non·burst cacheable cycle. The 
cycle becomes a cache fill when the Intel486 OX 
microprocessor samples KEN# active at the end of 
the first clock. The Intel486 OX microprocessor 
drives BLAST# inactive in the second clock in re- 
sponse to KEN#. BLAST# is driven inactive be- 
cause a cache fill requires 3 additional cycles to 
complete. BLAST# remains inactive until the last 
transfer in the cache line fill. KEN# must be re- 
turned active in the last cycle of the transfer for the 
data to be written into the internal cache. 


Note that this cycle would be a single bus cycle if 
KEN# was not sampled active at the end of the first 
clock. The subsequent three reads would not have 
happened since a cache fill was not requested. 


The BLAST# output is invalid in the first clock of a 
cycle. BLAST# may be active during the first clock 
due to earlier inputs. Ignore BLAST# until the sec- 
ond clock. 


During the first cycle of the cache line fill the exter- 
nal system should treat the byte enables as if they 
are all active. In subsequent cycles in the burst, the 
Intel486 OX microprocessor drives the address lines 
and byte enables (see Section 7.2.4.2 for Burst and 
Cache Line Fill Order). 
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7.2.3.3 
Burst Cacheable 
Cycles 


Figure 7.12 illustrates a burst mode cache fill. As in 
Figure 7.11, the transfer becomes a cache line fill 
when the external system returns KEN# active at 
the end of the first clock in the cycle. 


The external system informs the Intel486 OX micro- 
processor that it will burst the line in by driving 
BROY# active at the end of the first cycle in the 
transfer. 


Note that during a burst cycle AOS# is only driven 
with the first address. 
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7.2.3.4 
Effect 
of Changing 
KEN # during a 
Cache 
Une Fill 
would be a cache line fill. Similarly, it uses the value 
of KEN# in the last cycle, before early ROY# to 
load the line just retrieved from the memory into the 
cache. KEN# is sampled every clock, it must satisfy 
setup and hold time. 
KEN# can change multiple times as long as it ar- 
rives at its final value in the clock before ROY# or 
BROY# is returned. This is illustrated in Figure 7.13. 
Note that the timing of BLAST# follows that of 
KEN# 
by one clock. The Intel486 OX samples 
KEN# every clock and uses the value returned in 
the clock before ready to determine if a bus cycle 


KEN# can also change multiple times before a burst 
cycle as long as it arrives at its final value one clock 
before ready is returned active. 
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Figure 7.13. Effect of Changing KEN# 


7.2.4 BURST MODE DETAILS 
data into the chip when either ROY# or BRDY# are 
active. Driving BRDY# and ROY# inactive adds a 


7.2.4.1 Adding Walt States to Burst Cycles 
wait state to the transfer. A burst cycle where two 
clocks are required for every burst item is shown in 


Burst cycles need not return data on every clock. 
Figure 7.14._ 


The Intel486 OX microprocessor will only strobe 
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7.2.4.2 Burst and Cache line 
Fill Order 


The burst order used by the Intel486 OX microproc- 
essor is shown in Table 7.7. This burst order is fol- 
lowed by any burst cycle (cache or not), cache line 
fill (burst or not) or code prefetch. 


This burst order is optimized for a two-way inter- 
leaved memory architecture. This means that if the 
memory is built as 64-bit (versus 32-bit) words which 
are multiplexed into the 32-bit data bus, the Intel486 
CPU will read all 64 bits before accessing the next 
location. 


The microprocessor presents each request for data 
in an order determined by the first address in the 
transfer. For example, if the first address was 104 
the next three addresses in the burst will be 100, 
10C and 108. 


First 
second 
Third 
Fourth 
Addr. 
Addr. 
Addr. 
Addr. 


0 
4 
8 
C 
4 
0 
C 
8 
8 
C 
0 
4 
C 
8 
4 
0 


An example of burst address sequencing is shown in 
Figure 7.15. 
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The sequences shown in Table 7.7 accommodate 
systems with 64-bit busses as well as systems with 
32-bit data busses. The sequence applies to all 
bursts, regardless of whether the purpose of the 
burst is to fill a cache line, do a 64-bit read, or do a 
pre-fetch. If either Bs8# 
or Bs16# 
is returned ac- 
tive, the Intel486 OX microprocessor completes the 
transfer of the current 32-bit word before progress- 
ing to the next 32-bit word. For example, a Bs16# 
burst to address 4 has the following order: 4-6-0-2- 
C-E-8-A. 


7.2.4.3 
Interrupted 
Burst Cycles 


Some memory systems may not be able to respond 
with burst cycles in the order defined in Table 7.7. 
To support these systems the Intel486 OX micro- 
processor allows a burst cycle to be interrupted at 


any time. The Intel486 OX microprocessor will auto- 
matically generate another normal bus cycle after 
being interrupted to complete the data transfer. This 
is called an interrupted burst cycle. The external sys- 
tem can respond to an interrupted burst cycle with 
another burst cycle. 


The external system can interrupt a burst cycle by 
returning ROY# instead of BROY#. ROY# can be 
returned after any number of data cycles terminated 
with BROY#. 


An example of an interrupted burst cycle is shown in 
Figure 7.16. The Intel486 OX microprocessor imme- 
diately drives AOs # active to initiate a new bus cy- 
cle after ROY# is returned active. BLAST# is driven 
inactive one clock after AOs# 
begins the second 
bus cycle indicating that the transfer is not complete. 
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KEN1/ need not be returned active in the first data 
cycle of the second part of the transfer in Figure 
7.16. The cycle had been converted to a cache fill in 
the first part of the transfer and the Intel486 OX mi- 
croprocessor expects the cache fill to be completed. 
Note that the first half and second half of the trans- 
fer in Figure 7.16 are each two cycle burst transfers. 


The order in which the Intel486 OX microprocessor 
requests operands during an interrupted burst trans- 
fer is determined by Table 7.7. Mixing ROYI/ and 
BROY1/ does not change the order in which oper- 
and addresses are requested by the Intel486 OX mi- 
croprocessor. 


An example of the order in which the Intel486 OX 
microprocessor requests operands during a cycle in 
which 
the 
external 
system 
mixes 
ROY1/ 
and 
BROY1/ is shown in Figure 7.17. The Intel486 OX 
microprocessor initially requests a transfer begin- 
ning at location 104. The transfer becomes a cache 
line fill when the external system returns KEN1/ ac- 
tive. The first cycle of the cache fill transfers the 
contents of location 104 and is terminated with 
ROY1/. The Intel486 OXmicroprocessor drives out a 
new request (by asserting AOSI/) to address 100. If 
the external system terminates the second cycle 
with BROY1/. the Intel486 OX microprocessor will 
next request! expect address 10C. The correct order 
is determined by the first cycle in the transfer, which 
may not be the first cycle in the burst if the system 
mixes ROY1/ with BROY1/. 
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Figure 7.17. Interrupted 
Burst Cycle with Unobvlous 
Order of Addresses 


Driving the BS16llf and BS8llf active can force the 
Intel486 DX microprocessor to run additional cycles 
to complete what would have been only a single 
32-bit cycle. BS8llf and BS16llf may change the 
state of BLASTllf when they force subsequent cy- 
cles from the transfer. 


The Intel486 DX microprocessor supports both 16- 
and 8-bit external busses through the BS16llf and 
BS8llf inputs. BS16llf and BS8# allow the external 
system to specify, on a cycle by cycle basis, whether 
the addressed component can supply 8, 16 or 32 
bits. BS16llf and BS8# can be used in burst cycles 
as well as non-burst cycles. If both BS16# 
and 
BSS# are returned active for any bus cycle, the In- 
tel486 DX microprocessor will respond as if only 
BSS# were active. 


The timing of BS16# and BS8# is the same as that 
of KEN#. BS16# and BS8llf must be driven active 
before the first RDYllf or BRDYllf is driven active. 


Figure 7.18 shows an example in which BS8# 
forces the Intel486 DX microprocessor to run two 
extra cycles to complete a transfer. The Intel486 DX 
microprocessor issues a request for 24 bits of infor- 
mation. The external system drives BS8# active in- 
dicating that only eight bits of data can be supplied 
per cycle. The Intel486 DX microprocessor issues 
two extra cycles to complete the transfer. 
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Extra cycles forced by the 8S16# and 8S8# should 
be viewed as independent bus cycles. 8S16# and 
8S8 # should be driven active for each additional 
cycle unless the addressed device has the ability to 
change the number of bytes it can return between 
cycles. The Intel486 OX microprocessor will drive 
8LAST# 
inactive until the last cycle before the 
transfer is complete. 


8S8# and 8S16# operate during burst cycles in ex- 
actly the same manner as non-burst cycles. For ex- 
ample, a single non-cacheable read could be trans- 
ferred by the Intel486 OX microprocessor as four 
8-bit burst data cycles. Similarly, a single 32-bit write 
could be written as four 8-bit burst data cycles. An 
example of a burst write is shown in Figure 7.19. 
8urst writes can only occur if 8S8# 
or 8S16# 
is 
asserted. 
Refer to Section 7.1.3 for the sequencing of ad- 
dresses while 8S8# or 8S16# are active. 


\ 
/ 


X 
: c 
I 


X 
X 
X 
x 
: c 


intel~ 


\ 
/ 


X 
/ 
I 
\ 
c- 


o 
I 
I 
( 
i FROM CPU 
)- 


241245-54 


Locked cycles are generated in software for any in- 
struction that performs a read-modify-write opera- 
tion. During a read-modify-write operation the proc- 
essor can read and modify a variable in external 
memory and be assured that the variable is not ac- 
cessed between the read and write. 


Locked cycles are automatically generated during 
certain bus transfers. The xchg (exchange) instruc- 
tion generates a locked cycle when one of its oper- 
ands is memory based. Locked cycles are generat- 
ed when a segment or page table entry is updated 
and during interrupt acknowledge cycles. Locked cy- 
cles are also generated when the LOCK instruction 
prefix is used with selected instructions. 


Locked cycles are implemented in hardware with the 
LOCK# pin. When LOCK# is active, the processor 
is performing a read-modify-write operation and the 
external bus should not be relinquished until the cy- 
cle is complete. Multiple reads or writes can be 
locked. A locked cycle is shown in Figure 7.20. 
LOCK# goes active with the address and bus defini- 
tion pins at the beginning of the first read cycle and 
remains active until ROY# is returned for the last 
write cycle. For unaligned 32 bits read-modify-write 
operation, the LOCK# remains active for the entire 
duration of the multiple cycle. It will go inactive when 
ROY# is returned for the last write cycle. 
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When LOCK# is active, the Intel486 DX microproc- 
essor will recognize address hold and backoff but 
will not recognize bus hold. It is left to the external 
system to properly arbitrate a central bus when the 
Intel486 DX microprocessor generates LOCK#. 


Pseudo-locked cycles assure that no other master 
will be given control of the bus during operand trans- 
fers which take more than one bus cycle. Examples 
include 64-bit floating point read and writes, 64-bit 
descriptor loads and cache line fills. 


Pseudo-locked 
transfers 
are 
indicated 
by 
the 
PLOCK# 
pin. The 
memory operands 
must be 
aligned for correct operation of a pseudo-locked cy- 
cle. 


PLOCK# need not be examined during burst reads. 
A 64-bit aligned operand can be retrieved in one 
burst (note: this is only valid in systems that do not 
interrupt bursts). 


The system must examine PLOCK# during 64-bit 
writes since the Intel486 DX microprocessor cannot 
burst write more than 32 bits. However, burst can be 
used within each 32-bit write cycle if BS8# 
or 
BS16# is asserted. BLAST will be deasserted in re- 
sponse to BS8# or BS16#. A 64-bit write will be 
driven out as two non-burst bus cycles. BLAST# is 
asserted during both writes since a burst is not pos- 
sible. 


PLOCK# is asserted during the first write to indicate 
that another write follows. This behavior is shown in 
Figure 7.21. 


The first cycle of a 64-bit floating point write is the 
only case in which both PLOCK# and BLAST# are 
asserted. Normally PLOCK# and BLAST# are the 
inverse of each other. 


During all of the cycles where PLOCK# is asserted, 
HOLD is not acknowledged until the cycle com- 
pletes. This results in a large HOLD latency, espe- 
cially when BS8# or BS16# is asserted. To reduce 
the HOLD latency during these cycles, windows are 
available between transfers to allow HOLD to be ac- 
knowledged 
during 
non-cacheable, 
non-bursted 
code prefetches. PLOCK# will be asserted since 
BLAST# is negated, but it is ignored and HOLD is 
recognized during the prefetch. 


PLOCK# can change several times during a cycle 
settling to its final value in the clock ready is re- 
turned. 


Invalidate cycles are needed to keep the Intel486 
DX microprocessor's internal cache contents con- 
sistent with external memory. The Intel486 DX mi- 
croprocessor contains a mechanism for listening to 
writes by other devices to external memory. When 
the processor finds a write to a Section of external 
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memory contained in its internal cache,the proces- 
sor's internal copy is invalidated. 


Invalidations use two pins, address hold request 
(AHOlD) 
and valid external address (EADS#). 
There are two steps in an invalidation cycle. First, 
the external system asserts the AHOlD input forcing 
the Intel486 DX microprocessor to immediately relin- 
quish its address bus. Next, the external system as- 
serts EADS# indicating that a valid address is on 
the 
Intel486 
DX microprocessor's 
address bus. 
EADS# and the invalidation address, Figure 7-22 
shows the fastest possible invalidation cycle. The 
Intel486 DX CPU recognizes AHOlD 
on one ClK 
edge and floats the address bus in response. To 
allow the address bus to float and avoid contention, 
EADS# and the invalidation address should not be 
driven until the following ClK edge. The microproc- 
essor reads the address over its address lines. If the 
microprocessor finds this address in its internal 
cache, the cache entry is invalidated. Note that the 
Intel486 DX microprocessor's address bus is input! 
output unlike the Intel386 microprocessor's bus, 
which is output only. 


The Intel486 DX microprocessor immediately relin- 
quishes its address bus in the next clock upon as- 
sertion of AHOlD. For example, the bus could be 3 
wait states into a read cycle. If AHOlD is activated, 
the Intel486 DX microprocessor will immediately 


float its address bus before ready is returned termi- 
nating the bus cycle. 


When AHOlD is asserted only the address bus is 
floated, the data bus can remain active. Data can be 
returned for a previously specified bus cycle during 
address hold (see Figures 7.22, 7.23). 


EADS# is normally asserted when an external mas- 
ter drives an address onto the bus. AHOlD need not 
be driven for EADS# to generate an internal invali- 
date. If EADS# alone is asserted while the Intel486 
DX microprocessor is driving the address bus, it is 
possible that the invalidation address will come from 
the Intel486 DX microprocessor itself. 


Note that it is also possible to run an invalidation 
cycle by asserting EADS# when HOLD or BOFF# 
is asserted. 


Running an invalidate cycle prevents the Intel486 
DX microprocessor cache from satisfying other inter- 
nal requests, so invalidations should be run only 
when necessary. The fastest possible invalidate cy- 
cle is shown in Figure 7.22, while a more realistic 
invalidation cycle is shown in 7.23. Both of the ex- 
amples take one clock of cache access from the 
rest of the Intel486 DX microprocessor. 
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Figure 7.22. Fast Internal 
Cache Invalidation 
Cycle 
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7.2.8.1 Rate of Invalidate Cycles 


The Intel486 OX microprocessor can accept one in- 
validate per clock except in the last clock of a line 
fill. One invalidate per clock is possible as long as 
EAOS# is negated in ONE or BOTH of the following 
cases: 
1. In the clock ROY# or BROY# is returned for 
the last time. 
2. In the clock following ROY# or BROY# being 
returned for the last time. 


This definition allows two system designs. Simple 
designs can restrict invalidates to one every other 
clock. The simple design need not track bus activity. 
Alternatively, systems can request one invalidate 
per clock provided that the bus is monitored. 


7.2.8.2 Running Invalidate Cycles Concurrently 
with Line Fills 


Precautions are necessary to avoid caching stale 
data in the Intel486 OX microprocessor's cache in a 
system with a second level cache. An example of a 
system with a second level cache is shown in Figure 
7.24. An external device can be writing to main 
memory over the system bus while the Intel486 OX 
microprocessor is retrieving data from the second 
level cache. The Intel486 OX microprocessor will 
need to invalidate a line in its internal cache if the 
external device is writing to a main memory address 
also contained in the Intel486 OX microprocessor's 
cache. 
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Figure 7.24. System with Second Level Cache 


A potential problem exists if the external device is 
writing to an address in external memory, and at the 
same time the Intel486 OX microprocessor is read- 
ing data from the same address in the second level 
cache. The system must force an invalidation cycle 
to invalidate the data that the Intel486 OX micro- 
processor has requested during the line fill. 


If the system asserts EAOS# before the first data in 
the line fill is returned to the Intel486 OX microproc- 
essor, the system must return data consistent with 
the new data in the external memory upon resump- 
tion of the line fill after the invalidation cycle. This is 
illustrated by the asserted EAOS# signal labeled 1 
in Figure 7.25. 


If the system asserts EAOS# at the same time or 
after the first data in the line fill is returned (in the 
same clock that the first ROY# or BROY# is re- 
turned or any subsequent clock in the line fill) the 
data will be read into the Intel486 OX microproces- 
sors input buffers but it will not be stored in the on- 
chip cache. This is illustrated by asserted EAOS# 
signal labeled 2"In Figure 7.25. The stale data will be 
used to satisfy the request that initiated the cache fill 
cycle. 
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NOTES: 
1. Data returned 
must be consistent 
if its address 
equals 
the invalidation 
address 
in this clock 
2. Data returned 
will not be cached 
if its address 
equals 
the invalidation 
address 
in this clock 


The Intel486 OX microprocessor provides a bus 
hold, hold acknowledge protocol using the bus hold 
request (HOLD) and bus hold acknowledge (HLOA) 
pins. Asserting the HOLD input indicates that anoth- 
er bus master desires control of the Intel486 OX mi- 
croprocessor's bus. The processor will respond by 
floating its bus and driving HLOA active when the 
current bus cycle, or sequence of locked cycles is 
complete. An example of a HOLO/HLOA transaction 
is shown in Figure 7.26a. Unlike the Intel386 micro- 
processor, the Intel486 OX microprocessor can re- 
spond to HOLD by floating its bus and asserting 
HLOA while RESET is asserted. 


Note that HOLD will be recognized during un-aligned 
writes (less than or equal to 32-bits) with BLAST# 
being active for each write. For greater than 32-bit or 
un-aligned write, HOLD# recognition is prevented 
by PLOCK# getting asserted. 


For cacheable and nonbursted or bursted cycles, 
HOLD is acknowledged during backoff only if HOLD 
and BOFF# are asserted during an active bus cycle 
(after AOS# asserted) and before the first ROYI' or 
BROY# has been returned (see Figure 7.26b), The 
order in which HOLD and BOFF# go active is unim- 
portant (so long as both are active prior to the first 
ROY# IBROY # 
returned by the system). Figure 
7.26b shows the case where HOLD is asserted first; 
HOLD could be asserted simultaneously or after 
BOFF# and still be acknowledged. 


The pins floated during bus hold are: BEO# -BE3#, 
PCO, PWT, 
W/R#, 
OIC#, 
MilOI', 
LOCK#, 


PLOCKI', 
AOS#, 
BLAST#, 
00-031, 
A2-A31, 


OPO-OP3. 


The Intel486 OX microprocessor generates interrupt 
acknowledge cycles in response to maskable inter- 
rupt requests generated on the interrupt request in- 
put (INTR) pin. Interrupt acknowledge cycles have a 
unique cycle type generated on the cycle type pins. 
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Figure 7.26a. HOLD/HLDA 
Cycles 


ClK 
ADS" 


..J '"'U \..J ""'\..) '"'UI\...J ""'\..) 
""'\..) '"'UIV ""'\..) '"'U1\ 
'-J 


I 


" 
" 


I 


I 


" 


104/10" 


D/C" 


W/R" 


KEN- 


BRDY" 


HlDA 


BOFF" 


An example interrupt acknowledge transaction is 
shown in Figure 7.27. Interrupt acknowledge cycles 
are generated in locked pairs. Data returned during 
the first cycle is ignored. The interrupt vector is re- 
turned during the second cycle on the lower 8 bits of 
the data bus. The Intel486 OX microprocessor has 
256 possible interrupt vectors. 


The state of A2 distinguishes the first and second 
interrupt acknowledge cycles. The byte address 
driven during the first interrupt acknowledge cycle is 
4 (A31-A3 
low, A2 high, BE3#-BE1# 
high, and 


BEO# low). The address driven during the second 
interrupt acknowledge cycle is 0 (A31-A2 
low, 


BE3#-BE1# 
high, BEO# low). 
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Figure 7.27. Interrupt 
Acknowledge 
Cycles 


Each of the interrupt acknowledge cycles are termi- 
nated when the external system returns ROY# or 
BROY#. Wait states can be added by withholding 
ROY/I or BROY/I. The Intel486 OX microprocessor 
automatically generates four idle clocks between the 
first and second cycles to allow for 8259A recovery 
time. 


The Intel486 OX microprocessor provides four spe- 
cial bus cycles to indicate that certain instructions 
have been executed, or certain conditions have oc- 
curred internally. The special bus cycles in Table 7.8 
are defined when the bus cycle definition pins are in 
the 
following 
state: 
M/IO/l 
= 0, 
O/G/I = 0 
and 
W/R/I 
= 1. During these cycles the address bus is 
driven low while the data bus is undefined. 


Two of the special cycles indicate halt or shutdown. 
Another special cycle is generated when the In- 
tel486 OX microprocessor executes an INVO (invali- 
date data cache) instruction and could be used to 
flush an external cache. The Write Back cycle is 
generated when the Intel486 OX microprocessor ex- 
ecutes the 
WBINVO (write-back invalidate data 
cache) instruction and could be used to synchronize 
an external write-back cache. 
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The external hardware must acknowledge these 
special bus cycles by returning ROY/I or BROY/I • 


BE3/1 
BE2/1 
BE1# 
BEG/I 
Special 
BuaCycle 


1 
1 
1 
0 
Shutdown 
1 
1 
0 
1 
Flush 
1 
0 
1 
1 
Halt 
0 
1 
1 
1 
Write Back 


7.2.11.1 
Halt Indication 
Cycle 


The Intel486 OX microprocessor halts as a result of 
executing a HALT instruction. Signaling its entrance 
into the halt state, a halt indication cycle is per- 
formed. The halt indication cycle is identified by the 
bus definition signals in special bus cycle state and a 
byte address of 2. BEO/l and BE2/1 are the only 
signals distinguishing halt indication from shutdown 
indication, which drives an address of O.During the 
halt cycle undefined data is driven on 00-031. The 
halt indication cycle 
must be acknowledged 
by 
ROY/I or BROY/I asserted. 
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A halted Intel486 OX microprocessor resumes exe- 
cution when INTR (if interrupts are enabled) or NMI 
or RESET is asserted. 


7.2.11.2 Shutdown Indication Cycle 


The Intel486 OX microprocessor shuts down as a 
result of a protection fault while attempting to pro- 
cess a double fault. Signaling its entrance into the 
shutdown state, a shutdown indication cycle is per- 
formed. The shutdown indication cycle is identified 
by the bus definition signals in special bus cycle 
state and a byte address of O. 


In a multi-master system another bus master may 
require the use of the bus to enable the Intel486 OX 
microprocessor to complete its current bus request. 
In this situation the Intel486 OX microprocessor will 
need to restart its bus cycle after the other bus mas- 
ter has completed its bus transaction. 


A bus cycle may be restarted if the external system 
asserts the backoff (BOFF#) input. The Intel486 OX 
microprocessor 
samples the 
BOFF# 
pin every 
clock. The Intel486 OX microprocessor will immedi- 
ately (in the next clock) float its address, data and 
status pins when BOFF# is asserted (see Figure 
7.28). Any bus cycle in progress when BOFF# is 


asserted is aborted and any data returned to the 
processor is ignored. The same pins are floated in 
response to BOFF# as are floated in response to 
HOLD. HLOA is not generated in response to 
BOFF#. BOFF# has higher priority than ROY# or 
BROY#. If either ROY# or BROY# are returned in 
the same clock as BOFF#, BOFF# takes effect. 


The device asserting BOFF# is free to run any cy- 
cles it wants while the Intel486 OX microprocessor 
bus is in its high impedance state. If backoff is re- 
quested after the Intel486 OX microprocessor has 
started a cycle, the new master should wait for 
memory to return ROY# or BROY# before assum- 
ing control of the bus. Waiting for ready provides a 
handshake to insure that the memory system is 
ready to accept a new cycle. If the bus is idle when 
BOFF# is asserted, the new master can start its 
cycle two clocks after issuing BOFF# . 


The external memory can view BOFF# in the same 
manner as BLAST#. Asserting BOFF# tells the ex- 
ternal memory system that the current cycle is the 
last cycle in a transfer. 


The bus remains in the high impedance state until 
BOFF# is negated. Upon negation, the Intel486 OX 
microprocessor restarts its bus cycle by driving out 
the address and status and asserting AOS#. The 
bus cycle then continues as usual. 


• 


Asserting BOFF# during a burst, BS8# or BS16# 
cycle will force the Intel486 OX microprocessor to 
ignore data returned for that cycle only. Data from 
previous cycles will still be valid. For example, if 
BOFF# is asserted on the third BRDY# of a burst, 
the Intel486 OX microprocessor assumes the data 
returned with the first and second BRDY#'s is cor- 
rect and restarts the burst beginning with the third 
item. The same rule applies to transfers broken into 
multiple cycle by BS8# or BS16#. 


Asserting BOFF# in the same clock as ADS# will 
cause the Intel486 OX microprocessor to float its 
bus in the next clock and leave ADS# floating low. 
Since ADS# is floating low, a peripheral may think 
that a new bus cycle has begun even though the 
cycle was aborted. 


There are two possible solutions to this problem. 
The first is to have all devices recognize this condi- 
tion and ignore ADS# until ready comes back. The 
second approach is to use a "two clock" backoff: in 
the first clock AHOLD is asserted, and in the second 
clock BOFF# 
is asserted. This guarantees that 
ADS# will not be floating low. This is only necessary 
in systems where BOFF# may be asserted in the 
same clock as ADS#. 


A bus state diagram is shown in Figure 7.30. A de- 
scription of the signals used in the diagram is given 
in Table 7.9. 
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Figure 7.30. Bus State Diagram 


Table 7.9. Bus State Description 


State 
Means 


Ti 
Bus is idle. Address and status signals may be driven to undefined values, or the bus may be 
floated to a high impedance state. 


T1 
First clock cycle of a bus cycle. Valid address and status are driven and ADS# is asserted. 


T2 
Second and subsequent clock cycles of a bus cycle. Data is driven if the cycle is a write, or data is 
expected if the cycle is a read. ROY# and BRDY# are sampled. 


T1b 
First clock cycle of a restarted bus cycle. Valid address and status are driven and ADS# is 
asserted. 


Tb 
Second and subsequent clock cycles of an aborted bus cycle. 


The Intel486 OX microprocessor provides two op- 
tions for reporting floating point errors. The simplest 
method is to raise interrupt 16 whenever an un- 
masked floating point error occurs. This option may 
be enabled by setting the NE bit in control register 0 
(CRO). 


The Intel486 OX microprocessor also provides the 
option of allowing external hardware to determine 
how floating point errors are reported. This option is 
necessary for compatibility with the error reporting 
scheme used in DOS based systems. The NE bit 
must be cleared in CROto enable user-defined error 
reporting. User-defined error reporting is the default 
condition because the NE bit is cleared on reset. 


Two pins, floating point error (FERR#) and ignore 
numeric error (IGNNE#), are provided to direct the 
actions of hardware if user-defined error reporting is 
used. The Intel486 OX microprocessor asserts the 
FERR# output to indicate that a floating point error 
has occurred. FERR# corresponds to the ERROR# 
pin on the Intel387 math coprocessor. However, 
there is a difference in the behavior of the two. 


In some cases FERR# is asserted when the next 
floating point instruction is encountered and in other 
cases it is asserted before the next floating point 
instruction is encountered depending upon the exe- 
cution state of the instruction causing the exception. 


The following class of floating point exceptions drive 
FERR# at the time the exception occurs (Le.,before 
encountering the next floating point instruction). 
1. The stack fault, invalid operation, and denormal 


exceptions on all transcendental instructions, in- 
teger arithmetic instructions, FSQRT, FSEALE, 
FPREM(1), FXTRACT, FBLO, and FBSTP. 


2. Any exceptions on store instructions (including 


integer store instructions). 


The following class of floating point exceptions drive 
FERR# only after encountering the next floating 
point instruction. 
1. Exceptions other than on all transcendental in- 


structions, 
integer 
arithmetic 
instructions, 


FSQRT, FSCALE, FPREM(1), FXTRACT, FBLO, 
and FBSTP. 


2. Any exception on all basic arithmetic, load, com- 


pare, and control instructions (Le., all other in- 
structions). 


For both sets of exceptions above, the lotel387 
Math Coprocessor asserts ERROR# when the error 
occurs and does not wait for the next floating point 
instruction to be encountered. 


IGNNE# is an input to the Intel486 OX microproces- 
sor. 


When the NE bit in CROis cleared, and IGNNE# is 
asserted, the Intel486 OX microprocessor will ignore 
a user floating point error and continue executing 
floating point instructions. When IGNNE# is negat- 
ed, the Intel486 OX microprocessor will freeze on 
floating point instructions which get errors (except 
for 
the 
control 
instructions 
FNCLEX, 
FNINIT, 


FNSAVE, FNSTENV, FNSTCW, FNSTSW, FNSTSW 
AX, FNENI, FNOISI and FNSETPM). IGNNE# may 
be asynchronous to the Intel486 OX clock. 


In systems with user-defined error reporting, the 
FERR# pin is connected to the interrupt controller. 
~hen 
an unmasked floating point error occurs, an 
Interrupt is raised. If IGNNE# is high at the time of 
this interrupt, the Intel486 OX microprocessor will 
freeze (disallowing execution of a subsequent float- 
ing point instruction) untit the interrupt handler is in- 
voked. By driving the IGNNE# pin low (when clear- 
ing the interrupt request), the interrupt handler can 
allow execution of a floating point instruction, within 
the interrupt handler, before the error condition is 
cleared 
(by 
FNCLEX, 
FNINIT, 
FNSAVE 
or 


FNSTENV). If execution of a non-control floating 
point instruction, within the floating point interrupt 
handler, is not needed, the IGNNE# pin can be tied 
HIGH. 
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Testing the Intel486 DX2 microprocessor can be di- 
vided into three categories: Built-In Self Test (BIST), 
Boundary Scan, and external testing. BIST performs 
basic device testing on the Intel486 DX2 CPU, in- 
cluding the non-random logic, control ROM (CRaM), 
translation 
lookaside 
buffer 
(TLB), and on-chip 
cache memory. Boundary Scan provides additional 
test hooks that conform to the IEEE Standard Test 
Access Port and Boundary Scan Architecture (IEEE 
Std.1149.1). The Intel486 DX2 microprocessor also 
has a test mode in which all of its outputs are 3-stat- 
ed. Additional testing can be performed by using the 
test registers within the Intel486 DX2 CPU. 


8.1 
Built-In Self Test (BIST) 


The BIST is initiated by holding the AHOLD (address 
hold) pin HIGH for 2 CLKs before and 2 CLKs after 
RESET going from HIGH to LOW as shown in Figure 
6.4. The BIST takes approximately 600 thousand 
clocks, or approximately 24 milliseconds with a 
50 MHz Intel486 DX2 microprocessor. No bus cy- 
cles will be run by the Intel486 DX2 microprocessor 
until the BIST is concluded. Note that for the In- 
tel486 DX2 microprocessor the RESET must be ac- 
tive for 15 clocks with or without BIST being enabled 
for warm resets. 


The results of BIST is stored in the EAX register. 
The Intel486 DX2 microprocessor has successfully 
passed the BIST if the contents of the EAX register 
are zero. If the results in EAX are not zero then the 
BIST has detected a flaw in the microprocessor. The 
microprocessor performs reset and begins normal 
operation at the completion of the BIST. 


The non-random logic, control ROM, on-chip cache 
and translation lookaside buffer (TLB) are tested 
during the BIST. 


The cache portion of the BIST verifies that the 
cache is functional and that it is possible to read and 
write to the cache. The BIST manipulates test regis- 
ters TR3, TR4 and TR5 while testing the cache. 
These test registers are described in Section 8.2. 


The cache testing algorithm writes a value to each 
cache entry, reads the value back, and checks that 
the correct value was read back. The algorithm may 
be repeated more than once for each of the 512 
cache entries using different constants. 


The TLB portion of the BIST verifies that the TLB is 
functional and that it is possible to read and write to 
the TLB. The BIST manipulates test registers TR6 
and TR7 while testing the TLB. TR6 and TR7 are 
described in Section 8.3. 


8.2 
On-Chip Cache Testing 


The on-chip cache testability hooks are designed to 
be accessible during the BIST and for assembly lan- 
guage testing of the cache. 


The Intel486 DX2 microprocessor contains a cache 
fill buffer and a cache read buffer. For testability 
writes, data must be written to the cache fill buffer 
before it can be written to a location in the cache. 
Data must be read from a cache location into the 
cache read buffer before the microprocessor can 
access the data. The cache fill and cache read buff- 
er are both 128 bits wide. 


8.2.1 
CACHE 
TESTING 
REGISTERS 
TR3, TR4 
AND TR5 


Figure 8.1 shows the three cache testing registers: 
the Cache Data Test Register (TR3), the Cache 
Status Test Register (TR4) and the Cache Control 
Test Register (TR5). External access to these regis- 
ters is provided through MOV reg,TREG and MOV 
TREG, reg instructions. 


The cache fill buffer and the cache read buffer can 
only be accessed through TR3. Data to be written to 
the cache fill buffer must first be written to TR3. Data 
read from the cache read buffer must be loaded into 
TR3. 


TR3 is 32 bits wide while the cache fill and read 
buffers are 128 bits wide. 32 bits of data must be 
written to TR3 four times to fill the cache fill buffer. 
32 bits of data must be read from TR3 four times to 
empty the cache read buffer. The entry select bits in 
TR5 determine which 32 bits of data TR3 will access 
in the buffers. 


Cache 
Status Test Register: 
TR4 


TR4 handles tag, LRU and valid bit information dur- 
ing cache tests. TR4 must be loaded with a tag and 
a valid bit before a write to the cache. After a read 
from a cache entry, TR4 contains the tag and valid 
bit from that entry, and the LRU bits and four valid 
bits from the accessed set. 


Cache 
Control 
Test Register: 
TR5 


TR5 specifies which testability operation will be per- 
formed and the' set and entry within the set which 
will be accessed. 


• 


The seven bit set select field determines which of 
the 128 sets will be accessed. 


The functionality of the two entry select bits depend 
on the state of the control bits. When the fill or read 
buffers are being accessed, the entry select bits 
point to the 32-bit location in the buffer being ac- 
cessed. When a cache location is specified, the en- 
try select bits point to one of the four entries in a set. 
Refer to Table 8.1. 


Five testability functions can be performed on the 
cache. The two control bits in TR5 specify the oper- 
ation to be executed. The five operations are: 


1. Write cache fill.buffer 
2. Perform a cache testability write 
3. Perform a cache testability read 
4. Read the cache read buffer 
5. Perform a cache flush 
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Table 8.1 shows the encoding of the two control bits 
in TR5 for the cache testability functions. Table 8.1 
also shows the functionality of the entry and set se- 
lect bits for each control operation. 


The cache tests attempt to use as much of the nor- 
mal operating circuitry as possible. Therefore when 
cache tests are being performed. the cache must be 
disabled (the CD and NW bits in control register 
must be set to 1 to disable the cache. See Section 
5). 


A testability write to the cache is a two step process. 
First the cache fill buffer must be loaded with 128 
bits of data and TR4 loaded with the tag and valid 
bit. Next the contents of the fill buffer are written to a 
cache location. Sample assembly code to do a write 
is given in Figure 8.2. 
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Table 8.1. Cache Control 
Bit Encoding 
and Effect of 
Control 
Bits on Entry Select and Set Select Functionality 


Control 
Bits 
Operation 
Entry Select Bits 
Set Select Bits 
Bit 1 
Bit 0 
Function 


0 
0 
Enable 
{ 
Fill Buffer Write 
Select 32-bit location in fill/read 
Read Buffer Read 
buffer 
- 


0 
1 
Perform Cache Write 
Select an entry in set. 
Select a set to write to 


1 
0 
Perform Cache Read 
Select an entry in set. 
Select a set to read from 


1 
1 
Perform Flush Cache 
- 
- 


Sample 
Assembly 
Code 


An example 
assembly 
language 
sequence 
to perform 
a cache write is: 


eax. 
ebx. 
ecx. 
edx contain 
the cache 
line 
to write 
edi contains 
the tag 
information 
to load 
eRO already 
says to enable 
reads/write 
to TR5 


fill the 
cache 
buffer 
mov 
esi,O 
mov tr5,esi 
mov tr3,eax 
mov 
esi,4 
mov tr5,esi 
mov tr3,ebx 
mov 
esi,8 
mov tr5,esi 
mov tr3,ecx 
mov 
esi,Och 
mov tr5,esi 
mov tr3,edx 


set up command 
load 
to TR5 
load data 
into cache 
fill buffer 


mov 
esi,l 
mov tr5,esi 


mov 
esi,2 
mov tr5,esi 


read the data 
from the read buffer 


mov 
esi,O 
mov tr5,esi 
mov 
eax,tr3 
mov 
esi,4 
mov tr5,esi 
mov 
ebx,tr3 
mov 
esi,8 
mov tr5,esi 
mov 
ecx,tr3 
mov 
esi,Och 
mov tr5,esi 
mov 
edx,tr3 


read the 
status 
from TR4 


mov 
edi,tr4 
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Loading the fill buffer is accomplished by first writing 
to the entry select bits in TR5 and setting the control 
bits in TR5 to 00. The entry select bits identify one of 
four 32-bit locations in the cache fill buffer to put 32 
bits of data. Following the write to TR5, TR3 is writ- 
ten with 32 bits of data which are immediately 
placed in the cache fill buffer. Writing to TR3 initiates 
the write to the cache fill buffer. The cache fill buffer 
is loaded with 128 bits of data by writing to TR5 and 
TR3 four times using a different entry select location 
each time. 


TR4 must be loaded with the 21-bit tag and valid bit 
(bit 10 in TR4) before the contents of the fill buffer 
are written to a cache location. 


The contents of the cache fill buffer are written to a 
cache location by writing TR5 with a control field of 
01 along with the set select and entry select fields. 
The set select and entry select field indicate the lo- 
cation in the cache to be written. The normal cache 
LRU update circuitry updates the internal LRU bits 
for the selected set. 


Note that a cache testability write can only be done 
when the cache is disabled for replaces (the CD bit 
is control register 0 is reset to 1). Also note that care 
must be taken when directly writing to entries in the 
cache. If the entry is set to overlap an area of mem- 
ory that is being used in external memory, that 
cache entry could inadvertently be used instead of 
the external memory. Of course, this is exactly the 
type of operation that one would desire if the cache 
were to be used as a high speed RAM. 


A cache testability read is a two step process. First 
the contents of the cache location are read into the 
cache read buffer. Next the data is examined by 
reading it out of the read buffer. Sample assembly 
code to do a testability read is given in Figure 8.2. 


Reading the contents of a cache location into the 
cache read buffer is initiated by writing TR5 with the 
control bits set to 10 and the desired seven-bit set 
select and two-bit entry select. In response to the 
write to TR5, TR4 is loaded with the 21-bit tag field 
and the single valid bit from the cache entry read. 
TR4 is also loaded with the three LRU bits and four 
valid bits correspondirig to the cache set that was 
accessed. The cache read buffer is filled with the 
128-bit value which was found in the data array at 
the specified location. 


The contents of the read buffer are examined by 
performing four reads of TR3. Before reading TR3 
the entry select bits in TR5 must loaded to indicate 
which of the four 32-bit words in the read buffer to 
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transfer into TR3 and the control bits in TR5 must be 
loaded with 00. The register read of TR3 will initiate 
the transfer of the 32-bit value from the read buffer 
to the specified general purpose register. 


Note that it is very important that the entire 128-bit 
quantity from the read buffer and also the informa- 
tion from TR4 be read before any memory refer- 
ences are allowed to occur. If memory operations 
are allowed to happen, the contents of the read buff- 
er will be corrupted. This is because the testability 
operations use hardware that is used in normal 
memory accesses for the Intel486 DX2 microproc- 
essor whether the cache is enabled or not. 


The control bits in TR5 must be written with 11 to 
flush the cache. None of the other bits in TR5 have 
any meaning when 11 is written to the control bits. 
Flushing the cache will reset the LRU bits and the 
valid bits to 0, but will not change the cache tag or 
data arrays. 


When the cache is flushed by writing to TR5 the 
special bus cycle indicating a cache flush to the ex- 
ternal system is not run (see Section 7.2.11, Special 
Bus Cycles). The cache should be flushed with the 
instruction INVD (Invalidate Data Cache) instruction 
or the WBINVD (Write-back and Invalidate Data 
Cache) instruction. 


8.3 
Translation 
Lookaside 
Buffer 
(TLB) Testing 


The Intel486 DX2 microprocessor TLB testability 
hooks are similar to those in the Intel386 microproc- 
essor. The testability hooks have been enhanced to 
provide added test features and to include new fea- 
tures in the Intel486 DX2 microprocessor. The TLB 
testability hooks are designed to be accessible dur- 
ing the BIST and for assembly language testing of 
the TLB. 


8.3.1 
TRANSLATION 
LOOKASIDE 
BUFFER 
ORGANIZATION 


The Intel486 DX2 microprocessors TLB is 4-way set 
associative and has space for 32 entries. The TLB is 
logically split into three blocks shown in Figure 8.3. 


The data block is physically split into four arrays, 
each with space for eight entries. An entry in the 
data block is 22 bits wide containing a 20-bit physi- 
cal address and two bits for the page attributes. The 
page attributes are the PCD (page cache disable) bit 
and the PWT (page write-through) bit. Refer to Sec- 
tion 4.5.4 for a discussion of the PCD and PWT bits. 
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The tag block is also split into four arrays, one for 
each of the data arrays. A tag entry is 21 bits wide 
containing a 17-bit linear address and four protec- 
tion bits. The protection bits are valid (V), user/su- 
pervisor (U/S), read/write (R/W) and dirty (D). 


TLB is the same as used by the on-chip cache. For a 
description of this algorithm refer to Section 5.5. 


The third block contains eight three bit quantities 
used in the pseudo least recently used (LRU) re- 
placement algorithm. These bits are called the LRU 
bits. The LRU replacement algorithm used in the 


The two TLB test registers are shown in Figure 8.4. 
TR6 is the command test register and TR7 is the 
data test register. External access to these registers 
is provided through 
MOV reg,TREG and 
MOV 
TREG,reg instructions. 
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t 
t 
ReplacementPointerSelect (Writes) 
ReplacementPointer(Writes) 
HitIndication(Lookup) 
HitLocation(Lookup) 


TR6 Protection 
Bit 
TR6 Protection 
Bit # 
Meaning 
on 
Meaning 
on 
(B) 
(B#) 
TLB Write Operation 
TLB Lookup 
Operation 


0 
0 
Undefined 
Miss any TLB TAG Bit B 
0 
1 
Write 0 to TLB TAG Bit B 
Match TLB TAG Bit B if 0 
1 
0 
Write 1 to TLB TAG Bit B 
Match TLB TAG Bit B if 1 


1 
1 
Undefined 
Match any TLB TAG Bit B 


TR6 contains the tag information and control infor- 
mation used in a TLB test. Loading TR6 with tag and 
control information initiates a TLB write or lookup 
test. 


TR6 contains three bit fields, a 20-bit linear address 
(bits 12-31), seven bits for the TLB tag protection 
bits (bits 5-11) and one bit (bit 0) to define the type 
of operation to be performed on the TLB. 


The 20-bit linear address forms the tag information 
used in the TLB access. The lower three bits of the 
linear address select which of the eight sets are ac- 
cessed. The upper 17 bits of the linear address form 
the tag stored in the tag array. 


The seven TLB tag protection bits are described be- 
low. 


V: 
0,0#: 
U,U#: 


The valid bit for this TLB entry 
The dirty bit for/from the TLB entry 
The user/supervisor bit for/from 
the TLB 
entry 
The read/write bit for/from the TLB entry 


Two bits are used to represent the 0, U/S and R/W 
bits in the TLB tag to permit the option of a forced 
miss or hit during a TLB lookup operation. The 
forced miss or hit will occur regardless of the state 
of the actual bit in the TLB. The meaning of these 
pairs of bits is given in Table 8.2. 


The operation bit in TR6 determines if the TLB test 
operation will be a write or a lookup. The function of 
the operation bit is given in Table 8.3. 


TR6 
TLB Operation 
BltO 
to Be Performed 


0 
TLB Write 


1 
TLB Lookup 


TR7 contains the information stored or read from the 
data block during a TLB test operation. Before a TLB 


test write, TR7 contains the physical address and 
the page attribute bits to be stored in the entry. After 
a TLB test lookup hit, TR7 contains the physical ad- 
dress, page attributes, LRU bits and entry location 
from the access. 


TR7 contains a 20-bit physical address (bits 12-31), 
two bits for PCD (bit 11) and PWT (bit 10) and three 
bits for the LRU bits (bits 7-9). The LRU bits in TR7 
are only used during a TLB lookup test. The func- 
tionality of TR7 bit 4 differs for TLB writes and look- 
ups. The encoding of bit 4 is defined in Tables 8.4 
and 8.5. Finally TR7 contains two bits (bits 2-3) to 
specify a TLB replacement pointer or the location of 
a TLB hit. 


TR7 
Replacement 
Pointer 


Bit 4 
Used on TLB Write 


0 
Pseudo-LRU Replacement Pointer 


1 
Data Test Register Bits 3:2 


TR7 
Meaning 
after TLB 
Bit 4 
Lookup Operation 


0 
TLB Lookup Resulted in a Miss 
1 
TLB Lookup Resulted in a Hit 


A replacement pointer is used during a TLB write. 
The pointer indicates which of the four entries in an 
accessed ,set is to be written. The replacement 
pointer can be specified to be the internal LRU bits 
or bits 2-3 in TR7. The source of the replacement 
pointer is specified by TR7 bit 4. The encoding of bit 
4 during a write is given by Table 8.4. 


Note that both testability writes and lookups affect 
the state of the internal LRU bits regardless of the 
replacement pointer used. All TLB write operations 
(testability or normal operation) cause the written 
entry to become the most recently used. For exam- 
ple, during a testability write with the replacement 
pointer specified by TR7 bits 2-3, the indicated en- 
try is written and that entry becomes the most re- 
cently used as specified by the internal LRU bits. 


There are two TLB testing operations: write entries 
into the TLB, and perform TLB lookups. One major 
enhancement over TLB testing in the Intel386 micro- 
processor is that paging need not be disabled while 
executing testability writes or lookups. 


Note that any time one TLB set contains the same 
linear address in more than one of its entries, look- 
ing up that linear address will give unpredictable re- 
sults. Therefore a single linear address should not 
be written to one TLB set more than once. 


To perform a TLB write TR7 must be loaded fol- 
lowed by a TR6 load. The register operations must 
be performed in this order since the TLB operation is 
triggered by the write to TR6. 


TR7 is loaded with a 20-bit physical address and 
values for PCD and PWT to be written to the data 
portion of the TLB. In addition, bit 4 of TR7 must be 
loaded to indicate whether to use TR7 bits 3-2 or the 
internal LRU bits as the replacement pointer on the 
TLB write operation. Note that the LRU bits in TR7 
are not used in a write test. 


TR6 must be written to initiate the TLB write opera- 
tion. Bit 0 in TR6 must be reset to zero to indicate a 
TLB write. The 20-bit linear address and the seven 
page protection bits must also be written in TR6 to 
specify the tag portion of the TLB entry. Note that 
the three least significant bits of the linear address 
specify which of the eight sets in the data block will 
be loaded with the physical address data. Thus only 
17 of the linear address bits are stored in the tag 
array. 


To perform a TLB lookup it is only necessary to write 
the proper tags and control information into TR6. Bit 
o in TR6 must be set to 1 to indicate a TLB lookup. 
TR6 must be loaded with a 20-bit linear address and 
the seven protection bits. To force misses and 
matches of the individual protection bits on TLB 
lookups, set the seven protection bits as specified in 
Table 8.2. 


A TLB lookup operation is initiated by the write to 
TR6. TR7 will indicate the result of the lookup opera- 
tion following the write to TR6. The hit/miss indica- 
tion can be found in TR7 bit 4 (see Table 8.5). 


TR7 will contain the following information if bit 4 indi- 
cated that the lookup test resulted in a hit. Bits 2-3 
will indicate in which set the match occurred. The 22 
most significant bits in TR7 will contain the physical 
address and page attributes contained in the entry. 


Bits 9-7 
will contain the LRU bits associated with 
the accessed set. The state of the LRU bits is previ- 
ous to their being updated for the current lookup. 


If bit 4 in TR7 indicated that the lookup test resulted 
in a miss the remaining bits in TR7 are undefined. 


Again it should be noted that a TLB testability lookup 
operation affects the state of the LRU bits. The LRU 
bits will be updated if a hit occurred. The entry which 
was hit will become the most recently used. 


The Intel486 DX2 microprocessor provides the abili- 
ty to float all its outputs and bidirectional pins. This 
includes all pins floated during bus hold as well as 
pins which are never floated in normal operation of 
the chip (HLDA, BREQ, FERR# 
and PCHK#). 


When the Intel486 DX2 microprocessor is in the 3- 
state output test mode external testing can be used 
to test board connections. 


The 
3-state 
test 
mode 
is 
invoked 
by 
driving 
FLUSH# low for 2 clocks before and 2 clocks after 
RESET going low. The outputs are guaranteed to 3- 
state no later than 10 clocks after RESET goes low 
(see Figure 6.4). The Intel486 DX2 microprocessor 
remains in the 3-state test mode until the next 
RESET. 


8.5 
Intel486™ 
DX2 Microprocessor 
Boundary Scan (JTAG) 


The Intel486 DX2 microprocessor provides testabili- 
ty features compatible with the IEEE Standard Test 
Access Port and Boundary Scan Architecture (IEEE 
Std.1149.1). The test logic provided allows for test- 
ing to insure that components function correctly, that 
interconnections between various components are 
correct, and that various components interact cor- 
rectly on the printed circuit board. 


The boundary scan test logic consists of a boundary 
scan register and support logic that are accessed 
through a test access port (TAP). The TAP provides 
a simple serial interface that makes it possible to 
test all signal traces with only a few probes. 


The TAP can be controlled via a bus master. The 
bus master can be either automatic test equipment 
or a component (PLD) that interfaces to the four-pin 
test bus. 


The boundary scan test logic contains the following 
elements: 
- 
Test access port (TAP), consisting of input pins 
TMS, TCK, and TOI; and output pin TOO. 
- 
TAP controller, which interprets the inputs on the 
test mode select (TMS) line and performs the 
corresponding operation. The operations per- 
formed by the TAP include controlling the in- 
struction and data registers within the compo- 
nent. 


- 
Instruction register (IR), which accepts instruc- 
tion codes shifted into the test logic on the test 
data input (TOI) pin. The instruction codes are 
used to select the specific test operation to be 
performed or the test data register to be ac- 
cessed. 


- 
Test data registers: The Intel486 OX2 microproc- 
essor contains three test data registers: Bypass 
register 
(BPR), Device 
Identification 
register 
(DID), and Boundary Scan register (BSR). 


The instruction and test data registers are separate 
shift-register paths connected in parallel and have a 
common serial data input and a common serial data 
output connected to the TAP signals, TOI and TOO, 
respectively. 


The Intel486 OX2 CPU contains the two required 
test data registers; bypass register and boundary 
scan register. In addition, they also have a device 
identification register. 


Each test data register is serially connected to TOI 
and TOO, with TOI connected to the most significant 
bit and TOO connected to the least significant bit of 
the test data register. Data is shifted one stage (bit 
position within the register) on each rising edge of 
the test clock (TCK). 


In addition the Intel486 OX2 CPU contains a runbist 
register to support the RUNBIST boundary scan in- 
struction. 


8.5.2.1 
Bypass 
Register 


The Bypass Register is a one-bit shift register that 
provides the minimal length path between TOI and 
TOO. This path can be selected when no test opera- 
tion is being performed by the component to allow 
rapid movement of test data to and from other com- 
ponents on the board. While the bypass register is 
selected, data is transferred from TOI to TOO with- 
out inversion. 


8.5.2.2 
Boundary 
Scan Register 


The Boundary Scan Register is a single shift register 
path containing the boundary scan cells that are 
connected to all input and output pins of the Intel486 
OX2 CPU. Figure 8.5 shows the logical structure of 
the boundary scan register. While output cells deter- 
mine the value of the signal driven on the corre- 
sponding pin, input cells only capture data; they do 
not affect the normal operation of the device. Data is 
transferred 
without 
inversion from TOI to 
TOO 
through the boundary scan register during scanning. 
The boundary scan register can be operated by the 
EXTEST and SAMPLE instructions. The boundary 
scan register order is described in Section 8.5.5. 
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8.5.2.3 Device Identification 
Register 


The Device Identification Register contains the man- 
ufacturer's identification code, part number code, 
and version code in the format shown in Figure 8.6. 
Table 8.6 lists the codes corresponding to the In- 
tel486 DX2 CPU. 


8.5.2.4 Runblst Register 


The Runbist Register is a one bit register used to 
report the results of the Intel486 DX2 CPU BIST 
when it is initiated by the RUNBIST instruction. This 
register is loaded with a "1" prior to invoking the 
BIST and is loaded with "0" upon successful com- 
pletion. 


The Instruction Register (IR) allows instructions to 
be serially shifted into the device. The instruction 
selects the particular test to be performed, the test 
data register to be accessed, or both. The instruc- 


tion register is four (4) bits wide. The most significant 
bit is connected to TDI and the least significant bit is 
connected to TOO. There are no parity bits associat· 
ed with the Instruction register. Upon entering the 
Capture-IR TAP controller state, the Instruction reg- 
ister is loaded with the default instruction "0001", 
SAMPLE/PRELOAD. Instructions are shifted into 
the instruction register on the rising edge of TCK 
while the TAP controller is in the Shift·IR state. 


8.5.3.1 Intel486 DX2 CPU Boundary SCan 
Instruction 
Set 


The Intel486 DX2 CPU supports all three mandatory 
boundary scan instructions (BYPASS, SAMPLE/ 
PRELOAD, and EXTEST) along with two optional in- 
structions (IDCODE and RUNBIST). Table 8.7 lists 
the Intel486 DX2 CPU boundary scan instruction 
codes. The instructions listed as PRIVATE cause 
TOO to become enabled in the Shift-DR state and 
cause "0" to be shifted out of TOO on the rising 
edge of TCK. Execution of the PRIVATE instructions 
will not cause hazardous operation of the Intel486 
DX2 CPU. 
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Figure 8.6. Format of Device Identification 
Register 


Table 8.6. Device Identification 
Codes 


Component Code 
Version Code 
Part Number Code 
Manufacturer Identity 


Intel486 DX2 CPU (Ax) 
OOh 
0432h 
09h 


Intel486 DX2 CPU (Bx) 
OOh 
0433h 
09h 


Instruction 
Code 
Instruction 
Name 


0000 
EXTEST 


0001 
SAMPLE 


0010 
IDCODE 


0011 
PRIVATE 


0100 
PRIVATE 


0101 
PRIVATE 


0110 
PRIVATE 


0111 
PRIVATE 


1000 
RUNBIST 


1001 
PRIVATE 


1010 
PRIVATE 


1011 
PRIVATE 


1100 
PRIVATE 


1101 
PRIVATE 


1110 
PRIVATE 


1111 
BYPASS 


EXTEST 
The instruction code is "0000". The EX- 
TEST instruction allows testing of cir- 
cuitry external to the component pack- 
age, typically board interconnects. It 
does so by driving the values loaded 
into the Intel486 DX2 CPU's boundary 
scan register out on the output pins cor- 
responding to each boundary scan cell 
and capturing the values on Intel486 
DX2 CPU input pins to be loaded into 
their corresponding boundary scan reg- 
ister locations. I/O pins are selected as 
input or output, depending on the value 
loaded into their control setting loca- 
tions in the boundary scan register. Val- 
ues shifted into input latches in the 
boundary scan register are never used 
by the internal logic of the Intel486 DX2 
CPU. 


NOTE: 
After using the EXTEST instruction, the 
Intel486 DX2 CPU must be reset before 
normal (non-boundary scan) use. 


SAMPLE/ 
The instruction code is "0001". 
The 
PRELOAD SAMPLE/PRELOAD has two functions 
that it performs. When the TAP control- 
ler is in the Capture-DR state, the SAM- 
PLE/PRELOAD 
instruction 
allows 
a 
"snap-shot" of the normal operation of 


the component without interfering with 
that normal operation. The instruction 
causes boundary scan register cells as- 
sociated with outputs to sample the val- 
ue being driven by the Intel486 DX2 
CPU. It causes the cells associated with 
inputs to sample the value being driven 
into the Intel486 DX2 CPU. On both out- 
puts and inputs the sampling occurs on 
the rising edge of TCK. When the TAP 
controller is in the Update-DR state, the 
SAMPLE/PRELOAD 
instruction 
pre- 
loads data to the device pins to be driv- 
en 
to 
the 
board 
by 
executing 
the 
EXTEST instruction. Data is preloaded 
to the pins from the boundary scan reg- 
ister on the falling edge of TCK. 
IDCODE 
The instruction code is "0010". The ID- 
CODE instruction selects the device 
identification register to be connected 
to TDI and TDO, allowing the device 
identification code to be shifted out of 
the device on TDO. Note that the de- 
vice identification register is not altered 
by data being shifted in on TD!. 


BYPASS 
The instruction code is "1111". 
The 
BYPASS instruction selects the bypass 
register to be connected to TDI or TDO, 
effectively bypassing the test logic on 
the Intel486 DX2 microprocessor by re- 
ducing the shift length of the device to 
one bit. Note than an open circuit fault 
in the board level test data path will 
cause the bypass register to be select- 
ed following an instruction scan cycle 
due to the pull-up resistor on the TDI 
input. This has been done to prevent 
any 
unwanted 
interference 
with 
the 
proper operation of the system logic. 
RUNBIST 
The instruction code is "1000". 
The 
RUNBIST instruction selects the one (1) 
bit runbist register, loads a value of "1" 
into the runbist register, and connects it 
to TDO. It also initiates the built-in self 
test (BIST) feature of the Intel486 DX2 
CPU, which is able to detect approxi- 
mately 60% of the stuck-at faults on the 
Intel486 DX2 CPU. The Intel486 DX2 
CPU AC/DC Specifications for Vcc and 
CLK must be met and reset must have 
been asserted at least once prior to ex- 
ecuting the RUNBIST boundary scan in- 
struction. After loading the RUNBIST in- 
struction code in the instruction register, 
the TAP controller must be placed in 
the Run-Test/Idle state. BIST begins on 
the first rising edge of TCK after enter- 
ing the Run-Test/Idle state. The TAP 


controller must remain in the Run-Test! 
Idle state until BIST is completed. It re- 
quires 1.2 million clock (ClK) cycles to 
complete BIST and report the result to 
the runbist register. After completing 
the 1.2 million clock (ClK) cycles, the 
value in the runbist register should be 
shifted out on TOO during the Shift-DR 
state. A value of "0" being shifted out 
on TOO indicates BIST successfully 
completed. A value of "1" indicates a 
failure occurred. After executing the 
RUNBIST instruction, the Intel486 OX2 
CPU must be reset prior to normal oper- 
ation. 


8.5.4 
TEST 
ACCESS 
PORT (TAP) 
CONTROLLER 


The TAP controller is a synchronous, finite state ma- 
chine. It controls the sequence of operations of the 
test logic. The TAP controller changes state only in 
response to the following events: 
1. a rising edge of TCK 
2. power-up. 


The value of the test mode state (TMS) input signal 
at a rising edge of TCK controls the sequence of the 
state changes. The state diagram for the TAP con- 
troller is shown in Figure 8.7. Test designers must 
consider the operation of the state machine in order 
to design the correct sequence of values to drive on 
TMS. 


8.5.4.1 
Test-Loglc-Reset 
State 


In this state, the test logic is disabled so that normal 
operation of the device can continue unhindered. 
This is achieved by initializing the instruction register 
such that the IOCOOEinstruction is loaded. No mat- 
ter what the original state of the controller, the con- 
troller enters Test-logic-Reset state when the TMS 
input is held high (1) for at least five rising edges of 2 
TCK. The controller remains in this state while TMS 
is high. The TAP controller is also forced to enter 
this state at power-up. 


A controller state between scan operations. Once in 
this state, the controller remains in this state as long 


as TMS is held low. In devices supporting the 
RUNBIST instruction, the BIST is performed during 
this state and the result is reported in the runbist 
register. For instruction not causing functions to exe- 
cute during this state, no activity occurs in the test 
logic. The instruction register and all test data regis- 
ters retain their previous state. When TMS is high 
and a rising edge is applied to TCK, the controller 
moves to the Select-DR state. 


This is a temporary controller state. The test data 
register selected by the current instruction retains its 
previous state. If TMS is held low and a rising edge 
is applied to TCK when in this state, the controller 
moves into the Capture-DR state, and a scan se- 
quence for the selected test data register is initiated. 
If TMS is held high and a rising edge is applied to 
TCK, the controller moves to the Select-IR-Scan 
state. 


8.5.4.4 
Capture-DR 
State 


In this state, the boundary scan register captures 
input pin data if the current instruction is EXTEST or 
SAMPLE/PRELOAD. The other test data registers, 
which do not have parallel input, are not changed. 


The instruction does not change in this state. 


When the TAP controller is in this state and a rising 
edge is applied to TCK, the controller enters the 
Exit1-DR state if TMS is high or the Shift-DR state if 
TMS is low. 


In this controller state, the test data register con- 
nected between TDI and TOO as a result of the cur- 
rent instruction, shifts data one stage toward its seri- 
al output on each rising edge of TCK. 


When the TAP controller is in this state and a rising 
edge is applied to TCK, the controller enters the 
Exit1-DR state if TMS is high or remains in the Shift- 
DR state if TMS is low. 


This is a temporary state. While in this state, if TMS 
is held high, a rising edge applied to TCK causes the 
controller to enter the Update-DR state, which termi- 


nates the scanning process. If TMS is held low and a 
rising edge is applied to TCK, the controller enters 
the Pause-DR state. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


8.5.4.7 
Pause-Dr 
State 


The pause state allows the test controller to tempo- 
rarily halt the shifting of data through the test data 
register in the serial path between TDI and TOO. An 
example of using this state could be to allow a tester 
to reload its pin memory from disk during application 
of a long test sequence. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


The controller remains in this state as long as TMS 
is low. When TMS goes high and a rising edge is 
applied to TCK, the controller moves to the Exit2-DR 
state. 


This is a temporary state. While in this state, if TMS 
is held high, a rising edge applied to TCK causes the 
controller to enter the Update-DR state, which termi- 
nates the scanning process. If TMS is held low and a 
rising edge is applied to TCK, the controller enters 
the Shift-DR state. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


8.5.4.9 
Update-DR 
State 


The boundary scan register is provided with a 
latched parallel output to prevent changes at the 
parallel output while data is shifted in response to 
the EXTEST and SAMPLE/PRELOAD instructions. 
When the TAP controller is in this state and the 
boundary scan register is selected, data is latched 
onto the parallel output of this register from the shift- 
register path on the falling edge of TCK. The data 
held at the latched parallel output does not change 
other than in this state. 


All shift-register stages in test data register selected 
by the current instruction retains its previous value 
during this state. The instruction does not change in 
this state. 
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This is a temporary controller state. The test data 
register selected by the current instruction retains its 
previous state. If TMS is held low and a rising edge 
is applied to TCK when in this state, the controller 
moves into the Capture-IR state, and a scan se- 
quence for the instruction register is initiated. If TMS 
is held high and a rising edge is applied to TCK, the 
controller moves to the Test-Logic-Reset state. 


8.5.4.11 
Capture-IR 
State 


In this controller state the shift register contained in 
the instruction register loads the fixed value "0001" 
on the rising edge of TCK. 


The test data register selected by the current in- 
struction retains it previous value during this state. 
The instruction does not change in this state. 


When the controller is in this state and a rising edge 
is applied to TCK, the controller enters the Exit1-IR 
state if TMS is held high, or the Shift-IR state if TMS 
is held low. 


In this state the shift register contained in the in- 
struction register is connected between TOI and 
TOO and shifts data one stage towards its serial out- 
put on each rising edge of TCK. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


When the controller is in this state and a rising edge 
is applied to TCK, the controller enters the Exit1-IR 
state if TMS is held high, or remains in the Shift-IR 
state if TMS is held low. 


This is a temporary state. While in this state, if TMS 
is held high, a rising edge applied to TCK causes the 


controller to enter the Update-IR state, which termi- 
nates the scanning process. If TMS is held low and a 
rising edge is applied to TCK, the controller enters 
the Pause-IR state. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


The pause state allows the test controller to tempo- 
rarily halt the shifting of data through the instruction 
register. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


The controller remains in this state as long as TMS 
is low. When TMS goes high and a rising edge is 
applied to TCK, the controller moves to the Exit2-IR 
state. 


This is a temporary state. While in this state, if TMS 
is held high, a rising edge applied to TCK causes the 
controller to enter the Update-IR state, which termi- 
nates the scanning process. If TMS is held low and a 
rising edge is applied to TCK, the controller enters 
the Shift-IR state. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


The instruction shifted into the instruction register is 
latched onto the parallel output from the shift-regis- 
ter path on the falling edge of TCK. Once the new 
instruction has been latched, it becomes the current 
instruction. 


Test data registers selected by the current instruc- 
tion retain the previous value. 


• 


The boundary scan register contains a cell for each 
pin, as well as cells for control of 1/0 and 3-state 
pins. 


The following is the bit order of the Intel486 OX2 
CPU boundary scan register: (from left to right and 
top to bottom). 


TOI 
- 
WRCTL 
ABUSCTL 
BUSCTL 
MISCCTL 
AOS# 
BLAST# 
PLOCK# 
LOCK# 
PCHK# 
BROY# BOFF# BS16# BS8# ROY# KEN# 
HOLO AHOLO CLK HLOA WR# BREQ BEO# 
BE1# 
BE2# 
BE3# 
MIO# 
OC# 
PWT PCO 
EAOS# A20M# 
RESET FLUSH# INTR NMI 
UP# FERR# IGNNE# 031 030029028027 
026 025 024 OP3 023 022 021 020 019 018 
017 016 OP2 015 014 013 012 011 01009 
08 OP1 07 06 05 04 03 02 01 00 OPOA31 
A30 A29 A28 A27 A26 A25 A24 A23 A22 A21 
A20 A19 A18 A17 A16 A15 A14 A13 A12 A11 
A10 A9 A8 A7 A6 RESERVEO A5 A4 A3 
A2-TOO 
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"RESERVEO" corresponds to no connect "NC" sig- 
nals on the Intel486 OX2 CPU. 


All the *CTL cells are control cells that are used to 
select the direction of bidirectional pins or 3-state 
output pins. If "1" 
is loaded into the control cell 
(*CTL), the associated pines)are 3-stated or select- 
ed as input. The following lists the control cells and 
their corresponding pins. 
1. WRCTL control.sthe 031-0 
and OP3-0 pins. 
2. ABUSCTL controls the A31-A2 pins. 
3. BUSCTL 
controls 
the 
AOS#, 
BLAST# , 
PLOCK#, LOCK#, WR#, BEO#, BEH, 
BEU, 
BE3#, MIO#, OC#, PWT, and PCO pins. 
4. MISCCTL controls the PCHK#, HLOA, BREQ. 
and FERR# pins. 


The TAP controller is automatically initialized when a 
device is powered up. In addition, the TAP controller 
can be initialized by applying a high signal level on 
the TMS input for five TCK periods. 
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entity 
lnte1486TM_DX2 
is 
generic(PHYSlCAL_PlN_MAP: 
string 
.- "PGA_17x17"); 


bit; 
bit; 
bit; 
bit_vector 
(4 to 
31); 
-- Address 
bus 
(words) 
bit; 
bit; 
bit_vector(O 
to 
3); 
bit; 
bit; 
bit; 
bit; 
bit; 
bit; 
bit; 
bit_vector(O 
bit; 
bit_vect 
bit; 
bit; 
bit· 
bit 


(A20M 
ABUS2 
ABUs3 
ABUS 
ADS 
AHOLD 
BE 
BLAST 
BOFF 
BRDY 
BREQ 
BS8 
BS16 
CLK 
DBUS 
DC 
DP 
EADS 
FERR 
FLUSH 
HLDA 
HOLD 
lGNNE 
lNTR 
KEN 
LOCK 
MIa 
NC 
NC1 
NMl 


PCD 
: ~ 
PCHK 
PLOCK 
~ou 
PWT 
ut 


RD~ 
n 
RES 
in 
TC 
, TDl 
: 
T 
: out 
UP 
: in 
bit; 
VCC 
iinkage 
bit_vector(l 
VSS 
linkage 
bit_vector(l 
WR 
out 
bit); 


Scan 
Port 
inputs 
Scan 
Port 
output 


to 
24); 
to 28); 


"A20M 
"ABUS2 
"ABUS3 
"ABUS 


D15, 
" 
& 
Q14, 
" 
& 
R15, 
" 
& 
(S16, 
Q12, 
S7, 
Q10, 
Q7, 
S3, 
S17, 
" 
& 
A17, 
" 
& 
(K15, 
J16, 


S15, 
Q13, 
R13, 
Q11, 
s13, 
R12," 
& 
S5, 
R7, 
Q9, 
Q3, 
R5, 
Q4, 
Q8, 
Q6, 
R2, 
52, 
Sl, 
R1, 
P2, 
P3, 
"ADS 
"AHOLD 
"BE 


Q5," 
& 
Ql)," 
& 


• 


"BLAST 
R16, 
& 
'BOFF 
017, 
& 
"BRDY 
H1s, 
& 
'BREQ 
Q1s, 
& 
"BS8 
016, 
& 
"BS16 
C17, 
& 
"CLK 
C3, 
& 
"DBUS 
IP1, N2, Nl,. H2, M3, J2, 
L2, L3, F2, Dl, E3, 
& 
Cl, G3, 
02, K3, F3, J3, 
03, C2, B1, A1, 
B2, 
& 
A2, A4, A6, 
B6, C7, C6, C8, A8, C9, B8), " 
& 
"DC 
M1s, 
& 
"DP 
(N3, 
Fl, 
H3, 
AS) , 
& 
"EADS 
B17, 
& 
"FERR 
C14, 
& 
'FLUSH 
C1s, 
& 
'HLDA 
P1s, 
& 
'HOLD 
E1S, 
& 
"IGNNE 
A1S, 
& 
'INTR 
A16, 
& 


~ 


'KEN 
FlS, 
& 
'LOCK 
N1S, 
& 
"MIO 
N16, 
& 
"NC 
(R17, G1s, 
C10, 
C12, 
A10, A12, 
AD), 
" & 
'NC1 
S4, 
& 
'NMI 
B1S, 
& 
'PCD 
J17, 
& 
'PCHK 
Q17, 
& 
'PLOCK 
Q16, 
& 
'PWT 
L1s, 
& 
"ROY 
F16, 
& 
'RESET 
C16, 
& 
"TCK 
A3, 
& 
'TDI 
A14, 
& 
"TOO 
B16, 
& 
'TMS 
B14, 
& 
'UP 
Cll, 
& 
'VCC 
IR8, 
R9, 
K2, 
K16, 
& 
J1, 
H16, 
B9, 
B11, 
& 
R3, 
R6) , 
"VSS 
(S6, 
S 
Q2, 
P17, M1, 
& 
M17, 
G17, 
El, 
E17, 
& 
B3 
B 
"WR 
N17'* 
; 


attribute 
true; 
attribute 
truej 
attribute 
true; 
attribute 
(2S.0e6, BOTH) ; 


attribute 
entity 
ls 4; 


attribute 
Instruc 
"BYPASS 
"EXTEST 
"SAMPLE 
"IDCODE 
'RUNBIST 
'PRIVATE 


code of Inte1486TM_DX2: 
entity 
is 
& 
& 
& 
& 
(1000),' 
& 
(0011,0100,0101,0110,0111,1001,1010,1011,1100,1101,1110)'; 


attribute 
Instruction_Capture 
of Inte1486TM_DX2: 
entity 
is '0001"; 
-- there 
is no Instruction_Disable 
attribute 
for Inte1486TM_DX2 


attribute 
Instruction_Private 
of 
Inte1486TM_DX2: 
entity 
is "private"; 


attribute 
Instruction_Usage 
of Inte1486TM_DX2: 
entity 
is 
'RUNBIST 
(registers 
BIST; 
" 
& 
'result 
0;' 
& 
'clock CLK 
in Run_Test_Id1e;"& 
"length 
1200000)"; 


attribute 
Idcode_Register 
of 
"0000" 
& 
"0000010000010001" 
& 
"00000001001' 
& 
'1" ; 


Inte1486TM_DX2: 
entity 
is 
--version 
--new 
part 
number 
--manufacturers 
identity 
--requlred 
by the standard 
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--{***** 
•• ************************************************************} 
--{ 
The 
first 
cell 
is 
closest 
to 
TOO 
--(* •••••••• 
* •• **** •• ** •••• 
*****.*** 
•••• * ••••• 
* •• ***.*****************} 


attribute 
Boundary_Register 
of 
Inte148GTM_OX2: 
entity 
is 
, 0 
(RC2, 
ABlJS2, 
output3, 
X, 
102, 
1, 
ZI, ' 
& 


'I 
(BC_2, 
ABUS3, 
output) 
, 
X, 
102, 
1, 
ZI, ' 
& 
'2 
IBC_G, 
ABUS(4) , 
bidi r, 
X, 
102, 
I. 
ZI, ' 
& 
'3 
(BC_G, 
ABUS(5), 
bidir, 
X, 
102, 
1. 
Z). ' 
& 
'4 
(BC_l. 
NCI. 
input, 
X), ' 
& 
'S 
(BC_G, 
ABUSlG), 
bidi r, 
X, 
102. 
ZI. ' 


'G 
IBC_G, 
ABUS(7) , 
bidi 
r. 
x, 
102. 
ZI. ' 


'7 
(BCG, 
ABUS(8), 
bidir. 
X, 
102, 
Z), ' 
'8 
(BC_G, 
ABUS(9), 
bidir, 
X, 
102, 
Z), ' 


'9 
(BC_G. 
ABUSI 10) • 
bidi 
r, 
X, 
102, 
ZI, ' 


'10 
lBC_G. 
ABUS(11) • 
bidi 
r, 
X. 
102, 
Z), ' 


'11 
lBC_G. 
ABUS(12) • 
bidi 
r, 
X. 
102, 
Z), ' 


'12 
lBC_G. 
ABUS(1)) 
• 
bidi 
r, 
X. 
102. 
Z), ' 


'J) 
lBC_G. 
ABUS(14) • 
bidi 
r, 
X. 
102. 
Z) 
'14 
(Be_G, 
ABUS(15) • 
bidir. 
X. 
102. 
Z 


'IS 
(BC_G. 
ABUS(IGI. 
bidi 
r. 
X. 
102. 


'lG 
(BCG. 
ABUS(171 , 
bidir. 
X. 
102. 
, 17 
(BC_G, 
ABUSI181. 
bidir. 
X. 
102. 


'18 
(BC_G, 
ABUs1191. 
bidir. 
X. 
102. 


'19 
lBC_G. 
ABUS(201. 
bidir. 
X. 
10 
'20 
lBC_G. 
ABUSI211, 
bidir. 
X. 
'21 
(BC_G. 
ABus1221. 
bidir. 
X. 
'n 
(A(·_.G. 
IIB1I5(2) I, 
bid1r. 
X 


'23 
lBC_G. 
ABUS{241, 
bidir. 
, 24 
IBC_G. 
ABUSI251, 
bidir, 


'25 
(BC_G, 
ABUSI2G), 
bidi 
'2G 
IBC_G, 
ABUS(27), 
bi 


'27 
(BC_G, 
ABU5(28), 
b 
'28 
(BC_G, 
ABUs1291, 
& 


'29 
(BC_G, 
ABUSI30), 
& 
'30 
(BC_G, 
ABUS1311 
& 
'31 
(BC_G, 
OPIO) , 
),' 
& 
'32 
(BC_G, 
OBU5( 
Z). ' 
& 
'33 
(BC_G, 
OBUSIf 
Z). ' 
& 
')4 
(BC_G, 
0~121. 
I. 
Z). ' 
& 
')5 
(BC_G, 
OBd 131. 
I. 
Z). ' 
& 
'3G 
lBC_G, 
OBUS(4) • 
r. 
1 
I. 
Z). ' 
& 


'37 
lBC_G, 
[)RUSISI. 
r. 
103. 
1, 
Z). ' 
& 


'38 
(BC_G, 
OBUSIGI 
i 
103. 
1. 
Z), ' 
& 
'39 
lBC_G, 
OBUS(71. 
1d 
103. 
I. 
Z), ' 
& 
'40 
lBC_G. 
OP( 1 
bidi 
103. 
1. 
Z), ' 
& 
'41 
lBC_G. 
OB 
bldi 
r, 
X. 
103. 
1. 
Z), ' 
& 
'42 
lBC_G. 
b1d1 r, 
X. 
103. 
I. 
Z), ' 
& 
'43 
(BC_G. 
), 
bidi 
r, 
X, 
103. 
1. 
ZI. ' 
& 
'44 
lBC_G 
11) • 
bidi 
r, 
X, 
103, 
1, 
Z). ' 
& 
'45 
lBC 
S (12), 
bidi r, 
X. 
103. 
1, 
ZI. ' 
& 
'4G 
lBC_ 
BUS(1)) 
• 
bidi 
r, 
X. 
103, 
1. 
Z). ' 
& 
'47 
lBC_G. 
OBU5(14) • 
bidi 
r. 
X. 
103, 
1. 
Z), ' 
& 
'48 
IBC_G, 
OBUS(IS) • 
bidi r, 
X. 
103. 
1. 
ZI, ' 
& 
, 49 
lBC_G. 
OP(2) , 
bid1 r, 
x, 
103. 
I. 
ZI. ' 
& 
'50 
1BC_G, 
OBUSlIG) • 
bidi 
r. 
X, 
103, 
I. 
ZI. ' 
& 
-51 
(RC_G, 
OBUS(17) , 
bid1r. 
X, 
103, 
I, 
Z). ' 
& 
'52 
(BC_G, 
OBUS(18) , 
bidi 
r. 
X. 
103, 
I, 
ZI, ' 
& 
'53 
(BC_G. 
OBUS(19) , 
bidir. 
X. 
103, 
1, 
Z). ' 
& 
'S4 
(BC_G, 
OBUS(201, 
bidi 
r. 
X. 
103, 
1, 
ZI. ' 
& 
'SS 
(BC_G. 
OBUS(21). 
bidi 
r. 
X. 
103. 
1, 
Z). ' 
& 


'5G 
(BC_G, 
OBUs1221. 
bidi r, 
X, 
103. 
1, 
Z). ' 
& 
'S7 
lBC_G, 
DBUS(231, 
b1dir. 
X. 
103, 
1, 
Z). ' 
& 
'SB 
lBC_G. 
OP(3) • 
bid1r. 
X. 
103, 
I. 
Z). ' 
& 
'59 
lBC_G. 
OBUS(241. 
b1di r, 
X, 
103. 
I. 
Z). ' 
& 
'GO 
IBC_G, 
OBUS(25) • 
bidi 
r, 
X. 
103, 
I. 
Z), ' 
& 
'G1 
IBC_G, 
OBUS(2G) • 
bidi 
r, 
X. 
103. 
1. 
Z), ' 
& 
'G2 
IBC_G, 
OBUS(271. 
bidi 
r, 
X. 
103. 
I. 
Z), ' 
& 
'G3 
(BC_G. 
OBUS(28) , 
b1di r, 
X. 
103. 
1, 
Z), ' 
& 
'G4 
(BC_G. 
OBUS(29) , 
bidi 
r, 
X. 
103. 
I. 
Z), ' 
& 
'G5 
IBC_G, 
OBUS(30) • 
bidi 
r, 
x, 
i03. 
I. 
Z), ' 
& 
'GG 
(BC_G, 
OBUS(31) , 
bidi 
r, 
X, 
103, 
I, 
Z), ' 
& 
'G7 
1BC_I. 
IGNNE, 
input, 
X), ' 
& 


'G8 
(BC_2, 
FERR, 
output), 
X, 
100. 
I. 
Z). ' 
& 
'G9 
(BC_I. 
UP, 
input, 
X), ' 
& 
'70 
(BC_I. 
NMI, 
input, 
X). ' 
& 
'71 
(BCI. 
INTR, 
input, 
X). ' 
& 


"72 
"73 
"74 
"75 
"76 
"77 
""lO 
"79 
"80 
"81 
"82 
"83 
"84 
"85 
"86 
"87 
"88 
"89 
"90 
"91 
"92 
"93 
"94 
"95 
"96 
"97 
"98 
"99 
"100 
"101 
"102 


input, 
Xl," 
input, 
Xl," 
input, 
Xl," 
input, 
X)," 
output3, 
X, 
output3, 
X, 
output3, 
X, 
output3, 
X, 
output3, 
X, 
output3, 
output3, 
output 
, 
outp 
ou 
o 


& 


& 
& 


& 
100, 
l. 


101, 
1, 
101, 
I, 
101, 
I, 
X, 
101, 
1, 
1) , " 
& 
1) , " 
& 
1) , " 
& 
1) "; 


Zl, 
" 
Z), 
" 
Z) , " 
Z) , " 
Z), 
" 
DISMISC 
DISBUS 
DISABUS 
DISWR 


The Intel486 microprocessor family provides several 
features which simplify the debugging process. The 
three categories of on-chip debugging aids are: 
1) the code execution breakpoint opcode (OCCH), 
2) the single-step capability provided by the TF bit 
in the flag register, and 
3) the code and data breakpoint capability provided 
by the Debug Registers DRO-3, DR6, and DR? 


A single-byte-opcode breakpoint instruction is avail- 
able for use by software debuggers. The breakpoint 
opcode is OCCH,and generates an exception 3 trap 
when executed. In typical use, a debugger program 
can "plant" the breakpoint instruction at all desired 
code execution breakpoints. The single-byte break- 
point opcode is an alias for the two-byte general 
software interrupt instruction, INT n, where n=3. 
The only difference between INT 3 (OCCh)and INT n 
is that INT 3 is never IOPL-sensitive but INT n is 
IOPL-sensitive in Protected Mode and Virtual 8086 
Mode. 


9.2 
Single-Step Trap 


If the single-step flag (TF, bit 8) in the EFLAG regis- 
ter is found to be set at the end of an instruction, a 


single-step exception occurs. The single-step ex- 
ception is auto vectored to exception number 1. Pre- 
cisely, exception 1 occurs as a trap after the instruc- 
tion following the instruction which set TF. In typical 
practice, a debugger sets the TF bit of a flag register 
image on the debugger's stack. It then typically 
transfers control to the user program and loads the 
flag image with a signal instruction, the IRET instruc- 
tion. The single-step trap occurs after executing one 
instruction of the user program. 


Since the exception 1 occurs as a trap (that is, it 
occurs after the instruction has already executed), 
the CS:EIP pushed onto the debugger's stack points 
to the next unexecuted instruction of the program • 
being debugged. An exception 1 handler, merely by 
ending with an IRET instruction, can therefore effi- 
ciently support single-stepping through a user pro- 
gram. 


9.3 
Debug Registers 


The Debug Registers are an advanced debugging 
feature of the Intel486 microprocessor family. They 
allow data access breakpoints as well as code exe· 
cution breakpoints. Since the breakpoints are indi- 
cated by on-chip registers, an instruction execution 
break-point can be placed in ROM code or in code 
shared by several tasks, neither of which can be 
supported by the INT3 breakpoint opcode. 


BREAKPOINT 
0 LINEAR ADDRESS 


BREAKPOINT 
1 LINEAR ADDRESS 


BREAKPOINT 
2 LINEAR ADDRESS 


BREAKPOINT 
3 LINEAR ADDRESS 


Intel reserved. 
Do not define. 


Intel reserved. 
Do not define. 


0 
B 
B B 00 o 0 0 0 00 
o B B B B 
T 
SO 
3 
2 
1 
0 


LEN I~Iw I LEN IRIw I LEN IRIw I LEN IRIw 
0 
OG 
00 
OG 
L G LG 
L G 
L G L 


333222111000 
0 
E E 3 
3 
2 
2 
1 
1 
0 
0 


The 
Intel486 
microprocessor 
contains 
six 
Debug 
Registers, 
providing 
the ability to specify 
up to four 
distinct 
breakpoints 
addresses, 
breakpoint 
control 
options, 
and read breakpoint 
status. Initially after re- 
set, breakpoints 
are in the disabled 
state. Therefore, 
no breakpoints 
will occur unless the debug registers 
are programmed. 
Breakpoints 
set up in the Debug 
Registers 
are autovectored 
to exception 
number 
1. 


9.3.1 
LINEAR 
ADDRESS 
BREAKPOINT 
REGISTERS 
(DRO-DR3) 


Up to four breakpoint 
addresses 
can be specified 
by 
writing 
into 
Debug 
Registers 
DRO-DR3, 
shown 
in 
Figure 
9.1. The breakpoint 
addresses 
specified 
are 
32-bit 
linear 
addresses. 
Intel486 
microprocessor 
hardware 
continuously 
compares 
the 
linear 
break- 
point 
addresses 
in DRO-DR3 
with 
the 
linear 
ad- 
dresses 
generated 
by executing 
software 
(a linear 
address 
is the result of computing 
the effective 
ad- 
dress and adding the 32-bit segment 
base address). 
Note that if paging is not enabled 
the linear address 


NOTE: 
indicates 
Intel reserved: 
Do not define; 
SEE SECTION 
2.3.10 
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equals 
the 
physical 
address. 
If paging 
is enabled, 


the linear address 
is translated 
to a physical 
32-bit 


address 
by the on-chip 
paging 
unit. Regardless 
of 


whether 
paging 
is enabled 
or 
not, 
however, 
the 


breakpoint 
registers 
hold linear addresses. 


A Debug Control 
Register, 
DR? shown in Figure 9.1, 


allows 
several 
debug 
control 
functions 
such as en- 


abling 
the breakpoints 
and setting 
up other 
control 


options 
for the breakpoints. 
The fields within the De- 


bug Control 
Register, 
DR?, are as follows: 


LENi (breakpoint 
length specification 
bits) 


A 2-bit LEN field exists 
for each 
of the four break- 


points. 
LEN specifies 
the length 
of the associated 
breakpoint 
field. 
The 
choices 
for data 
breakpoints 
are: 1 byte, 2 bytes, and 4 bytes. Instruction 
execu- 


tion breakpoints 
must have a length 
of 1 (LENi 
= 
00). Encoding 
of the LENi field is as follows: 


ORO 


DR1 


DR2 


DR3 


DR4 


DR5 


Usage of Least 
LENI 
Breakpoint 
Significant Bits In 
Encoding 
Field Width 
Breakpoint Add,.sa 
Register I, (I =0 - 3) 


00 
1 byte 
All 32-bits used to 
specify a single-byte 
breakpoint field. 


01 
2 bytes 
A1-A31 
used to specify 
a twO-byte, word- 
aligned breakpoint field. 
AO in Breakpoint 
Address Register is not 
used. 


10 
Undefined- 
do not use 
this encoding 


11 
4 bytes 
A2-A31 
used to specify 
a four-byte, dword- 
aligned breakpoint field. 
AOand A1 in 
Breakpoint Address 
Register are not used. 


The LENi field controls 
the size of breakpoint 
field i 
by controlling 
whether 
all low-order 
linear 
address 
bits in the breakpoint 
address 
register 
are used to 
detect 
the 
breakpoint 
event. 
Therefore, 
all break- 
point fields are aligned; 
2-byte 
breakpoint 
fields be- 
gin 
on 
Word 
boundaries, 
and 
4-byte 
breakpoint 
fields begin on Dword boundaries. 


The following 
is an example 
of various 
size break- 
point fields. Assume 
the breakpoint 
linear address 
in 
DR2 is 00000005H. 
In that 
situation, 
the following 
illustration 
indicates 
the 
region 
of 
the 
breakpoint 
field for lengths 
of 1, 2, or 4 bytes. 


DR2 = OOOOOOOSH; LEN2 = 008 
1 


31 


I 
1~¢fl~101= 


DR2=OOOOOOOSH; 
LEN2 
= 018 
31 
0 
I 
I 
+- bkpt fld2 -+ 
I 


00000008H 


00000004H 


OOOOOOOOH 


DR2=OOOOOOOSH; 
LEN2 = 
118 
31 
0 


00000008H 


00000004H 


OOOOOOOOH 


RWi (memory 
access 
qualifier 
bits) 


A 2-bit RW field 
exists 
for each 
of the four 
break- 
points. The 2-bit RW field specifies 
the type of usage 
which must occur in order to activate 
the associated 
breakpoint. 


RW 
Usage 
Encoding 
Causing Breakpoint 


00 
Instruction execution only 
01 
Data writes only 
10 
Undefined-<lo not use this encoding 
11 
Data reads and writes only 


RW encoding 00 is used to set up an instruction 
execution breakpoint. RW encodings 01 or 11 are 
used to set up write-only or read/write data break- 
points. 


Note that Instruction 
execution 
breakpoints 
are 
taken 
as 
faults 
(i.e., before the instruction exe- 
cutes), but data 
breakpoints 
are taken 
as traps 
(i.e., after the data transfer takes place). 


Using LENi and RWi to Set Data Breakpoint i 


A data breakpoint can be set up by writing the linear 
address into DRi (i = 0-3). 
For data breakpoints, 
RWi can = 01 (write-only) or 11 (write/read). LEN 
can = 00, 01, or 11. 


If a data access entirely or partly falls within the data 
breakpoint field, the data breakpoint condition has 
occurred, and if the breakpoint is enabled, an excep- 
tion 1 trap will occur. 


Using LENi and RWi to Set Instruction Execution 
Breakpoint i 


An instruction execution breakpoint can be set up by 
writing address of the beginning of the instruction 
(including prefixes if any) into DRi (i = 0-3). 
RWi 
must = 00 and LEN must = 00 for instruction exe- 
cution breakpoints. 


If the instruction beginning at the breakpoint address 
is about to be executed, the instruction execution 
breakpoint condition has occurred, and if the break- 
point is enabled, an exception 1 fault will occur be- 
fore the instruction is executed. 


Note that an instruction execution breakpoint ad- 
dress must be equal to the beginning 
byte address 
of an instruction (including prefixes) in order for the 
instruction execution breakpoint to occur. 


The Debug Registers can only be accessed in Real 
Mode or at privilege level 0 in Protected Mode. The 
GO bit, when set, provides extra protection against 
any Debug Register access even in Real Mode or at 
privilege level 0 in Protected Mode. This additional 
protection feature is provided to guarantee that a 
software debugger can have full control over the De- 


bug Register resources when required. The GO bit, 
when set, causes an exception 1 fault if an instruc- 
tion attempts to read or write any Debug Register. 
The GO bit is then automatically cleared when the 
exception 1 handler is invoked, allowing the excep- 
tion 1 handler free access to the debug registers. 


GE and LE (Exact data breakpoint match, global and 
local) 


The breakpoint mechanism of the Intel486 micro- 
processor family differs from that of the Inte1386. 
The Intel486 microprocessor always does exact 
data breakpoint matching, regardless of GE/LE bit 
settings. Any data breakpoint trap will be reported 
exactly after 
completion 
of the 
instruction that 
caused the operand transfer. Exact reporting is pro- 
vided by forcing the Intel486 microprocessor execu- 
tion unit to wait for completion of data operand 
transfers before beginning execution of the next in- 
struction. 


When the Intel486 microprocessor performs a task 
switch, the LE bit is cleared. Thus, the LE bit sup- 
ports fast task switching out of tasks, that have 
enabled the exact data breakpoint match for their 
task-local breakpoints. The LE bit is cleared by the 
processor during a task switch, to avoid having ex- 
act data breakpoint match enabled in the new task. 
Note that exact data breakpoint match must be re- 
enabled under software control. 


The Intel486 microprocessor GE bit is unaffected 
during a task switch. The GE bit supports exact data 
breakpoint match that is to remain enabled during all 
tasks executing in the system. 


Note that Instruction 
execution 
breakpoints are al- 
ways reported exactly. 


If either Gi or Li is set then the associated breakpoint 
(as defined by the linear address in DRi, the length 
in LENi and the usage criteria·in RWi) is enabled. If 
either Gi or Li is set, and the Intel486 microproces- 
sor detects the ith breakpoint condition, then the ex- 
ception 1 handler is invoked. 


When the Intel486 microprocessor performs a task 
switch to a new Task State Segment (TSS), all Li 
bits are cleared. Thus, the Li bits support fast task 
switching out of tasks that use some task-local 
breakpoint registers. The Li bits are cleared by the 
processor during a task switch, to avoid spurious ex- 
ceptions in the new task. Note that the breakpoints 
must be re-enabled under software control. 


All Intel486 microprocessor Gi bits are unaffected 
during a task switch. The Gi bits support breakpoints 
that are active in all tasks executing in the system. 
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9.3.3 
DEBUG 
STATUS 
REGISTER 
(DR6) 


A Debug Status Register, DR6 shown in Figure 9.1, 
allows the exception 1 handler to easily determine 
why it was invoked. Note the exception 1 handler 
can be invoked as a result of one of several events: 
1) DROBreakpoint fault/trap. 
2) DR1 Breakpoint fault/trap. 
3) DR2 Breakpoint fault/trap. 
4) DR3 Breakpoint fault/trap. 
5) Single-step (TF) trap. 


6) Task switch trap. 
7) Fault due to attempted debug register access 
when GD=1. 


The Debug Status Register contains single-bit flags 
for each of the possible events invoking exception 1. 
Note below that some of these events are faults (ex- 
ception taken before the instruction is executed), 
while other events are traps (exception taken after 
the debug events occurred). 


The flags in DR6 are set by the hardware but never 
cleared by hardware. Exception 1 handler software 
should clear DR6 before returning to the user pro- 
gram to avoid future confusion in identifying the 
source of exception 1. 


The fields within the Debug Status Register, DR6, 
are as follows: 


Bi (debug fault/trap due to breakpoint 0-3) 


Four breakpoint indicator flags, BO-B3, correspond 
one-to-one with the breakpoint registers in DRO- 
DR3. A flag Bi is set when the condition described 
by DRi, LENi, and RWi occurs. 


If Gi or Li is set, and if the ith breakpoint is detected, 
the processor will invoke the exception 1 handler. 
The exception is handled as a fault if an instruction 
execution breakpoint occurred, or as a trap if a data 
breakpoint occurred. 


IMPORTANT 
NOTE: A flag Bi is set whenever the 


hardware detects a match condition on enabled 
breakpoint i. Whenever a match is detected on at 
least one enabled 
breakpoint i, the hardware imme- 


diately sets all Bi bits corresponding to breakpoint 
conditions matching at that instant, whether enabled 
or not. Therefore, the exception 1 handler may see 
that multiple Bi bits are set, but only set Bi bits corre- 
sponding to enabled 
breakpoints (Li or Gi set) are 


true indications of why the exception 1 handler was 
invoked. 


BD (debug fault due to attempted ,register access 
when GD bit set) 


This bit is set if the exception 1 handler was invoked 
due to an instruction attempting to read or write to • 
the debug registers when GD bit was set. If such an 
event occurs, then the GD bit is automatically 
cleared when the exception 1 handler is invoked, 
allowing handler access to the debug registers. 


BS (debug trap due to single-step) 


This bit is set if the exception 1 handler was invoked 
due to the TF bit in the flag register being set (for 
single-stepping). 


BT (debug trap due to task switCh) 


This bit is set if the exception 1 handler was invoked 
due to a task switch occurring to a task having a 
Intel486 microprocessor TSS with the T bit set. Note 
the task switch into the new task occurs normally, 
but before the first instruction of the task is execut- 
ed, the exception 1 handler is invoked. With respect 
to the task switch operation, the operation is consid- 
ered to be a trap. 


9.3.4 
USE OF RESUME 
FLAG (RF) IN FLAG 


REGISTER 


The Resume Flag (RF) in the flag word can sup- 
press an instruction execution breakpoint when the 
exception 1 handler returns to a user program at a 
user address which is also an instruction execution 
breakpoint. 


This section describes the Intel486 DX2 microproc- 
essor instruction set. Tables 10.1 through 10.3 list all 
instructions along with instruction encoding dia- 
grams and clock counts. Further details of the in- 
struction encoding are then provided in Section 
10.2, which 
completely 
describes the 
encoding 
structure and the definition of all fields occurring 
within the Intel486 DX2 microprocessor instructions. 


10.1 
Intel486™ 
DX2 Microprocessor 
Instruction 
Encoding and Clock 
Count Summary 


To calculate elapsed time for an instruction, multiply 
the instruction clock count, as listed in Tables 10.1 
through 10.3 by the processor core clock period 
(e.g., 20 ns for a 50 MHz Intel486 DX2 microproces- 
sor). 


For more detailed information on the encodings of 
instructions, refer to Section 10.2 Instruction Encod- 
ings. Section 10.2 explains the general structure of 
instruction encodings, and defines exactly the en- 
codings of all fields contained within the instruction. 


The Intel486 DX2 microprocessor instruction core 
clock count tables give clock counts assuming data 
and instruction accesses hit in the cache. The com- 
bined instruction and data cache hit rate is over 
90%. 


A cache miss will force the Intel486 DX2 microproc- 
essor to run an external bus cycle. The Intel486 DX2 
microprocessor 
32-bit 
burst 
bus is 
defined 
as 
r-b-w. 


Where: 
r = The number of bus clocks in the first cycle of a 
burst read or the number of clocks per data 
cycle in a non-burst read. 
b = The number of bus clocks for the second and 
subsequent cycles in a burst read. 


w = The number of bus clocks for a write. 


The fastest bus the Intel486 DX2 microprocessor 
can support is 2 -1 - 2 assuming 0 wait states. The 
clock counts in the cache miss penalty column as- 
sume a 2-1-2 
bus. For slower busses add r-2 
clocks to the cache miss penalty for the first dword 
accessed. Other factors also affect instruction clock 
counts. 


Instruction 
Clock Count 
Assumptions 
1. The external bus is available for reads or writes 
at all times. Else add bus clocks to reads until 
the bus is available. 


2. Accesses are aligned. Add three core clocks to 
each misaligned access. 
3. Cache fills complete before subsequent access- 
es to the same line. If a read misses the cache 
during a cache fill due to a previous read or pre- 
fetch, the read must wait for the cache fill to 
complete. If a read or write accesses a cache 
line still being filled, it must wait for the fill to 
complete. 
4. If an effective address is calculated, the base 
register is not the destination register of the pre- 
ceding instruction. If the base register is the 
destination register of the preceding instruction 
add 1 to the core clock counts shown. Back-to- 
back PUSH and POP instructions are not affect- 
ed by this rule. 
5. An effective address calculation uses one base 
register and does not use an index register. 
However, if the effective address calculation 
uses an index register, 1 core clock may be 
added to the clock count shown. 


6. The target of a jump is in the cache. If not, add r 
clocks for accessing the destination instruction 
of a jump. If the destination instruction is not 
completely contained in the first dword read, 
add a maximum of 3b bus clocks. If the destina- 
tion instruction is not completely contained in 
the first 16 byte burst, add a maximum of anoth- 
er r+ 3b bus clocks. 
7. If no write buffer delay, w bus clocks are added 
only in the case in which all write buffers are full. 


8. Displacement and immediate not used together. 


If displacement and immediate used together, 1 
core clock may 
be added to the core clock 
count shown. 
9. No invalidate cycles. Add a delay of 1 bus clock 
for each invalidate cycle if the invalidate cycle 
contends for the internal cache/external 
bus 
when the Intel486 DX2 CPU needs to use it. 


10. Page translation hits in TLB. A TLB miss will add 
13, 21 or 28 bus clocks + 1 possible core clock 
to the instruction depending on whether the Ac- 
cessed and/or Dirty bit in neither, one or both of 
the page entries needs to be set in memory. 
This assumes that neither page entry is in the 
data cache and a page fault does not occur on 
the address translation. 
11. No exceptions are detected during instruction 
execution. Refer to Interrupt core Clock Counts 
Table for extra clocks if an interrupt is detected. 


12. Instructions that read multiple consecutive data 
items (Le.task switch, POPA, etc.) and miss the 
cache are assumed to start the first access on a 
16-byte boundary. If not, an extra cache line fill 
may be necessary which may add up to (r+3b) 
bus clocks to the cache miss penalty. 


INSTRUCTION 
FORMAT 
Cache Hit 
Nolee 


INTEGER OPERATIONS 


MOV ~ Move: 


reg110 reg2 
1000100W 
11 
reg1 reg2! 
1 


reg210reg1 
1000101w 
11 
reg1 reg21 
1 


memory 
to reg 
1000101w 
mod 
reg 
rIm I 
1 


reg to memory 
1000100w 
mod 
reg 
rIm I 
1 


Immediate 
to reg 
1100011 
w 
11000 
reg I immediate 
data 
1 


or 
1011w reg 
immediate 
data 
1 


Immediate 
to Memory 
1100011 
w 
mod 000 
I 
I displacement 
1 
r m 
immediate 


Memory 
to Accumulator 
1010000w 
full displacement 
1 


Accumulator 
to Memory 
1010001w 
full displacement 
1 


MOVSX/MOVZX 
= Move wllh Slgn/Zero 
EXlenelon 


reg210reg1 
1 00001111 
1 1011 
z11 w 
111 
reg1 reg21 
3 


memory 
to reg 
1 00001111 
1 1011 
z11 w 
1mod 
reg 
r/ml 
3 


z 
Instruction 


0 
MOVZX 
1 
MOVSX 


PUSH ~ Pueh 


reg 
I 
11111111 
11 
110 
reg 1 
4 


or 
101010 
reg 
1 


memory 
1 11111111 
mod 110 
rIm I 
4 


immediate 
1 01101000 
immediate 
data 
1 


PUSHA ~ Pueh All 
1 01100000 
11 


POP = Pop 


reg 
1 10001111 
111 
000 
reg 1 
4 


or 
101011 
reg 1 
1 


memory 
I 
10001111 
1mod 000 
rIm I 
5 


POPA = Pop A" 
1 01100001 
1 
9 


XCHG ~ Exchenge 


reg1 with reg2 
1 1000011w 
11 
reg1 reg21 
3 


Accumulator 
with reg 
110010 
reg 
3 


Memory 
with reg 
I 1000011w 
mod 
reg 
r/ml 
5 


NOP = No Operallon 
1 10010000 
1 


LEA ~ Load EA 10 Reglele. 
1 10001101 
mod 
reg 
rIm I 


no index register 
1 
'Mth index register 
2 


• 


INSTRUCTION 
FORMAT 
caetleHIt 
Not •• 


INTEGER OPERATIONS 
(Continued) 


lnetructlon 
TTT 


ADD = Add 
000 


ADC = Add with Carry 
010 
AND ~ Logical AND 
100 


OR - 
Logical OR 
001 


SUB ~ Subtract 
101 


SBB - 
Subtract with Borrow 
011 


XOR - 
Logical Exclusive OR 
110 


regltoreg2 
OOTTTOOw 
11 
regl 
reg21 
1 


reg2to regl 
OOTTTOlw 
11 
regl 
reg21 
1 


memory 
to regtster 
OOTTTOlw 
mod 
reg 
rIm I 
2 


register to memory 
OOTTTOOw 
mod 
reg 
rIm I 
3 
U/L 


immed4ate 
to register 
100000sw 
11 
TTT 
reg I immediate 
register 
1 


immediate 
to accumulator 
00TTT10w 
immediate 
data 
1 


immediate 
to memory 
I 100000sw 
mod TTT 
rIm I immediate 
data 
3 
U/L 


lnetructlon 
TTT 


INC - 
Increment 
000 
DEC - 
Decrement 
001 


reg 
I l111111w 
111 
TTT 
reg I 
1 


or 
101TTT 
reg I 
1 


memory 
I l111111w 
I mod TTT 
rIm I 
3 
U/L 


lnetructlon 
TTT 


NOT = Logical Complement 
010 


NEG = Negate 
011 


reg 
I 1111011w 
111 
TTT 
reg I 
1 


memory 
I 1111011 
w I mod TTT 
rIm I 
3 
U/L 


CMP-~ 


regl with reg2 
0011100w 
11 
regl 
reg2 
1 


reg2 with regl 
0011101 
w 
11 
regl 
reg2 
1 


memory 
with register 
0011100w 
mod 
reg 
rIm 
2 


register 
with memory 
0011101 
w 
mod 
reg 
rIm 
2 


immediate 
with register 
100000sw 
11 
111 
reg 
immediate 
data 
1 


immediate 
with ace. 
0011110w 
immediate data 
1 


immediate with memory 
100000sw 
mod 111 
rim I immediate 
data 
2 


TEST - 
Loglc8I Com,. 
•• 


regl and reg2 
I 1000010w 
111 
regl 
reg21 
1 


memory 
and register 
I 1000010w 
I mod 
reg 
rIm I 
2 


immediate 
and register 
I 1111011w 
111 
000 
reg I immediate 
data 
1 


immediate 
and ace. 
I 1010100w 
I immediate 
data 
1 


immediate 
and memory 
I 1111011 
w I mod 000 
rIm I immediate 
data 
2 


INSTRUCTION 
FORMAT 
ClIche Hit 
Not •• 


INTEGER OPERATIONS 
(Continued) 


MUL = Multiply 
(unsigned) 


ace. with register 
1 1111011 
w 
111 
100 
reg 1 


Multiplier-Byte 
13/18 
MN/MX,3 


Word 
13/26 
MN/MX,3 


Dword 
13/42 
MN/MX,3 


ace. with memory 
1 1111011 
w 
1mod 
100 
rIm I 


Multiplier-Byte 
13/18 
MN/MX,3 


Word 
13/26 
MN/MX,3 


Dword 
13/42 
MN/MX,3 


IMUL = Integer 
Multiply 
(signed) 


ace. with register 
1 1111011 
w 
111 
101 
reg 1 
, 


Multiplier-Byte 
13/18 
MN/MX,3 


Word 
13/26 
MN/MX,3 


Dword 
13/42 
MN/MX,3 


ace. with memory 
1 1111011 
w 
1mod 
101 
r/ml 


Multiplier-Byte 
13/18 
MN/MX,3 


Word 
13/26 
MN/MX,3 
Dword 
13/42 
MN/MX,3 


reg1 with reg2 
I 00001111 
I 
10101111 
111 
reg1 
reg21 


Multiplier-Byte 
13/18 
MN/MX,3 


Word 
13/26 
MN/MX,3 
Dword 
13/42 
MN/MX,3 


register with memory 
I 00001111 
1 10101111 
1mod 
reg 
rIm I 


Multiplier-Byte 
13/18 
MN/MX,3 
Word 
13/26 
MN/MX,3 
Dword 
13/42 
MN/MX,3 


reg1 with imm.10 reg2 
I 01101 
Os 1 
111 
reg1 
reg21 immediate data 


Multiplier-Byte 
13/18 
MN/MX,3 
Word 
13/26 
MN/MX,3 
Dword 
13/42 
MN/MX,3 


memo with imm. to reg. 
1 01101 
Os 1 
1mod 
reg 
rim I immediate data 


Multiplier-Byte 
13/18 
MN/MX,3 
Word 
13/26 
MN/MX,3 
Dword 
13/42 
MN/MX,3 
DIY = Divide (unsigned) 


acc. by register 
I 
1111011 
w 
111 
110 
reg I 


Divisor-Byte 
16 
Word 
24 
Dword 
40 


ace. by memory 
1 1111011 
w 
1mod 
110 
rIm I 


Divisor-Byte 
16 
Word 
24 
Dword 
40 


IDlY = Integer 
Divide (signed) 


acc. by register 
I 1111011 
w 
111 
111 
reg 1 


Divisor-Byte 
19 
Word 
27 
Dword 
43 


• 


INSTRUCTION 
FORMAT 
CKIleHil 
NotM 


INTEGER OPERATIONS 
(Continued) 


ace. by memory 
11111011Wlmod111 
rIm 1 


Divisor-Byte 
20 


Word 
28 
- 


Dwcxd 
4-4 


CBW/CWOE 
= Convert 
Byte to WfKdl 
1100110001 
Convert 
WfKd to OwfKd 
3 


CWOICDQ 
= Convert 
WfKd to OwfKdl 


11001100d 
Convert 
OwfKd to 
3 
Q_fKd 


lnatrucllon 
TTT 


ROL = Rotate Left 
000 


ROO - 
Rotate Right 
001 
RCL - 
Rotate through GerTy Left 
010 


RCR ~ Rotate through Carry Right 
011 


SHL/SAL 
= Shift Logicall Arithmetic 
Left 
100 


SHR = Shift Logical Right 
101 
SAR = Shift Arithmetic 
Right 
111 


Not Through 
Cerry (ROL, ROR, SAL, SAR, SHL, end SHR) 


reg by1 
1101000w 
11 
TTT 
reg 1 
3 


memory by 1 
1101000w 
mod TTT 
rIm I 
4 


regbyCL 
1101001 
w 
11 
TTT 
reg 1 
3 


memory by CL 
1101001 
w 
mod TTT 
rIm I 
4 


reg by immediate count 
1100000w 
11 
TTT 
reg I immediate 8-bit data 
2 


mem by immediate count 
1100000w 
mod TTT 
rIm 1immediate 8-bit data 
4 


Through 
Carry (RCL end RCR) 


reg by 1 
1 1101000w 
11 
TTT 
reg 1 
3 


memory by 1 
I 1101000w 
mod TTT 
rIm I 
4 


reg by CL 
I 1101001 
w 
11 
TTT 
reg I 
8/30 
MN/MX,4 


memory by CL 
I 1101001 
w 
mod TTT 
rIm I 
9/31 
MN/MX,5 


reg by immediate count 
1 1100000w 
11 
TTT 
reg I immediate 8-bit data 
8/30 
MN/MX.4 


mem by immediate count 
1 1100000w 
mod TTT 
rIm 1immediate 8-bit data 
9/31 
MN/MX.5 


Inatruction 
TTT 


SHLD - 
Shift Left Double 
100 


SHRD - 
Shift Right Double 
101 


register with immediate 
I 00001111 
10TTT100 
111 
reg2 
reg1 I imm 8-bit data 
2 


memory by immediate 
I 00001111 
10TTT100 
1 mod 
reg 
rim Iimm a-bit data 
3 


register by CL 
1 00001111 
10TTT101 
111 
reg2 
reg11 
3 


memorybyCL 
1 00001111 
10TTT101 
I mod 
reg 
rIm 1 
4 


BSWAP = Byte Swap 
I 00001111 
11001 
reg 1 
1 


XADO - 
Exchllnge 
and Add 


reg 1, reg2 
I 00001111 
11100000W 
111 
reg2 
reg11 
3 


memory, reg 
1 00001111 
11100000w 
I mod 
reg 
rIm 1 
4 
U/L 


CMPXCHG 
= Compare 
and Exchanga 


regl. 
reg2 
I 00001111 
11011000W 
111 
reg2 
reg11 
8 


memory,reg 
1 00001111 
11011000W 
I mod 
reg 
r/ml 
7/10 
8 


intel~ 


INSTRUCTION 
FORMAT 
C8CIle Hit 
Not •• 


CONTROL 
TRANSFER 
(wilIlIn 
ugment) 


NOTE: Ti..- 
are jump !eken/not 
!eken 


Jccc = Jump on ccc 


8·bit displacement 
I 
0111 
tlln 
I 
8·bildisp. 
I 
3/1 
T/NT,23 


full displacement 
I 00001111 
I 
1000tlln 
Ifull displacement 
3/1 
TINT,23 


NOTE: Tl..- 
are jump takenlnot 
taken 


SETcccc = Set Byte on cccc (TImes are cccc truelf_) 


reg 
I 00001111 
I 
100111tn 
111 
000 
reg I 
4/3 


m&molY 
I 00001111 
I 
100111tn 
I mod 
000 
rIm I 
3/4 


Mnemonic 
Condlllon 
llln 
cccc 
0 
Overllowססoo 


NO 
No Overflow 
0001 


BINAE 
BelowlNot 
Above or Equal 
0010 


NB/AE 
Not Belowl Above or Equal 
0011 


E/Z 
Equal/Zero 
0100 


NEINZ 
Not Equal/Not 
Zero 
0101 


BEINA 
Below or Equal/Not 
Above 
0110 


NBE/A 
Not Below or Equal/ Above 
0111 


S 
Sign 
1000 


NS 
Not Sign 
1001 


PIPE 
Pari1y/Pari1y Even 
1010 


NP/PO 
Not Pari1y/Pari1y Odd 
1011 


L/NGE 
Less Than/Not 
Greater or Equal 
1100 


NUGE 
Not Less ThanlGreeter 
or Equal 
1101 


LEING 
Less Than or Equal/Greater 
Than 
1110 


NLE/G 
Not Less Than or Equal/Greater 
Than 
1111 


LOOP - 
LOOP CX Tomes 
I 
11100010 
8·bitdisp. 
I 
7/6 
LINL,23 


LooPZlLooPE 
= Loop with 
I 
11100001 
8·bltdisp. 
I 
9/6 
LINL,23 


ZerolEqual 


LooPHZ/LooPNE 
- 
Loop willie 
I 
11100000 
6·bitdisp. 
I 
9/6 
LINL,23 


Not Zero 


JeXZ = Jump on CX Zero 
I 
11100011 
6-bitdisp. 
I 
6/5 
TINT,23 


JECXZ = Jump on ECX zero 
I 
11100011 
6·bitdisp. 
I 
6/5 
TINT,23 


(Address Size PrefIX Differentiates 
JCXZ lor JECXZj 


JMP = Uncondlllonal 
Jump (within 
_ment) 


Short 
I 
11101011 
I 
6·bitdisp. 
I 
3 
7.23 


Direct 
I 
11101001 
I full displacement 
3 
7,23 


Register Indirect 
I 
11111111 
111 
100 
reg I 
5 
7,23 


Memory Indirect 
I 
11111111 
I mod 
100 
rIm I 
5 
7 


CALL = Can (within 
_ment) 


Direct 
I 
11101000 
I full displacement 
3 
7,23 


Register 
Indirect 
I 
11111111 
111 
010 
reg I 
5 
7,23 


MemolY Indirect 
I 
11111111 
I mod 
010 
rIm I 
5 
7 


RET = Return 
from CALL (within 
_ment) 
I 
11000011 
I 
5 


Adding Immediate to SP 
I 
11000010 
I 
1a-bit disp. 
I 
5 


•• 


INSTRUCTION 
FORMAT 
ClIche Hit 
Not •• 


CONTROL TRANSFER (within segment) 
(Continued) 


_. 


ENTER = Enter Procedure 
I 
11001000 
11&-bitdisp.,&-bit1ev4 


Level - 
0 
14 


Level = 1 
17 


Level (L) > 1 
17+3L 
8 


LEAVE = Leave Procedure 
I 
11001001 
I 
5 


MULTIPLE·SEGMENT 
INSTRUCTIONS 


MOV = Move 


reg. to segment reg. 
I 
10001110 
111 
sreg3 
reg I 
3/9 
RVlP, 9 


memory to segment reg. 
I 
10001110 
I mod sreg3 
rIm I 
3/9 
RVlP,9 


segment reg. to reg. 
I 
10001100 
111 
sreg3 
reg I 
3 


segment reg. to memory 
I 
10001100 
I mod sreg3 
rIm I 
3 


PUSH = Push 


segment reg. 
1000sre921101 
3 


(E5, CS, 55, or OS) 


segment reg. (FS or GS) 
I 00001111 
110 
sreg3000! 
3 


POP = Pop 


segment reg. 
000sreg2111 
3/9 
RVlP,9 


(ES, 55, or OS) 


segment reg. (FS or GS) 
00001111 
10 
sreg3001 
3/9 
RVlP,9 


LOS = Load Pointer to OS 
11000101 
mod 
reg 
rIm 
6/12 
RV/P, 9 


LES = Load Pointer to ES 
11000100 
mod 
reg 
rIm 
6/12 
RVlP, 9 


LFS - 
Load Pointer to FS 
00001111 
10110100 
mod 
reg 
rIm 1 
6/12 
RVlP, 9 


LGS - 
Load Pointer to OS 
00001111 
10110101 
mod 
reg 
rIm! 
6/12 
RVlP,9 


LSS = Loed Pointer to SS 
00001111 
10110010 
mod 
reg 
rIm! 
6/12 
RV/P, 9 


CALL = Call 


Direct intersegment 
1 10011010 
I unsigned full offset, selector 
16 
R,7,22 


to same level 
20 
P,9 


thru Gate to same level 
35 
P,9 


to inner level. no parameters 
69 
P,9 


to inner level, x parameter (d) words 
77+4X 
P,11,9 


toTSS 
37+TS 
P,10,9 


thru Task Gate 
36+TS 
P,10,9 


Indirect intersegment 
1 11111111 
I mod 011 
r/ml 
17 
R,7 


to same level 
20 
P,9 


thru Gate to same level 
35 
P,9 


to inner level, no parameters 
69 
P,9 


to inner level, x parameter (d) words 
77+4X+n 
P.ll,9 


toTSS 
37+TS 
P.l0,9 
thru Task Gata 
36+TS 
P.10,9 
RET = Retum from CALL 


intersegment 
1 11001011 
1 
13 
R,7 


to same level 
17 
P,9 
to outer level 
35 
P,9 


intersegment 
adding 
I 
11001010 
1 
16-bitdisp. 
I 
imm.toSP 
14 
R,7 


to same level 
16 
P,9 


to outer level 
36 
P,9 
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INSTRUCTION 
FORMAT 
Cache HIt 
Not •• 


MULT1PLE-sEGMENT INSTRUCTIONS (Continued) 


JMP = UncondltIofIllI 
Jump 


Direct intersegment 
1 '1101010 
I unsigned full offset, selector 
17 
R.7,22 


to same level 
19 
P,9 


thru Call Gate to same level 
32 
P,9 


thruTSS 
42+T5 
P,10,9 


thru Task Gate 
43+T5 
P,10,9 


Indirect intersegment 
1 11111111 
I mod 101 
,1m I 
13 
R,7,9 


to same level 
18 
P,9 


thru Call Gate to same level 
31 
P,9 


thru TSS 
41+TS 
P,10,9 


thru Task Gate 
42+T5 
P,10,9 


BIT MANIPULATION 


BT ~ TMtblt 


register, immediate 
1 0000'111 
I 
10111010 
111 
100 
'eel 
imm. 8-bit data 
3 


memory I immediate 
1 0000'1'1 
I 10111010 
I mod 
'00 
,1m I imm. 8-bit data 
3 


'ee1,'ee2 
1 0000'111 
110100011 
111 
'eg2 
'ee1 I 
3 


memory,'ee 
I 00001111 
I 10'00011 
I mod 
'ee 
,1m I 
8 


Inatructlon 
TTT 


BTS ~ Test Bit and Set 
101 
BTR ~ Test B~ and Reset 
110 


BTC = Test B~ and Compliment 
111 


register. immediate 
I 00001111 
I 
10111010 
111 
TTT 
'eel 
imm. 8-bit data 
6 


memory, immediate 
I 000011'1 
I 
10111010 
I mod TTT 
,1m Iimm. 8·bitdata 
8 
UIL 


'ee1, 'ee2 
100001111 
I 10TTT011 
111 
'eg2 
'ee1 I 
6 


memory, 'ee 
I 00001111 
I 'OTTT011 
I mod 
'ee 
,1m I 
13 
UIL 


&SF = scan BIt Forward 


'ee1, 'ee2 
I 00001111 
1 10111100 
111 
'eg2 
'ee1 I 
6/42 
MN/MX,12 


memory, reg 
I 00001111 
I 10111100 
I mod 
'ee 
,1m I 
7/43 
MN/MX,13 


BSR ~ scan BIt R.v •••• 


'ee1, 'eg2 
1 00001111 
I 
'0111101 
111 
,eg2 
'ee1 I 
6/103 
MN/MX,14 


memory, reg 
1 00001111 
I 
'0'11101 
I mod 
'ee 
,1m I 
71104 
MN/MX,15 


STRING INSTRUCTIONS 


CMPS = Compare Byt. 
Word 
I 1010011 
w I 
6 
16 


LOOS - 
LoMI Byt./Word 
I 1010110w 
I 
5 
to ALl AX/EAX 


MOYS = Move Byt./Word 
I 1010010w 
I 
7 
16 


SCAS = scan Byt./Word 
I 1010111 
w I 
6 


STOS - 
Store Byt./Word 
I 1010101w 
I 
5 
f,om All AX/EX 


XLA T = TranalIIl. 
String 
I 
11010111 
I 
4 


INSTRUCTION 
FORMAT 
Cache Hit 
Notes 


REPEATED STRING INSTRUCTIONS 


Repealed by Count in CX or ECX (C = Count in CX or ECX) 


REPE CMPS ~ Compare Strlng 
I 11110011 
I 
1010011 
w I 


(Find Non-Match) 
C~O 
5 
C>O 
7+7c 
16,17 


REPNE CMPS ~ Compare Strlng 
I 11110010 
I 1010011 
w I 
(Find Match) 


C=O 
5 
C>O 
7+7c 
16,17 


REP lODS 
= load 
Siring 
I 
11110011 
I 
1010110w 
I 
C-O 
5 
C>O 
7+4c 
16,16 


REP MOYS ~ Moya String 
I 
11110011 
I 
1010010w 
I 
C=O 
5 
C~1 
13 
16 
C>1 
12+3c 
16,19 


REPE SCAS = SCan String 
I 
11110011 
I 
1010111 
w I 
(Find Non-All AX/EAX) 
C~O 
5 
C>O 
7+5c 
20 


REPNE SCAS ~ scan Strlng 
I 11110010 
I 
1010111 
w I 
(Find All AX/EAX) 
C=O 
5 
C>O 
7+50 
20 


REP STOS - 
Store Strlng 
I 
11110011 
I 
1010101 
w I 
C~O 
5 
C>O 
7+4c 


FLAG CONTROL 


ClC 
~ Claar Carry Flag 
11111000 
2 


STC = Set carry Flag 
11111001 
2 


CMC ~ Complement 
Carry Ftag 
11110101 
2 


ClD 
- 
Clear Dlractlon Flag 
11111100 
2 


STD ~ Set Direction Flag 
11111101 
2 


Cli 
~ Clear Intarrupt 
11111010 
5 


Enable Flag 


STI ~ Set Interrupt 
I 
11111011 
I 
5 
EnableFleg 


LAHF - 
load 
AH Into Flag 
I 10011111 
3 


SAHF = Store AH Into Flego 
I 
10011110 
2 


PUSHF ~ Puoh Flogo 
I 
10011100 
4/3 
RV/P 


POPF - 
Pop Flogo 
I 
10011101 
9/6 
RV/P 


DECIMAL ARITHMETIC 


AAA ~ ASCIt Adjuotlor 
Add 
I 00110111 
I 
3 


AAS ~ ASCII Adjuotlor 
I 00111111 
I 
3 


Subtract 


AAM = ASCII Adjuotlor 
I 
11010100 
I 
00001010 
I 
15 
Multiply 


intel~ 


INSTRUCTION 
FORMAT 
Cache Hit 
Not •• 


DECIMAL ARITHMETIC (Continued) 


AAD ~ ASCII Adjust lor 
I 
11010101 
, 
00001010 
, 
14 


Divide 


DAA = Decimal Adjust lor Add 
, 
00100111 
, 
2 


DAS = Decima' Adjust lor SUbtract 
, 
00101111 
, 
2 


PROCESSOR CONTROL INSTRUCTIONS 


HLT = Halt 
I 
11110100 
I 
4 


MOV = Move To and From ControllDebuglTeat 
Reglate •• 


CRO from register 
, 
00001111 
00100010 
111 
000 
reg , 
17 


CR2/CR3 from register 
, 
00001111 
00100010 
111 
eee 
reg , 
4 


Reg Irom CRO-3 
, 
00001111 
00100000 
11 
eee 
reg I 
4 


DRO-3 from regist", 
I 00001111 
00100011 
11 
eee 
reg I 
10 


DR6-7 from register 
00001111 
00100011 
11 
eee 
reg I 
10 


Register from DR6-7 
00001111 
00100001 
11 
eee 
reg 
9 


Register from DRO-3 
00001111 
00100001 
11 
eee 
reg 
9 


TR3 from register 
00001111 
00100110 
11 
011 
reg 
4 


TR4-7 
from register 
00001111 
00100110 
11 
eee 
reg 
4 


Register 
from TR3 
00001111 
00100100 
11 
011 
reg 
3 


Register from TR4-7 
00001111 
00100100 
11 
eee 
reg 
4 


CLTS = Clear Taak Switched Flag 
00001111 
00000110 
7 


INVD - 
Invalidate Data Cache 
00001111 
00001000 
4 


WBINVD = Write-Back and Invalidate 
00001111 
00001001 
5 


Deta Cache 


INVLPG = Invalidate TLB Entry 


INVLPG memory 
I 00001111 
, 
00000001 
I mod 111 
rim' 
12/11 
H/NH 


PREFIX BYTES 


Add •••• 
SIze Prefix 
I 01100111 
I 
1 


LOCK = Bus Lock Prefix 
, 
11110000 
I 
1 


Operand Size Prefix 
I 01100110 
I 
1 


Segment Override Prefix 


cs: 
, 
00101110 
I 
1 


DS: 
I 00111110 
I 
1 


ES: 
I 00100110 
I 
1 


FS: 
I 01100100 
I 
1 


GS: 
I 01100101 
, 
1 


SS: 
I 00110110 
, 
1 


• 


INSTRUCTION 
FORMAT 
ClIChe Hit 
Not •• 


PROTECTION 
CONTROL 


ARPL ~ AdJuet Requ •• tecl Prtvllege 
Level 


From register 
I 01100011 
111 
regl 
reg21 
9 


From memory 
I 0110001 
I I mod 
reg 
rim I 
9 


LAR = Loecl Acceaa 
Rlghta 


From register 
I 00001111 
I 00000010 
111 
regl 
reg21 
11 


Frommemocy 
I 00001111 
I 00000010 
I mod 
reg 
rim I 
11 


LGOT ~ LoecI GIobllI Deacrtptor 


Table register 
I 00001111 
I 00000001 
ImodOl0 
rim I 
12 


L10T ~ LOlld Interrupt 
Deacrtptor 


Table register 
I 00001111 
I 00000001 
I mod 
all 
rim I 
12 


lLDT 
= LOlld locIIl 
Deacrtptor 


Table register from reg. 
I 
00001111 
I 00000000 
111 
010 
reg I 
11 


Table register from memo 
I 
00001111 
I 
00000000 
ImodOl0 
rim I 
11 


LMSW ~ Load Machine 
Statua 
Word 


From register 
I 
00001111 
I 00000001 
III 
110 
reg I 
13 


From memory 
I 
00001111 
I 00000001 
I mod 
110 
rim I 
13 


LSL ~ lOlld segment 
Limit 


From register 
I 00001111 
I 
00000011 
III 
regl 
reg21 
10 


From memory 
I 00001111 
I 
00000011 
I mod 
reg 
r/ml 
10 


LTR = lOlld Taak Reglater 


From Register 
I 
00001111 
I 
00000000 
111 
011 
reg I 
20 


From Memocy 
I 00001111 
I 00000000 
I mod 
011 
rim I 
20 


SGOT ~ Store 
GIobllI Deacrtptor 
Table 
I 00001111 
I 00000001 
I mod 
000 
rim I 
10 


SlOT ~ StOAllnterrupt 
Deacrtptor 
Table 
I 00001111 
I 
00000001 
I mod 
001 
r/ml 
10 


SLOT ~ SlOAlloclll 
Deacrtptor 
Table 


To register 
I 00001111 
I 
00000000 
III 
000 
reg I 
2 


Tomemocy 
I 00001111 
I 
00000000 
I mod 
000 
rim I 
3 


SMSW = StOAl Machine 
Statua 
Word 


To register 
I 
00001111 
I 00000001 
III 
lOa 
reg I 
2 


TomemOf)' 
I 
00001111 
I 00000001 
I mod 
100 
rim I 
3 


STR = Store 
Taak Reglater 


To register 
I 00001111 
I 
00000000 
III 
001 
reg I 
2 


To memory 
I 
00001111 
I 
00000000 
I mod 
001 
rim I 
3 


VERR ~ Vertty Read Acce •• 


Register 
I 00001111 
I 00000000 
III 
100 
rim I 
II 


Memory 
I 00001111 
I 00000000 
I mod 
100 
rim I 
II 


VEJlW - 
Vertty Wrtte Ace ••• 


To register 
I 
00001111 
I 
00000000 
111 
101 
reg I 
11 


To memory 
I 00001111 
I 00000000 
I mod 
101 
rim I 
11 
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INTERRUPT INSTRUCTIONS 


INT n - 
Interrupt 
Type n 
I 11001101 
I 
type 
I 
INT+4/0 
AV/P,21 


INT 3 - 
InterruptType 
3 
I 
11001100 
I 
INT+O 
21 


INTO ~ Interrupt4 
If 
I 
11001110 
I 
Overflow 
Flag sat 


Taken 
INT+2 
21 
NotTaken 
3 
21 


BOUND - 
Interrupt 
5 If Detect 
I 01100010 
I mod 
reg 
rim I 
Value Out Range 


If in range 
7 
21 


If out of range 
INT+24 
21 


IRET = Interrupt 
Return 
I 
11001111 
I 


Aeal ModeNirtual 
Mode 
15 
Protected Mode 


To same level 
20 
9 


To outer level 
36 
9 


To nested task (EFLAGS.NT = 1,> 
TS+32 
9,10 


Extemallnterrupt 
INT+11 
21 


NMI - 
Non-Maakabla 
Interrupt 
INT+3 
21 


Page Fault 
INT+24 
21 


VM86 ExcepUona 


ell 
INT+8 
21 


STI 
INT+8 
21 


INTn 
INT+9 


PUSHF 
INT+9 
21 
POPF 
INT+8 
21 
IAET 
INT+9 
IN 
Fixed Port 
INT+SO 
21 


Variable Port 
INT+51 
21 
OUT 


Fixed Port 
INT+SO 
21 
Variable Port 
INT+S1 
21 
INS 
INT+SO 
21 
OUTS 
INT+SO 
21 
AEP INS 
INT+S1 
21 
AEPOUTS 
INT+S1 
21 


Task Switch Clock Counts Table 


Method 
Value forTS 
Cache Hit 


VM/intel486 
DX2 CPU/286 
TSS To Intel486 
DX2 CPU TSS 
162 
VM/intel486 
DX2 CPU/286 
TSS To 286 TSS 
143 
VM/intel486 
DX2 CPU/286 
TSS To VM TSS 
140 


• 


Interrupt 
Clock Counts Table 


Method 
Value for INT 


Cache 
Hit 
Notes 


Real Mode 
26 


Protected 
Mode 
Interrupt/Trap 
gate, same level 
44 
9 


Interrupt/Trap 
gate, different 
level 
71 
9 


Task Gate 
, 
37 + TS 
9, 10 


Virtual Mode 
Interrupt/Trap 
gate, different 
level 
82 


Task gate 
37 + TS 
10 


Abbreviations 
16/32 
U/L 
MN/MX 
LINL 
RV/P 
R 
P 
TINT 
HINH 


Definition 
16/32 bit modes 
unlocked/locked 
minimum/ maximum 
loop/no loop 
real and virtual mode/protected mode 
real mode 
protected mode 
taken/not taken 
hit/no hit 


NOTES: 
1. Assuming that the operand address and stack address fall in different cache sets. 
2. Always locked, no cache hit case. 
3. Clocks = 10 + max(log2(lml),n) 
m = multiplier value (min clocks for m=0) 
n = 3/5 for ±m 


4. Clocks = (quotient(count/operand length)I'7+9 


= 8 if count,,; 
operand length (8/16/32) 
5. Clocks = {quotient(count!operand length)I'7+9 


= 9 if count,,; 
operand length (8/16/32) 


6. Equal/not equal cases (penalty is the same regardless of lock). 
7. Assuming that addresses for memory read (for indirection), stack push/pop, and branch fall in different cache sets. 
8. Penalty for cache miss: add 6 clocks for every 16 byles copied to new stack frame. 
9. Add 11 clocks for each unaccessed descriptor load. 
10. Refer to task switch clock counts table for value of TS. 
11. Add 4 ex1raclocks to the cache miss penalty for each 16 byles. 
For notes 12-13: (b = 0-3, non·zero byle number); 


(i = 0-1, non·zero nibble number); 
(n = 0-3, non bit number in nibble); 


12. Clocks = 8+4 
(b+ 1) + 3(i+ 1) + 3(n+ 1) 


= 6 if second operand = 0 
13. Clocks = 9+4(b+1) 
+ 3(i+1) 
+ 3(n+1) 
= 7 if second operand = 0 
For notes 14-15: (n = bit position 0-31) 
14. Clocks = 7 + 3(32 - n) 
6 if second operand = 0 
15. Clocks = 8 + 3(32 - n) 
7 if second operand = 0 
16. Assuming that the two string addresses fall in different cache sets. 
17. Cache miss penalty: add 6 clocks for every 16 byles compared. Entire penalty on first compare. 
18. Cache miss penalty: add 2 clocks for every 16 byles of data. Entire penalty on first load. 
19. Cache miss penalty: add 4 clocks for every 16 byles moved. 


(1 clock for the first operation and 3 for the second) 
20. Cache miss penalty: add 4 clocks for every 16 byles scanned. 
(2 clocks each for first and second operations) 
21. Refer to interrupt clock counts table for value of INT 
22. Clock count includes one clock for using both displacement and immediate. 
23. Refer to assumption 6 in the case of a cache miss. 


inteJ~ 


Re.1 
Protected 
Protected 
Vlrtu.lell 
INSTRUCTION 
FORMAT 
Mode 
Mode 
Mode 
Mode 
Note. 


(CPL,;;IOPL) (CPL>IOPL) 


110 INSTRUCTIONS 


IN ~ Input from: 


Fixed Port 
1111001 
Ow I 
pert number I 
17 
12 
32 
30 


Variable Port 
1111011 
Ow I 
17 
11 
31 
30 


OUT ~ Output to: 


Fixed Port 
11110011 
w I 
port number I 
19 
14 
34 
32 


Variable Port 
I 1110111 
wi 
19 
13 
33 
32 


INS ~ Input Byte/Word 
0110110w 
20 
13 
35 
33 
Irom DX Port 


OUTS - 
Output Byte/Word 
0110111 
w 
20 
13 
35 
33 
1 


to DX Port 


REP INS = Input String 
I 11110011 
1011011 
Ow I 
19+110 
13+110 
33+110 
32+110 
2 


REP OUTS ~ Output String 
I 11110011 
10110111 
wi 
20+8c 
14+8c 
34+8c 
33+80 
3 


NOTES: 
1. Two clock 
cache 
miss penalty 
in all cases. 


2. c = count 
in CX or ECX. 
3. Cache 
miss penalty 
in all modes: 
Add 2 clocks 
for every 
16 bytes. 
Entire penalty 
on second 
operation. 


Cache Hit 


INSTRUCTION 
FORMAT 
Avg(Lower 
Notea 


Range •.• 
Upper Range) 


DATA TRANSFER 


FLD ~ Real Load to ST(O) 


32-bit memory 
111011 
001!mOd 
000 
r/ml 
s-i-b/disp. 
I 
3 


54-bit memory 
111011 
1011 
mod 000 
rlml 
s-i-bl disp. 
1 
3 


aO-bit memory 
111011 
011 
1mod 
101 
rim I 
s-i-bldisp. 
1 
6 


ST(i) 
111011 
001111000 
ST(i) I 
4 


FILD ~ Integer Load to ST(O) 


16-bit memory 
111011 
1111 
mod 000 
r/ml 
s-i-bl disp. 
I 
14.5(13-16) 


32-bit memory 
111011 
0111 
mod 000 
r/ml 
s-i-bl disp. 
1 
11.5(9-12) 


54-bit memory 
1110'1 
1111 
mod 
101 
rim! 
s-i·bl diap. 
1 
16.8(10-18) 


FBLD ~ BCD Load to ST(O) 
111011 
111 
1mod 
100 
rim I 
s-i-bldiap. 
1 
75(70-103) 


FST ~ Store Real from ST(O) 


32-bit memory 
1'1011 
0011mod 
010 
r/ml 
s-i-b/diap. 
1 
7 
1 


64-bit memory 
111011 
1011 
mod 010 
r/ml 
9-i-b/disp. 
I 
8 
2 


ST(i) 
11'011 
101111010 
ST(i) 1 
3 


FSTP - 
Store Real from ST(O)and Pop 


32-bit memory 
111011 
0011mod 
011 
rim! 
a-i-bl disp. 
1 
7 
1 


64-bit memory 
111011 
1011 
mod 011 
rim 1 
s·j-bl disp. 
1 
8 
2 


aD-bit memory 
1'101' 
01' 
1mod 
111 
rim 1 
a·i-b/diap. 
1 
6 


ST(i) 
1'101' 
'0' 
111001 
ST(i) 1 
3 


FIST ~ Store Integer from ST(O) 


16-bit memory 
111011 
1111 
mod 
010 
r/ml 
s-i-bldisp. 
I 
33.4(29-34) 


32-bit memory 
111011 
011! 
mod 010 
rlml 
a-i-b/diap. 
I 
32.4(28-34) 


FISTP ~ Store Integer from ST(O)and Pop 


16-bit memory 
111011 
111 
mod 011 
rim 
a-i-bl disp. 
1 
33.4(29-34) 


32·bit memory 
1110'1 
011 
I'[)od 011 
rim 
a-i-bl diap. 
I 
33.4(29-34) 


54-bit memory 
111011 
111 
mod 
111 
rim 
a-i-b/disp. 
I 
33.4(29-34) 


FBSTP ~ Store BCD from 
111011 
111 
mod 
110 
rim 
s-i-bl disp. 
I 
175(172-176) 


ST(O)and Pop 


FXCH ~ Exchange ST(O)and ST(I) 
111011 
001 
11001 
ST(i) 
4 


COMPARISON INSTRUCTIONS 


FCOM ~ Compare ST(O)with Real 


32-bit memory 
111011 
ooolmod 
010 
rim 1 
s-i-b/diap. 
1 
4 


54-bit memory 
1110'1 
100lmod 
010 
rim 1 
s-i-bl disp. 
1 
4 


ST(i) 
111011 
000111010 
ST(i) I 
4 


FCOMP ~ Compare ST(O)with Real and Pop 


32-bit memory 
111011 
oooimod 
011 
rim 1 
s-i-bl disp. 
1 
4 


64-bit memory 
111011 
100lmod 
011 
rim 1 
a-i-bl diap. 
I 
4 


ST(i) 
111011 
000111011 
ST(i) I 
4 


CecheHlt 


INSTRUCTION 
FORMAT 
AVll(Lower 
Not •• 


Rellll" 
••• 


Upper Renge) 


COMPARISON INSTRUCTIONS (Continued) 


FCOMPP ~ Com ••••• ST(O)with 
1" 
0 11 
110 
1" 
0 1 
10011 
5 


ST(I) end Pop Twice 


FICOM = Compe •• ST(O)with Integer 


16-bitmemory 
111011 
1101 
mod 010 
rIm I 
s-i-b/disp. 
I 
18(16-20) 


32-bilmemory 
1,10" 
ololmodolo 
rlml 
s-i-b/disp. 
I 
16.5(15-17) 


FICOMP ~ Compe •• ST(O)with Integer 


16-bit memory 
1,101, 
110 
mod 011 
rIm 
s-i-bl disp. 
I 
18(16-20) 


32-bilmemory 
1,101, 
010 
mod 011 
rIm 
s-i-b/disp. 
I 
16.5(15-17) 


FTST = Com ••••• ST(O)with 0.0 
1,10" 
001 
1110 
0100 
4 


FUCOM ~ Unordered 
compe •• 
111011 
101 
11100 
STeil 
4 


ST(O)with ST(I) 


FUCOMP - 
U_red 
com ••••• 
1" 
0 11 
101 
11101 
ST(i) 
4 


ST(O)wlth ST(I) end Pop 


FUCOMPP = Unordered 
com ••••• 
1" 
011 
010 
1110 
10011 
5 


ST(O) with ST(I) end Pop Twice 


FXAM = Exemlne ST(O) 
1" 
011 
001 
1110 
01011 
8 


CONSTANTS 


FLDZ = Loed +0.0 Into ST(O) 
111011 
0011,1,0 
1110 
4 


FLDI 
= Loed + 1.0 Into ST(O) 
1" 
011 
0011,1,0 
1000 
4 


FLDPI = Loed 1T Into ST(O) 
1" 
011 
ood 
",0 
, 01' 
8 


FLDL2T = Loed logil0) 
Into ST(O) 
1,,0,1 
ood 
"'0 
'00' 
8 


FLDL2E = Loed logie) 
Into ST(O) 
1"011 
0011,1,0 
'010 
8 


FLDLG2 = Loed log,o(2) Into ST(O) 
1" 
011 
00,1",0 
1'00 
8 


FLOLN2 ~ Loed 1Olle(2)Into ST(O) 
1"0,, 
00,1",0 
"0' 
8 


ARITHMETIC 


FADD - 
Add Reel with ST(O) 


ST(O)+- ST(O) + 32-bi1memory 
1'101' 
OOOlmod 
000 
rIm I 
s-i-b/disp. 
I 
10(8-20) 


ST(O)+- ST(O) + 64-biI memory 
I"01' 
'OOlmod 
000 
rIm I 
s-i-b/disp. 
I 
'0(8-20) 


ST(d) +- ST(O) + STeil 
I"0 1 1 
dool"ooo 
STeilI 
10(8-20) 


FADDP = Add •• el with ST(O)end 
1,10" 
,101,,000 
STeilI 
'0(8-20) 
Pop (ST(I) +- ST(O) + ST(Il) 


FSUB = SUbtract reel from ST(O) 


ST(O)+- ST(O) - 
32-biI memory 
1"0,, 
000 I mod 
100 
rlml 
s-i-b/disp. 
I 
'0(8-20) 


ST(O)+- ST(O) - 64-b~ memory 
111011 
100lmod 
'00 
rIm I 
s-i-b/disp. 
I 
'0(8-20) 


ST(d) +- ST(O) - 
STeil 
I" 011 
dool,1'Od 
STeilI 
10(8-20) 


FSUBP = SUbtract reel from ST(O) 
1"0,, 
"0 
1 "'0' 
STOlI 
'0(8-20) 


end Pop (ST(I) +- ST(O) - 
ST(I) 


• 


I 
Cache Hit 


INSTRUCTION 
FORMAT- 
Avg(Lower 
Notea 


Range ..• 


Upper Range) 


ARITHMETIC 
(Continued) 


FSUBR = Subtract 
real reversed 
(Subtract 
ST(O) from raal) 


ST(O) +- 32-bit memory - 
ST(O) 
111011 
ooolmod 
101 
rlml 
s-i-b/disp. 
I 
10(8-20) 


ST(O) +- 64-bit memory - 
ST(O) 
111011 
100lmod 
101 
rlml 
s-j-b/dlsp. 
I 
10(8-20) 


ST(d) +- ST(i) - 
ST(O) 
111011 
dOO!1110d 
ST(i) I 
10(8-20) 


FSUBRP = Subtract 
real reveraed 
111011 
110 
111100 
ST(i) I 
10(8-20) 


and Pop (ST(I) +- ST(1l - 
ST(O)) 


FMUL ~ Multiply 
real with ST(O) 


ST(O) +- ST(O) x 32-bit memory 
111011 
000 I mod 
001 
rim I 
s-i-bl disp. 
I 
11 


ST(O) +- ST(O) x 64-bit memory 
111011 
100lmod 
001 
rim I 
s-j-bl disp. 
I 
14 


ST(d) +- ST(O) x ST(i) 
111011 
dool11001 
ST(i) I 
16 


FMULP ~ Multiply 
ST(O) with ST(I) 
111011 
110 
11 1001 
ST(i) I 
16 


and Pop (ST(I) +- ST(O) x ST(I)) 


FDIV = Divide ST(O) by Real 


ST(O) +- ST(0)/32-bit 
memory 
111011 
000 I mod 
110 
rim I 
s-i-bl disp. 
I 
73 
3 


ST(O) +- ST(O)/54-bit 
memory 
111011 
100lmod 
110 
rim I 
s-i-bl disp. 
I 
73 
3 


ST(d) +- ST(O)/ST(i) 
111011 
dool1111d 
ST(i) I 
73 
3 


FDIVP = Divide ST(O) by ST(I) and 
111011 
110 
111111 
ST(i) I 
73 
3 


Pop (ST(I) +- ST(O)/ST(I)) 


FDIVR ~ Divide real reveraed 
(ReaIlST(O» 


ST(O) +- 32-bit memoryIST(O) 
111011 
ooolmod 
111 
rlml 
s-i-bl disp. 
I 
73 
3 


ST(O) +- 54-bit memoryIST(O) 
111011 
100lmod 
111 
rlml 
s-i-b/disp. 
I 
73 
3 


ST(d) +- ST(i)/ST(O) 
111011 
dOO 11111 
d 
ST(i) I 
73 
3 


FDIVRP = Divide 
real reversed 
and 
111011 
110 
11111 
0 
ST(i) I 
73 
3 


Pop (ST(I) +- ST(I)/ST(O)) 


FIADD ~ Add Integer 
to ST(O) 


ST(O) +- ST(O) + 16-bit memory 
111011 
1101 
mod 
000 
rIm I 
s-i-bl disp. 
I 
24(20-35) 


ST(O) +- ST(O) + 32-bit memory 
111011 
0101 
mod 
000 
rIm I 
s-i-b/disp. 
I 
22.5(19-32) 


FISUB ~ Subtract 
Integer 
from ST(O) 


ST(O) +- ST(O) - 
16-bit memory 
111011 
1101 
mod 
100 
rIm I 
s-i-bl disp. 
I 
24(20-35) 


ST(O) +- ST(O) - 
32-bit memory 
111011 
ololmod 
100 
rlml 
s-i-b/disp. 
I 
22.5(19-32) 


FISUBR ~ Integer 
SUbtract 
Reversed 


ST(O) +- 16-bit memory 
- 
ST(O) 
111011 
1101 
mod 
101 
rlml 
s-i-b/disp. 
I 
24(20-35) 


ST(O) +- 32·bit memory 
- 
ST(O) 
111011 
010lmod 
101 
rIm I 
s-i-b/disp_ 
I 
22.5(19-32) 


FIMUL ~ Multiply 
Integer 
with ST(O) 


ST(O) +- ST(O) x 16-bit memory 
111011 
1101 
mod 
001 
rIm I 
s-i~b/disp. 
I 
25(23-27) 


ST(O) +- ST(O) x 32-bit memory 
111011 
ololmod 
001 
rlml 
s-i-b/ disp. 
I 
23.5(22-24) 


FIDIV ~ Integer 
Divide 


ST(O) +- ST(0)/16-bit 
memory 
111011 
1101 
mod 
110 
rlml 
s-i-b/disp. 
I 
87(85-89) 
3 


ST(O) +- ST(0)/32-bit 
memory 
111011 
0101 
mod 
110 
rlml 
s-i-b/disp. 
I 
85.5(84-86) 
3 


Cach.HIt 


INSTRUCTION 
FORMAT 
Avg(Low.r 
Not •• 


Rang •.•. 
Upper Rang.) 


ARITHMETIC (Continued) 


FIDIVR ~ Integer Divide Reveraed 


ST(O)+- 16-bit memoryIST(O) 
111011 
1101 
mod 
111 
rim 
s-i-bl disp. 
1 
87(85-89) 
3 


ST(O)+- 32-bit memoryIST(O) 
111011 
010lmod 
111 
rim 
s-i-b/disp. 
1 
85.5(84-86) 
3 


FSQRT = Squar. 
Root 
111011 
00111111 
1010 
85.5(83-87) 


FSCALE = Scale ST(O) by ST(1) 
111011 
00111111 
1101 
31(30-32) 


FXTRACT = Extract componenta 
111011 
00111111 
0100 
19(16-20) 


oIST(O) 


FPREM - 
Partial Reminder 
11011 
00111111 
10001 
84(70-138) 


FPREM1 ~ Partial R.mlnd.r 
(IEEE) 
11011 
00111111 
01011 
94.5(72-167) 


FRNDINT = Round ST(O) to Int.g.r 
11011 
001 
11111 
1100 I 
29.1(21-30) 


FABS ~ Abaolute 
valu. 
01 ST(O) 
11011 
00111110 
00011 
3 


FCHS = Change algn 01 ST(O) 
11011 
00111110 
00001 
6 


TRANSCENDENTAL 


FCOS ~ Coalne 01 ST(O) 
11011 
00111111 
1111 
241(193-279) 
6,7 


FPTAN = Partial tangent 01 ST(O) 
11011 
001 
11111 
0010 
244(200-273) 
6,7 


FPATAN ~ Partial arctangent 
11011 
00111111 
0011 
289(218-303) 
6 


FSIN = Sine 01 ST(O) 
11011 
001 
11111 
1110 
241(193-279) 
6,7 


FSINCOS = Sin. and coalne 01 ST(O) 
11011 
00111111 
1011 
291(243-329) 
6,7 


F2XM1 = 2"T(O) - 
1 
11011 
00111111 
0000 
242(140-279) 
6 


FYL2X ~ ST(1) x 100021ST(0» 
11011 
00111111 
0001 
311(196-329) 
6 


FYL2XP1 = ST(1) x 100021ST(0)+ 1.0) 
11011 
001 
11111 
1001 
313(171-326) 
6 


PROCESSOR CONTROL 


FINIT ~ Inltlallz. 
FPU 
111011 
01111110 
0011 
1 
17 
4 


FSTSW AX - 
Store ltatUI 
word 
111011 
111 
11110 
00001 
3 
5 
Into AX 


FSTSW = Store ltatua 
word 
111011 
101 
mod 
111 
rim I 
s-i-bl disp. 
1 
3 
5 


Into memory 


FLDCW ~ Load control 
word 
111011 
001 
mod 
101 
rim 
s-i-b/disp. 
1 
4 


FSTCW = Store control 
word 
111011 
001 
mod 
111 
rim 
s·j·b/disp. 
1 
3 
5 


FCLEX ~ Clear exceptlonl 
111011 
011 
1110 
0010 
7 
4 


FSTENV ~ Store .nvlronm.nt 
111011 
001 
mod 
110 
rim 
s-i-b/disp. 
1 
Real and Virtual modes 1&-bit Address 
67 
4 


Real and Virtual modes 32-bit Address 
67 
4 


Protected mode 16-bit Address 
56 
4 


Protected mode 32-bi1 Address 
56 
4 


FLDENV = Load environment 
111011 
0011mod 
100 
rim 1 
s-i-b/ disp. 
1 
Real and Virtual modes 16-bit Address 
44 


Real and Virtual modes 32-bit Address 
44 


Protected mode 16-bit Address 
34 


Protected mode 32-bit Address 
34 


• 


c.etle 
Hit 


INSTRUCTION 
FORMAT 
AvglLower 
Not •• 
Aange ••• 
Upper Aange) 


PROCESSOfI 
CONTROL 
(Continued) 


FSAVE = SIlva agt. 
11 1011 
1011mod110 
r/ml 
s-i-b/disp. 
I 
Real and Virtual modes 16-bit Address 
154 
4 
Real and Virtual modea 32-bit Address 
154 
4 
Protected mode 16-bit Addrass 
143 
4 
Protected mode 32-bit Address 
143 
4 


FRSTOR - 
Aeef •••• _ 
111011 
10d 
mod 
100 
rim I 
s+bl 
I 
Real and Virtual modea 16-bit Address 
131 
Real and Virtual modes 32-bit Address 
131 
Protected mode 16-bit Address 
120 
Protected mode 32-bit Address 
120 


FINCSTP 
- 1_ 
S_ 
PoInter 
11 to 
11 
001 
1111 
01111 
3 


fDECSTP 
= Decrement a- PoInter 
11 1 0 1 1 
001 
1111 
01101 
3 


FFREE = Free STII) 
111011 
101 
11000 
STeil I 
3 


FNOP-No 
__ 
111011 
001 
1101 
00001 
3 


WAIT = Walt until FPU rMdy 
I 
10011011 
(Minimum/Maximum) 
1/3 


NOTES: 
1. If operand 
is 0 clock 
counts 
= 27. 
2. If operand 
is 0 clock 
counts 
= 28. 
3. If CW.PC 
indicates 
24 bit precision 
then subtract 
38 clocks. 
If CW.PC 
indicates 
53 bit precision 
then subtract 
11 clocks. 
4. If there 
is a numeric 
error pending 
from a previous 
instruction 
add 17 clocks. 
5. If there 
is a numeric 
error pending 
from a previous 
instruction 
add 18 clocks. 
6. The INT pin is polled 
several 
times while this instruction 
is executing 
to assure 
short 
interrupt 
latency. 
7_ If ABS(operand) 
is greater 
than 1r/4 then add n clocks. 
Where 
n = (operand/(1r/4». 


10.2.1 
OVERVIEW 


All instruction 
encodings 
are subsets 
of the general 
instruction 
format 
shown 
in Figure 
10.1. Instructions 
consist of one or two primary opcode 
bytes, possibly 
an address 
specifier 
consisting 
of the 
"mod 
rIm" 
byte and "scaled 
index" 
byte, a displacement 
if re- 
quired, and an immediate 
data field if required. 


Within 
the primary 
opcode 
or opcodes, 
smaller 
en- 
coding 
fields 
may be defined. 
These 
fields vary ac- 
cording 
to the class of operation. 
The fields 
define 
such information 
as direction 
of the operation, 
size 
of the displacements, 
register 
encoding, 
or sign ex- 
tension. 


Almost 
all 
instructions 
referring 
to 
an operand 
in 
memory 
have 
an addressing 
mode 
byte 
following 
the primary 
opcode 
byte(s). This byte, the mod rIm 
byte, specifies 
the address 
mode to be used. certain 
encodings 
of the 
mod rIm 
byte indicate 
a second 


addressing 
byte, the scale-index-base 
byte, follows 
the 
mod rIm 
byte 
to fully 
specify 
the 
addressing 
mode. 


Addressing 
modes 
can include 
a displacement 
im- 
mediately 
following 
the mod rIm byte, or scaled 
in- 
dex byte. If a displacement 
is present, 
the possible 
sizes are 8, 16 or 32 bits. 


If the 
instruction 
specifies 
an immediate 
operand, 
the 
immediate 
operand 
follows 
any 
displacement 
bytes. The immediate 
operand, 
if specified, 
is always 
the last field of the instruction. 


Figure 
10.1 illustrates 
several 
of the fields that can 
appear 
in an instruction, 
such as the mod field and 
the rIm field, but the Figure does not show all fields . 
Several 
smaller fields also appear 
in certain 
instruc- 
tions, 
sometimes 
within 
the 
opcode 
bytes 
them- 
selves. 
Table 
10.4 is a complete 
list of all fields ap- 
pearing 
in the Intel486 
DX2 microprocessor 
instruc- 


tion set. Further ahead, following 
Table 10.4, are de- 


tailed tables for each field. 


IT T T T T T T TIT 
T T T T T T T I mod T T T rIm I ss index base Id32 116 18 1none data32 
116 I 8 1none 


opcode 
(one or two bytes) 
(T represents 
an 
opcode 
bit.) 


"mod rIm" 
byte 


<>,;\7653201\7653201\ 
1 \ 
. 
. 
'--------- 
us..i-b" 


byte 
immediate 
data 
(4, 2, 1 bytes 
or none) 
register and address 
mode specifier 


address 
displacement 
(4, 2, 1 bytes 
or none) 


Field Name 
Description 
Number 
of Bits 


w 
Specifies 
if Data is Byte or Full Size (Full Size is either 16 or 32 Bits) 
1 
d 
Specifies 
Direction 
of Data Operation 
1 


s 
Specifies 
if an Immediate 
Data Field Must be Sign-Extended 
1 


reg 
General 
Register 
Specifier 
3 


mod rIm 
Address 
Mode Specifier 
(Effective 
Address 
can be a General 
Register) 
2 for mod; 
3 for rIm 
ss 
Scale Factor for Scaled Index Address 
Mode 
2 
index 
General 
Register to be used as Index Register 
3 
base 
General Register 
to be used as Base Register 
3 


sreg2 
Segment 
Register 
Specifier 
for CS, SS, OS, ES 
2 
sreg3 
Segment 
Register 
Specifier 
for CS, SS, OS, ES, FS, GS 
3 


mn 
For Conditional 
Instructions, 
Specifies 
a Condition 
Asserted 
or a Condition 
Negated 
4 


NOTE: 
Tables 
10.1-10.3 
show encoding 
of individual 
instructions. 


• 


10.2.2 32-BIT EXTENSIONS OF THE 
INSTRUCTION SET 


The Intel486 DX2 supports allintel486 extensions to 
the 8086/80186/80286 
instruction set. 


With the Intel486 microprocessor, the 8086/801861 
80286 instruction set was extended in two orthogo- 
nal directions: 32-bit forms of all 16-bit instructions 
are added to support the 32-bit data types, and 
32-bit addressing modes are made available for all 
instructions referencing memory. This orthogonal in- 
struction set extension is accomplished having a De- 
fault (D) bit in the code segment descriptor, and by 
having 2 prefixes to the instruction set. 


Whether the instruction defaults to operations of 16 
bits or 32 bits depends on the setting of the 0 bit in 
the code segment descriptor, which gives the de- 
fault length (either 32 bits or 16 bits) for both oper- 
ands and effective addresses when executing that 
code segment. In the Real Address Mode or Virtual 
8086 Mode, no code segment descriptors are used, 
but a 0 value of 0 is assumed internally by the In- 
tel486 DX2 microprocessor when operating in those 
modes (for 16-bit default sizes compatible with the 
8086/80186/80286). 


Two prefixes, the Operand Size Prefix and the Effec- 
tive Address Size Prefix, allow overriding individually 
the Default selection of operand size and effective 
address size. These prefixes may precede any op- 
code bytes and affect only the instruction they pre- 
cede. If necessary, one or both of the prefixes may 
be placed before the opcode bytes. The presence of 
the Operand Size Prefix and the Effective Address 
Prefix will toggle the operand size or the effective 
address size, respectively, to the value "opposite" 
from the Default setting. For example, if the default 
operand size is for 32-bit data operations, then pres- 
ence of the Operand Size Prefix toggles the instruc- 
tion to 16-bit data operation. As another example, if 
the default effective address size is 16 bits, pres- 
ence of the Effective Address Size prefix toggles the 
instruction to use 32-bit effective address computa- 
tions. 


These 32-bit extensions are available in all Intel486 
microprocessor modes, including the Real Address 
Mode or the Virtual 8086 Mode. In these modes the 
default is always 16 bits, so prefixes are needed to 
specify 32-bit operands or addresses. For instruc- 
tions with more than one prefix, the order of prefixes 
is unimportant. 
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Unless specified otherwise, instructions with 8-bit 
and 16-bit operands do not affect the contents of 
the high-order bits of the extended registers. 


10.2.3 ENCODING OF INTEGER 
INSTRUCTION FIELDS 


Within the instruction are several fields indicating 
register selection, addressing mode and so on. The 
exact encodings of these fields are defined immedi- 
ately ahead. 


For any given instruction performing a data opera- 
tion, the instruction is executing as a 32-bit operation 
or a 16-bit operation. Within the constraints of the 
operation size, the w field encodes the operand size 
as either one byte or the full operation size, as 
shown in the table below. 


Operand Size 
Operand Size 
wFleld 
During 16-Blt 
During 32-Blt 
Data Operations 
Data Operations 


0 
8 Bits 
8 Bits 
1 
16 Bits 
32 Bits 


10.2.3.2 Encoding of the General 
Register (reg) Field 


The general register is specified by the reg field, 
which may appear in the primary opcode bytes, or as 
the reg field of the "mod rim" byte, or as the rim 
field of the "mod rim" byte. 


Encoding of reg Field When w Field 
Is not Present In Instruction 


Register selected 
Register selected 
reg Field 
During 16-Blt 
During 32-Blt 
Data Operations 
Data Operations 


000 
AX 
EAX 
001 
CX 
ECX 
010 
OX 
EDX 
011 
BX 
EBX 
100 
SP 
ESP 
101 
BP 
EBP 
110 
SI 
ESI 
111 
01 
EDI 


Encoding of reg Field When w Field 
Is Present In Instruction 


Register Specified by reg Field 
During 16-Blt Data Operations: 


Function of w Field 
reg 
(whenw 
= 0) 
(when w = 1) 


000 
Al 
AX 


001 
Cl 
CX 


010 
ol 
OX 
011 
Bl 
BX 
100 
AH 
SP 


101 
CH 
BP 


110 
oH 
SI 


111 
BH 
01 


Register Specified by reg Field 
During 32·Blt Data Operations 


Function of w Field 
reg 
(whenw 
= 0) 
(whenw 
= 1) 


000 
Al 
EAX 
001 
Cl 
ECX 
010 
ol 
EoX 


011 
Bl 
EBX 


100 
AH 
ESP 
101 
CH 
EBP 
110 
oH 
ESI 


111 
BH 
Eol 


10.2.3.3 Encoding of the Segment 
Register (sreg) Field 


The sreg field in certain instructions is a 2-bit field 
allowing one of the four 80286 segment registers to 
be specified. The sreg field in other instructions is a 
3-bit field, alloWingthe Intel486 OX2 Microprocessor 
FS and GS segment registers to be specified. 


2·Blt 
segment 


areg2Fleid 
Register 
Selected 


00 
ES 
01 
CS 


10 
SS 
11 
OS 


3·Blt 
Segment 


sreg3 Field 
Register 
Selected 


000 
ES 


001 
CS 


010 
SS 


011 
OS 


100 
FS 


101 
GS 


110 
do not use 
111 
do not use 


10.2.3.4 Encoding of Address Mode 


Except for special instructions, such as PUSH or 
POP,where the addressing mode is pre-determined, 
the addressing mode for the current instruction is 
specified by addressing bytes following the primary 
opcode. The primary addressing byte is the "mod 
rim" byte, and a second byte of addressing informa- 
tion, the "s-i-b" 
(scale-index-base) byte, can be 


specified. 


The s-i-b byte (scale-index-base byte) is specified 
when using 32-bit addressing mode and the "mod 
rim" byte has rim = 100 and mod = 00,01 or 10. 
When the sib byte is present, the 32-bit addressing 
mode is a function of the mod, ss, index, and base 
fields. 


The primary addressing byte, the "mod rim" 
byte, 


also contains three bits (shown as TTT in Figure 
10.1) sometimes used as an extension of the pri- 
mary opcode. The three bits, however, may also be 
used as a register field (reg). 


When calculating an effective address, either 16-bit 
addressing or 32·bit addressing is used. 16-bit ad- 
dressing uses 16-bit address components to calcu- 
late the effective address while 32-bit addressing 
uses 32-bit address components to calculate the ef- 
fective address. When 16-bit addressing is used, the 
"mod rim" byte is interpreted as a 16·bit addressing 
mode specifier. When 32-bit addressing is used, the 
"mod rim" byte is interpreted as a 32-bit addressing 
mode specifier. 


Tables on the following three pages define all en- 
codings of all 16-bit addressing modes and 32-bit 
addressing modes. 


• 
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mod rIm 
Effective 
Address 


00000 
oS:[BX+SI] 
00001 
oS:[BX+ol] 
00010 
SS:[BP+SI] 
00011 
SS:[BP+ol] 
00100 
oS:[SI] 
00101 
OS:[01] 
00110 
oS:d16 
00111 
OS:[BX] 


01000 
OS:[BX + SI+ d8] 
01 001 
OS:[BX + 01+ d8] 
01010 
SS:[BP+ SI+ d8] 
01 011 
SS:[BP+ 01+ d8] 
01 100 
oS:[SI+ d8] 
01 101 
oS:[01+d8] 
01110 
SS:[BP+d8] 
01 111 
oS:[BX+d8] 


Register 
Specified 
by rIm 
During 16-81t Data Operations 


mod rIm 
Function 
of w Field 


(whenw=O) 
(whenw 
= 1) 


11000 
AL 
AX 
11 001 
CL 
CX 
11 010 
oL 
OX 
11 011 
BL 
BX 
11 100 
AH 
SP 
11 101 
CH 
BP 
11110 
oH 
SI 
11111 
BH 
01 


mod rIm 
Effective 
Address 


10000 
oS:[BX+SI+d16] 
10001 
OS:[BX+ 01+ d16] 
10010 
SS:[BP+SI+d16] 
10011 
SS:[BP+ 01+ d16] 
10100 
oS:[SI+d16] 
10 101 
oS:[01+d16] 


10110 
SS:[BP+d16] 


10111 
oS:[BX+d16] 


11000 
register-see 
below 
11 001 
register-see 
below 
11 010 
register-see 
below 
11 011 
register-see 
below 
11 100 
register-see 
below 
11 101 
register-see 
below 
11110 
register-see 
below 
11111 
register-see 
below 


Register 
Specified 
by rIm 
During 32·81t Data Operations 


mod rIm 
Function 
of w Field 


(whenw=O) 
(when w =1) 


11 000 
AL 
EAX 
11 001 
CL 
ECX 
11 010 
oL 
EoX 
11 011 
BL 
EBX 
11 100 
AH 
ESP 
11 101 
CH 
EBP 
11110 
oH 
ESI 
11111 
BH 
Eol 
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mod rIm 
Effective 
Address 


00000 
DS:[EAX) 
00001 
DS:[ECX) 
00010 
DS:[EDX) 
00011 
DS:[EBX] 
00100 
s-i-b is present 
00101 
DS:d32 
00110 
DS:[ESI) 
00 111 
DS:[EDI) 


01000 
DS:[EAX+dS) 
01001 
DS:[ECX+dS) 
01010 
DS:[EDX+dS) 
01 011 
DS:[EBX+dS) 
01 100 
s-i-b is present 
01 101 
SS:[EBP+dS) 
01 110 
DS:[ESI+dS) 
01 111 
DS:[EDI+dS) 


Register 
Specified 
by reg or rIm 
during 16-81t Data Operations: 


mod rIm 
Function 
of w field 


(whenw=O) 
(whenw=1) 


11000 
AL 
AX 


11 001 
CL 
CX 


11 010 
DL 
DX 


11 011 
BL 
BX 


11100 
AH 
SP 


11 101 
CH 
BP 


11110 
DH 
SI 


11111 
BH 
DI 


mod rIm 
Effective 
Address 


10000 
DS: [EAX + d32) 
10001 
DS: [ECX + d32) 
10010 
DS: [EDX + d32) 
10011 
DS: [EBX + d32) 
10100 
s-j-b is present 
10101 
SS: [EBP + d32) 
10110 
DS: [ESI + d32) 
10111 
DS: [EDI + d32) 


11000 
register-see 
below 
11 001 
register-see 
below 
11 010 
register-see 
below 
11 011 
register-see 
below 
11 100 
register-see 
below 
11 101 
register-see 
below 
11110 
register-see 
below 
11111 
register-see 
below 


Register 
Specified 
by reg or rIm 
during 32-81t Data Operations: 


mod rIm 
Function 
of w field 


(whenw=O) 
(when w= 
1) 


11000 
AL 
EAX 
11 001 
CL 
ECX 
11 010 
DL 
EDX 
11 011 
BL 
EBX 
11100 
AH 
ESP 
11 101 
CH 
EBP 
11110 
DH 
ESI 
11111 
BH 
EDI 
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mod base 
Effective 
Address 


00000 
os: [EAX + (scaled index)] 
00001 
OS: [ECX + (scaled index)] 
00010 
OS: [EoX + (scaled index)] 
00011 
OS: [EBX + (scaled index)] 
00100 
SS: [ESP + (scaled index)] 
00101 
OS: [d32 + (scaled index)] 
00110 
OS: [ESI + (scaled index)] 
00 111 
OS: [Eol + (scaled index)] 


01000 
OS: [EAX + (scaled index) + dB] 
01001 
OS: [ECX + (scaled index) + dB] 
01010 
OS: [EoX + (scaled index) + dB] 
01 011 
OS: [EBX + (scaled index) + dB] 
01 100 
SS: [ESP + (scaled index) + dB] 
01 101 
SS: [ESP + (scaled index) + dB] 
01 110 
OS: [ESI + (scaled index) + dB] 
01 111 
OS: [Eol + (scaled index) + dB] 


10000 
OS: [EAX + (scaled index) + d32] 


10001 
OS: [ECX + (scaled index) + d32] 


10010 
OS: [EoX + (scaled index) + d32] 


10011 
OS: [ESX + (scaled index) + d32] 


10100 
SS: [ESP + (scaled index) + d32] 
10 101 
SS: [ESP + (scaled index) + d32] 
10110 
OS: [ESt + (scaled index) + d32] 
10111 
OS: [Eol + (scaled index) + d32] 


NOTE: 
Mod field in "mod rim" 
byte; ss, index, base fields in 


"s-i-b" byte. 


ss 
Scale Factor 


00 
x1 
01 
x2 
10 
x4 
11 
xB 


Index 
Index Register 


000 
EAX 
001 
ECX 


010 
EoX 
011 
EBX 


100 
no index reg" 


101 
EBP 
110 
ESt 


111 
Eol 


"IMPORTANT 
NOTE: 
When index field is 100, indicating "no index register," then 
ss field MUST equal 00. If index is 100 and ss does not 
equal 00, the effective address is undefined. 


10.2.3.5 
Encoding 
of Operation 
Direction 
(d) Field 


In many two-operand 
instructions 
the d field is pres- 
ent 
to 
indicate 
which 
operand 
is considered 
the 
source 
and which 
is the destination. 


d 
Direction 
of Operation 


0 
Register/Memory 
<- - Register 
"reg" 
Field Indicates 
Source Operand; 
"mod 
r/m" 
or "mod ss index base" 
Indicates 
Destination 
Operand 


1 
Register <- - Register/Memory 
"reg" 
Field Indicates 
Destination 
Operand; 
"mod 
r/m" 
or "mod ss index base" 
Indicates 
Source Operand 


10.2.3.6 
Encoding 
of Sign-Extend 
(s) Field 


The s field 
occurs 
primarily 
to instructions 
with im- 
mediate 
data fields. The s field has an effect 
only if 
the size of the immediate 
data is 8 bits and is being 
placed 
in a 16-bit or 32-bit destination. 


Effect 
on 
Effect 
on 
s 
Immediate 
Immediate 
Data8 
Data 16132 


0 
None 
None 


1 
Sign-Extend 
Data8 to Fill 
None 
16-Bit or 32-Bit Destination 


10.2.3.7 
Encoding 
of Conditional 
Test 
(tUn) 
Field 


For the 
conditional 
instructions 
(conditional 
jumps 
and set on condition), 
mn is encoded 
with n indicat- 
ing to use the condition 
(n = 0) or its negation 
(n = 1), 
and m giving the condition 
to test. 


Mnemonic 
Condition 
tttn 
0 
Overflow 
0000 
NO 
No Overflow 
0001 
B/NAE 
Below/Not 
Above or Equal 
0010 
NB/AE 
Not Below/Above 
or Equal 
0011 
E/Z 
Equal/Zero 
0100 
NE/NZ 
Not Equal/Not 
Zero 
0101 
BEINA 
Below or Equal/Not Above 
0110 
NBE/A 
Not Below or.Equall Above 
0111 
S 
Sign 
1000 
NS 
Not Sign 
1001 
PIPE 
Parity/Parity Even 
1010 
NP/PO 
Not Parity/Parity Odd 
1011 
LlNGE 
Less Than/Not 
Greater or Equal 
1100 
NL/GE 
Not Less Than/Greater 
or Equal 
1101 
LE/NG 
Less Than or Equal/Greater 
Than 
1110 
NLE/G 
Not Less or Equal/Greater 
Than 
1111 


10.2.3.8 
Encoding 
of Control 
or Debug 
or Test 
Register 
(eee) Field 


For the 
loading 
and storing 
of the Control, 
Debug 
and Test registers. 


eeeCode 
Reg Name 


000 
CRO 
010 
CR2 
011 
CR3 


Do not use any other encoding 


eeeCode 
Reg Name 


000 
ORO 
001 
DR1 
010 
DR2 
011 
DR3 
110 
DR6 
111 
DR? 


Do not use any other encoding 


eeeCode' 
Reg Name 


011 
TR3 
100 
TR4 
101 
TR5 
110 
TR6 
111 
TR? 


Do not use any other encoding 


• 


Instruction 
Optional 


First Byte 
second 
Byte 
Fields 


11011 
OPA 
1 
mod 
1 
I 
OPB 
rIm 
s-i-b 
I 
disp 


11011 
MF 
OPA 
mod 
OPB 
rIm 
s-i-b 
I 
disp 


11011 
d 
P 
OPA 
1 
1 
OPB 
ST(i) 


11011 
0 
0 
1 
1 
1 
1 
I 
OP 


11011 
0 
1 
1 
1 
1 
1 
I 
OP 


10.2.4 
ENCODING 
OF FLOATING 
POINT 
INSTRUCTION 
FIELDS 
R XOR d = o-Oestination 
(op) Source 
R XOR d = 1-Source 
(op) Destination 


Instructions for the FPU assume one of the five 
forms shown in the following table. In all cases, in- 
structions are at least two bytes long and begin with 
the bit pattern 11011B. 


ST(i) = Register stack element i 
000 = Stack top 
001 = Second stack element 


OP = 
Instruction opcode, possible split into two 
fields OPA and OPB 


MF = Memory Format 
00-32-bit 
real 
01-.32·bit integer 
1Q-64-bit 
real 
11-16-bit 
integer 


mod (Mode field) and rim (Register/Memory specifi- 
er) have the same interpretation as the correspond- 
ing fields of the integer instructions. 


P=Pop 
0-00 
not pop stack 
1-Pop 
stack after operation 


d = Destination 
o-Oestination 
is ST(O) 
1-0estination 
is ST(i) 


s-i-b (Scale Index Base) byte and disp (displace- 
ment) are optionally present in instructions that have 
mod and rim fields. Their presence depends on the 
values of mod and rim, as for integer instructions. 
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11.0 DIFFERENCES 
BETWEEN THE Intel486™ 
DX2 
MICROPROCESSOR 
AND THE Intel386™ 
MICROPROCESSOR 
PLUS THE INTEL387 
MATH 
COPROCESSOR 
EXTENSION 


The differences between the Intel486 DX2 micro- 
processor and the Intel386 microprocessor are due 
to performance enhancements. The differences be- 
tween the microprocessors are listed below. 
1. Instruction clock counts have been reduced to 
achieve higher performance. See Section 10. 


2. The Intel486 DX2 microprocessor bus is signifi- 
cantly faster than the Intel386 microprocessor 
bus. Differences include an internally doubled 
clock, parity support, burst cycles, cacheable cy- 
cles, cache invalidate cycles and 8-bit bus sup- 
port. The Hardware Interface and Bus Operation 
Sections (Sections 6 and 7) of the data sheet 
should be carefully read to understand the In- 
tel486 DX2 microprocessor bus functionality. 
3. To support the on-chip cache new bits have been 
added to control register 0 (CD and NW) (Section 
2.1.2.1), new pins have been added to the bus 
(Section 6) and new bus cycle types have been 
added (Section 7). The on-chip cache needs to 
be enabled after reset by clearing the CD and 
NW bit in CRO. 


4. The complete Intel387 math coprocessor instruc- 


tion set and register set have been added. No 
1/0 cycles are performed during Floating Point 
instructions. The instruction and data pointers are 
set to 0 after FINIT/FSAVE. Interrupt 9 can no 
longer occur, interrupt 13 occurs instead. 


5. The Intel486 DX2 microprocessor supports new 
floating point error reporting modes to guarantee 
DOS compatibility. These new modes required a 
new bit in control register 0 (NE) (Section 2.1.2.1) 
and new pins (FERR# and IGNNE#) (Section 
6.2.13 and 7.2.14). 
6. In some cases FERR# is asserted when the next 
floating point instruction is encountered and in 
other cases it is asserted before the next floating 
point instruction is encountered, depending upon 
the execution state the instruction causing ex- 
ception (see Sections 6.2.13 and 7.2.14). For 
both of these cases, the Intel387 Math Coproc- 


essor asserts ERROR# when the error occurs 
and does not wait for the next floating point in- 
struction to be encountered. 
7. Six new instructions have been added: 
Byte Swap (BSWAP) 
Exchange-and-Add (XADD) 


Compare and Exchange (CMPXCHG) 


Invalidate Data Cache (INVD) 
• 
Write-back 
and 
Invalidate 
Data 
Cache 
(WBINVD) 
Invalidate TLB Entry (INVLPG) 
8. There are two new bits defined in control regis- 
ter 3, the page table entries and page directory 
entries (PCD and PWT) (Section 4.5.2.5). 
9. A new page protection feature has been added. 
This feature required a new bit in control register 
o (WP) (Section 2.1.2.1 and 4.5.3). 


10. A new Alignment Check feature has been add- 
ed. This feature required a new bit in the flags 
register (AC) (Section 2.1.1.3) and a new bit in 
control register 0 (AM) (Section 2.1.2.1). 
11. The replacement algorithm for the translation 
lookaside buffer has been changed from a ran- 
dom algorithm to a pseudo least recently used 
algorithm like that used by the on-chip cache. 
See Section 5.5 for a description of the algo- 
rithm. 
12. Three new testability registers, TR3, TR4 and 
TR5, have been added for testing the on-chip 
cache. TLB testability has been enhanced. See 
Section 8. 


13. The prefetch queue has been increased from 16 
bytes to 32 bytes. A jump always needs to exe- 
cute after modifying code to guarantee correct 
execution of the new instruction. 


14. After reset, the 10 in the upper byte of the OX 
register is 04. The contents of the base regis- 
ters including the floating point registers may be 
different after reset. 
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This chapter contains the specifications for the 
Pentium OverDrive Processor Socket for systems 
based on the Intel486 DX2 Microprocessor. All of 
the specifications described herein are based on the 
specifications of the Intel486 DX2 Microprocessor. 


One of the most important features of the Intel486 
family architecture, compared with previous Intel ar- 
chitectures, is its "end user easy" upgradability via 
the Pentium OverDrive Processor Socket. Inclusion 
of the socket in systems based on the Intel486 fami- 
ly of microprocessors provides the end user with an 
easy and cost-effective way to increase system per- 
formance. The paradigm of simply installing an addi- 
tional component into an empty socket to achieve 
enhanced system performance is familiar to the mil- 
lions of end users and dealers who have purchased 
Intel Math CoProcessor upgrades to boost system 
floating point performance. The Pentium OverDrive 
Processor will provide up to 50% integer perform- 
ance improvement and up to 150% floating point 
performance improvement over the base system 
performance. The Pentium OverDrive Processor 
takes advantage of Intel's Pentium processor tech- 
nology to provide this performance improvement. 


The Pentium OverDrive Processor will implement a 
superset of the Intel486 DX2 Microprocessor sig- 
nals. The new signals for the socket, in addition to 
the Intel486 DX2 CPU signals, support a writeback 
protocol for the on-chip cache in Intel's future proc- 
essors. Implementation of the cache writeback ca- 
pability for the Pentium OverDrive Processor Socket 
is optional, although implementation of the Level 1 
writeback protocol enables maximum performance 
gain. The signals required to implement this write- 
back are detailed in a separate document and are 
marked reserved in this databook. For more informa- 
tion, please contact Intel. 


As a new system architecture feature, the provision 
of the Pentium OverDrive Processor Socket as a 
means for PC users to take advantage of the ever 
more rapid advances in software and hardware tech- 
nology will help to maintain the competitiveness of 
Intel architecture PC-compatible systems over other 
architectures. 


The majority of upgrade installations which take ad- 
vantage of the Pentium OverDrive Processor Socket 
will be performed by end users and resellers. There- 


fore, it is important that the design be "end user 
easy", and that the amount of training and technical 
expertise required to install the Pentium OverDrive 
Processors be minimized. Upgrade installation in- 
structions should be clearly described in the system 
user's manual. In addition, by making installation 
simple and foolproof, PC manufacturers can reduce 
the risk of system damage, warranty claims and 
service calls. Feedback from Intel's Math CoProces- 
sor upgrade customers highlights three main charac- 
teristics of end user easy designs: accessible Over- 
Drive Processor Socket location, clear indication of 
upgrade component orientation, and minimization of 
insertion force. 


Upgrade 
Socket 
Location: 
The Pentium OverDrive 


Processor Socket can be located on either the 
motherboard or modular CPU card. The socket 
should be easily accessible for installation and readi- 
ly visible when the PCcase is removed. The Pentium 
OverDrive Processor Socket should not be located 
in a position that requires removal of any other hard- 
ware (such as hard disk drives) in order to install the 
Pentium OverDrive Processor. 


Component 
Orientation: 
The most common mis- 


take made by end users and resellers when install- 
ing Math CoProcessor upgrades is incorrect orienta- 
tion of the chip. This can result in irreversible dam- 
age to the chip and/or the PC. To solve this prob- 
lem, Intel has designed the Pentium OverDrive Proc- 
essor Socket and the Pentium OverDrive Processor 
with a keying mechanism to ensure proper orienta- 
tion of the upgrade component by the PC user. The 
keying mechanism for the Pentium OverDrive Proc- 
essor is four missing pins on one corner of the de- 
vice. To be effective as a keying mechanism the cor- 
responding locations in the socket must be plugged. 
The Pentium OverDrive Processor Socket is de- 
signed to be backward compatible with the 169-pin 
OverDrive Socket of Intel486 SX and Intel486 DX 
systems. In order to maintain compatibility, the Pen- 
tium OverDrive Processor Socket should include the 
Key Pin at location E5. In addition, the location of 
the key corner should be clearly marked on the 
motherboard or CPU card, for example by silk 
screening. 


Insertion 
Force: The third major concern voiced by 


end users refers to how much pressure should be 
exerted on the upgrade chip and PC board for prop- 
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er installation 
without 
damage. 
This becomes 
even 
more of a concern 
with the larger components 
which 
require 
up to 200 pounds 
of pressure 
for insertion 
into a standard 
screw machine 
socket. 
This level of 
pressure 
can 
easily 
result 
in cracked 
traces 
and 
stress 
to solder 
joints. 
To minimize 
the risk of sys- 
tem damage, 
it is recommended 
that a Zero 
Inser- 
tion 
Force 
(ZIF) 
socket 
be 
used 
for 
the 
Pentium 
OverDrive 
Processor 
Socket. 
Designing 
with a ZIF 
socket 
eliminates 
the 
need 
to design 
in additional 
structural 
support 
to prevent 
flexing of the PC board 
during installation, 
and results 
in improved 
end user 
and reseller 
product 
satisfaction 
due to easy "drop- 
in" installation. 


12.0.1 
Pentlum™ 
OVERDRIVETM 
PROCESSOR 
SOCKET 
OVERVIEW 


The 
Pentium 
OverDrive 
Processor 
Socket 
is de- 
signed such that when a Pentium OverDrive 
Proces- 
sor is installed 
in the socket, 
the original 
CPU relin- 
quishes 
control 
of the system 
to the Pentium 
Over- 
Drive Processor 
by backing 
off the bus. The circuit 
design 
requirements 
for 
the 
Pentium 
OverDrive 
Processor 
Socket 
are discussed 
in Section 
12.1. In 
addition 
to the Pentium OverDrive 
Processor 
Socket 
circuits, there are layout considerations 
for the sock- 
et and processor 
spatial requirements. 
These issues 
are discussed 
in Section 
12.2. Because 
the Pentium 
OverDrive 
Processor 
must 
function 
in the 
socket, 
the Pentium OverDrive 
Processor 
Socket 
heat dissi- 


CTRL 


ADDR 


DATA 


pation 
specifications 
must be implemented. 
Section 
12.3 shows 
the Pentium 
OverDrive 
Processor 
heat 
dissipation 
requirements 
for a hypothetical 
system 
design at 25 MHz and 33 MHz. Because 
the system 
must operate 
correctly 
with any OverDrive 
Proces- 
sor without 
a BIOS change, 
BIOS and software 
re- 
strictions 
and 
recommendations 
are 
provided 
in 
Section 
12.4. Section 
12.5 discusses 
Pentium Over- 
Drive 
Processor 
Socket 
test 
requirements. 
Finally, 
Sections 
12.6 and 12.7 specify 
the pinout 
and elec- 
trical characteristics 
of the Pentium 
OverDrive 
Proc- 
essor, respectively. 


12.1 Pentium TM OverDrive TM 
Processor Circuit Design 


The 
Pentium 
OverDrive 
Processor 
Socket 
is de- 
signed to reside on the same processor 
bus as the 
Intel486 
DX2 CPU. This socket 
specifies 
a UP# 
out- 
put (Upgrade 
Present) 
pin which should be connect- 
ed directly 
to the UP# 
input pin of the Intel486 
DX2 
Microprocessor. 
When the Pentium 
OverDrive 
Proc- 
essor 
occupies 
the socket, 
the UP# 
signal 
(active 
low) 
forces 
the 
Intel486 
DX2 
Microprocessor 
to 
3-state 
all outputs 
and reduce 
power 
consumption. 
When the Pentium OverDrive 
Processor 
is not in the 
socket, 
a pullup resistor, 
internal to the Intel486 
DX2 
Microprocessor, 
drives UP# 
inactive 
and allows the 
Intel486 
DX2 Microprocessor 
to control 
the proces- 
sor bus. 


DATA ADDR CTRL 
DATA ADDR CTRL 


7 
~ 
RESERVED 
UP# 
UP# 
, 
Pentium 
OverDrive 
Intel486 
DX2 
Processor 
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Socket 


LK 


rC>CLK 
I~CLK 
-- 


This section discusses four aspects for the Pentium 
OverDrive Processor Socket: compatibility, size, up- 
gradability, and vendors. 


The 
Pentium 
OverDrive 
Processor 
Socket 
for 
Intel486 DX2 Microprocessor-based systems is de- 
signed to be compatible with the OverDrive Proces- 
sor for Intel486 SX CPU- and Intel486 DX CPU- 
based systems. 


The Pentium OverDrive Processor Socket has a 
fourth row of contacts around the outside of the 169 
contacts defined for the Intel486 SX CPU- and 
Intel486 DX CPU-based OverDrive Processor sock- 
ets. The three inner rows, with inner key pin, are 
100% compatible with the 169-pin PGA OverDrive 
Processor, for Intel486-based systems. For back- 
ward compatibility, the inner row key pin location 
(E5) must be included in the Pentium OverDrive 
Processor Socket. 
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12.2.2 MECHANICAL DESIGN 
CONSIDERATIONS 


The Pentium OverDrive Processor is designed to fit 
in a standard 240-lead (19 x 19) PGA socket with 
four corner pins removed. The Pentium OverDrive 
Processor uses an active heat sink, and therefore, 
requires vertical clearance to allow adequate air cir- 
culation. 


The maximum and minimum dimensions of the Pen- 
tium OverDrive Processor package with a fan/heat 
sink are shown in Table 12.1. The fan/heat sink unit 
is divided into the size of the actual heat sink, and 
the required free space above the heat sink. The 
total height required for the Pentium OverDrive Proc- 
essor from the motherboard will depend on the 
height of the PGA socket. The total external height 
given in Table 12.1 is only measured from the PGA 
pin stand-ofts. Table 12.1 also details the minimum 
clearance needed around all four sides of the PGA 
package. 


Component 
Length and Width (Inches) 
Height (Inches) 
Minimum 
Maximum 
Minimum 
Maximum 


PGAPackage 
1.950 
1.975 
0.140 
0.180 
Adhesive 
N/A 
N/A 
0.008 
0.012 
HeatSinkUnit 
1.830 
1.850 
N/A 
N/A 


HeatSink 
N/A 
N/A 
0.790 
0.810 
Req'd FreeSpace 
N/A 
N/A 
0.400 
0.400 


External Total 
1.950 
1.975 
1.338 
1.402 
SpaceFromPackage 
0.200 
0.200 
N/A 
N/A 


Free 
Side 
Space 


t 
t 
0.160" 
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Since 
the 
Pentium 
OverDrive 
Processor 
dissipates 
more power than the Intel486 
CPU family members, 
it requires 
a larger cooling 
capacity. 
To facilitate 
the 
task of cooling 
the Pentium OverDrive 
Processor, 
In- 
tel will ship the product 
with a fan/heat 
sink. No ex- 
ternal 
connections 
(Le., power) 
will be required 
for 
the fan/heat 
sink. All the needed connections 
will be 
made through 
the pins of the processor. 
The amount 
of extra power needed for the fan is accounted 
for in 
the 
Ice 
numbers 
of 
the 
processor 
(see 
Section 
12.3). To ensure 
adequate 
air circulation, 
the addi- 
tional clearance 
specified 
in Table 12.1 must be pro- 
vided. 


12.2.3 
"END 
USER EASY" 
RECOMMENDATIONS 


PC buyers value easy and safe upgrade 
installation. 


PC manufacturers 
can make upgrade 
component 
in- 
stallation 
in the Pentium 
OverDrive 
Processor 
sock- 


et simple and foolproof 
for the end user and reseller 
by 
implementing 
the 
suggestions 
listed 
in Table 
12.2. 


"End User Easy" 
Implementation 
Feature 


Visible Pentium OverDrive 
The Pentium OverDrive 
Processor 
Socket should be easily visible when 
Processor 
Socket 
the PC's cover is removed. 
Label the Pentium OverDrive 
Processor 
Socket and the location 
of pin 1 by silk screening 
this information 
on the 
PC board. 


Accessible 
Pentium OverDrive 
Make the Pentium OverDrive 
Processor 
Socket easily accessible 
to the 
Processor 
Socket 
end user (Le., do not place the Pentium OverDrive 
Processor 
Socket 
under a disk drive). Be sure to leave enough clearance 
to open the Zero 
Insertion 
Force (ZIF) socket. 


Foolproof 
Chip Orientation 
This Pentium OverDrive 
Processor 
Socket 
must insure proper orientation 
of not only the Pentium OverDrive 
Processor 
but also the OverDrive 
Processor 
for Intel486 SX CPU based systems. 
The PGA package 
of the 
Pentium OverDrive 
Processor 
is oriented 
by the four corner pins that 
have been removed 
from the "pin 1" corner. These four contacts 
(A2, 


A3, B1 and C1) in the socket should be plugged, such that PGA pins 
cannot 
be inserted, to assure correct orientation. 
The 169 pin, PGA 
package 
of the OverDrive 
Processor 
for Intel486 SX CPU systems 
is 
oriented 
by the "key" 
pin located 
in the inside corner of the "pin 1" 


corner. All inside contacts 
(11 innermost 
rows) should be plugged, 


except the "key" 
pin (E5), to insure correct 
orientation 
and alignment. 


The total number of contacts 
for the Pentium OverDrive 
Processor 
Socket is therefore 
237; a standard 
240-pin socket plus the inside "key" 
pin and less the four outside corner pins. Supplying 
a 237 -pin socket as 
the Pentium OverDrive 
Processor 
Socket eliminates 
the possibility 
of end 
users or resellers 
damaging 
the PC board or the Pentium OverDrive 
Processor 
by powering 
up the system with the Pentium OverDrive 
Processor 
in an incorrect 
orientation. 


Zero Insertion 
Force 
The high pin count of the Pentium OverDrive 
Processor 
makes the 
Pentium OverDrive 
insertion 
force required for installation 
into a screw machine 
PGA socket 


Processor 
Socket 
excessive. 
Even most Low Insertion 
Force (L1F) sockets 
often require 
more than 60 Ibs. of insertion 
force. A Zero Insertion 
Force (ZIF) socket 
insures that the chip insertion 
force does not damage the PC board. Be 
sure to allow enough clearance 
for the ZIF socket 
handle. Do not use a 
L1F or screw machine 
socket. 


"Plug and Play" 
Jumper or switch changes 
should not be needed to electrically 
configure 
the system for the Pentium OverDrive 
Processor. 


Thorough 
Documentation 
Describe 
the Pentium OverDrive 
Processor 
Socket and the Pentium 
OverDrive 
Processor 
installation 
procedure 
in the PC's User's Manual. 


• 


The following 
lists provide socket vendor 
information 
based on products 
offered 
for the OverDrive 
Socket 
for Intel486 
DX and Intel486 
SX CPU Systems 


NOTE: 
This is not a comprehensive 
list. Intel cannot 
guar- 
antee that these 
sockets 
will meet every PC manu- 
facturer's 
specific 
requirements. 


Zero 
Insertion 
Force 
OverDrive 
Processor 
Sockets 
and Vendors: 


1. AMP Inc. 
219 American 
Avenue 
Greensboro, 
N.C. 27409-1803 
Part Number: 
TBD 
Contact: 
James 
Crompton 
- (919) 855-2338 


2. Yamaichi 
Electronics 
1420 Koll Circle, Suite B 
SanJose,CA 
95112 
Part Number: 
TBD 
Contact: 
Jim Bennett, 
Sales Manager - 
(408) 452-0797 


The Pentium OverDrive 
Processor 
and system chas- 
sis have several 
unique design requirements 
due to 
the attached 
active heat sink. The following 
sections 
provide 
sample 
maximum 
system 
operating 
temper- 
ature 
calculations 
so systems 
may be designed 
to 
comply 
with 
the 
thermal 
requirements 
of 
the 
Pentium 
OverDrive 
Processor. 


Thermal 
Calculations 
for 
a Hypothetical 
System 


The following 
equation 
can be used to calculate 
the 
maximum 
operating 
temperature 
of a system: 


TA(in): The temperature 
of the air going Into the fan/ 
heat sink. 


TSINK: Temperature 
of heat sink base, as measured 
in the center. 


851: Heat Sink to Internal Temperature 
[TA(in)] Ther- 
mal Resistance 


TA (out): The temperature 
of the air outside 
the sys- 
tem. 
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Since the Pentium OverDrive 
Processor 
uses an ac- 
tive heat sink, 851 (as shown 
in Table 
12.3) is rela- 
tively constant, 
regardless 
of the airflow 
provided 
to 
the processor. 
Table 
12.4 details 
the maximum 
cur- 
rent requirements 
of the Pentium 
OverDrive 
Proces- 
sor. The 
maximum 
ambient 
temperature 
specifica- 
tion for the Pentium OverDrive 
Processor 
is 55°C for 
both 25 MHz and 33 MHz processors 
with the heat 
sink attached. 
Therefore, 
the internal temperature 
of 
the 
air (TA (in)) 
may 
not 
exceed 
55°C 
under 
the 
worst 
case 
operating 
conditions 
specified 
for 
the 
system. 
This ensures 
that the value 
of TSINK does 
not exceed 
85°C. 


Table 
12.3. Thermal 
Resistance 
("C/W)-8SI 


Processor 
Type 
8S1 - 
°C/W 


Fan/Heat 
Sink 
2.4 


Table 
12.4. OverDrive 
Processor 
Typical 
and 
Maximum 
lee Values 


System 
Processor 
Processor 
Frequency 
Typical 
lee 
Maximum 
lee 
(MHz) 
(mA) 
(mA) 


25 
TBD 
1900 


33 
TBD 
2500 


Ice is dependent 
upon the Vee level of the system, 


processor 
bus loading, 
software 
code 
sequences, 
and silicon 
process 
variations. 


Maximum 
TA(in) is specified 
and 
be verified 
using 
the equation 
and parameters 
provided 


TA(in) = 85°C - 
((2.5A • 5V) • 2.4°C/W) 


TA(in) = 85°C - 
((12.5W 
• 2.4°C/W) 


TA(in) = 85°C - 
30°C 


TA(in) = 55°C 


Assuming 
the internal 
system 
ambient 
TA(in) is with- 
in 5°C-10·C 
of TA(out), this would 
allow 
the maxi- 
mum 
TA(out) 
temperature 
to 
be 
approximately 
45°C-50°C. 
It is the responsibility 
of the system 
de- 
signer 
to ensure 
TA(in) meets 
this specification 
by 
providing 
sufficient 
airflow around the Pentium Over- 
Drive Processor 
to remove 
the heated 
air expelled 
by the fan/heat 
sink. 
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The following should be considered when designing 
the Pentium OverDrive Processor Socket for an In- 
tel486 DX2 microprocessor-based system. 


12.4.1 
OverDrive 
PROCESSOR 
DETECTION 


The 
component 
identifier 
and 
stepping/revision 


identifier for the Pentium OverDrive Processor is 
readable in the DH and DL registers respectively, 
immediately after RESET, where 


DH = 15h 


DL = 30h-3Fh 


As with the Intel486 DX2 microprocessor specifica- 
tion, it is recommended that the BIOS save the con- 
tents of the OX register, immediately after RESET, 
so that this information can be used later, if required. 


The Pentium OverDrive Processor for Intel486 DX2 
microprocessor-based 
systems executes 
instruc· 
tions at a multiple of the frequency of the input 
clock. This Pentium OverDrive Processor also will 
use advanced design techniques to decrease the 
number of clocks per instruction (cpi) from that of 
the Intel486 DX2 microprocessor. Thus software, 
such as instruction-based timing loops, will execute 
faster on the Pentium OverDrive Processor than on 
either the Intel486 OX CPU or the Intel486 DX2 mi- 
croprocessor at the same input clock frequency. In- 
structions such as NOP, LOOP, and JMP $+2 
are 
frequently used by the BIOS to implement timing 
loops that are required, for example, to enforce re- 
covery time between consecutive accesses for I/O 
devices. These instruction-based, timing-loop imple- 
mentations may require modification to be compati- 
ble with this Pentium OverDrive Processor Socket. 


In order to avoid any incompatibilities, timing loops 
can be implemented in hardware rather than in soft- 
ware. This provides transparency and also does not 
require any change in BIOS or I/O device drivers in 
the future when moving to higher processor clock 
speeds. 


As an example, a timing loop may be implemented 
as follows: The software performs a dummy I/O in- 
struction to an unused I/O port. The hardware for 
the bus controller logic recognizes this I/O instruc- 
tion and delays the termination of the I/O cycle by 
keeping ROY# or BRDY# deasserted for the appro- 
priate amount of time. 


12.5 
Test ReqUirements 


The Pentium OverDrive Processor Socket's electri- 
cal functionality can be verified by fully testing the 
PC with a populated Pentium OverDrive Processor 
Socket. We recommend that the system is tested 
with all available 
OverDrive Processors to ensure 
that there are no BIOS issues. The socket can be 
electrically tested with the OverDrive Processor for 
Intel486 SX/lntel486 OX CPU-based systems. The 
Pentium OverDrive Processor should also be used 
to test the hardware and software when it is avail- 
able. The BIOS requirements to maintain compatibil- 
ity with all OverDrive Processors are discussed in 
Section 12.4 of this document. All OverDrive Proces- 
sors undergo thorough application software compat- 
ibility testing prior to their introduction. 


12.6 
Pentium™ 
OverDrive™ 
Processor 
Socket Pinout 


The Pentium OverDrive Processor Socket pinout is 
shown in Figures 12.3 and 12.4. As mentioned in 
Section 12.2, the key pins are critical for component 
orientation and should be used on any Pentium 
OverDrive Processor Socket. 


• 
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A 
8 
C 
D 
E 
F 
G 
H 
J 
K 
L 
M 
N 
P 
Q 
R 
S 
T 
U 


19 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
19 
Ne 
RES 
Vss 
Vee 
VSS 
INIT 
Vss 
VSS 
Vee 
Vee 
Vee 
Vss. 
Vss 
RES 
Vss 
Vee 
Vss 
RES 
RES 


18 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
18 
RES 
AHOLOEAOS. 8516" 
80fT" 
Vss 
8E3" 
Vss 
Vss 
PeO 
Vss 
Vss 
Vss 
W/R- 
Vss 
PCHKlflI 
INe 
ADS- 
RES 


17 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
17 
Vss 
INTR 
RES 
RESET 
BS8- 
Vee 
ROY· 
Vee 
Vee 
BEt" 
Vee 
Vee 
Vee 
W/IO- 
Vee 
PLOCK# BLAST" 
A4 
Vss 


16 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
16 
Vee 
IGNNE- 
NWI 
flUSH" 
A20 ••- 
HOLD 
KEN- 
STPeLK-BROY- 
BE2- 
BEO- 
PWT 
o/e- 
LOCK- 
HLOA 
BRED 
A3 
A6 
Vee 


15 
0 
0 
0 
0 • • • 
• • • 
0 
0 
0 '0 
15 
vss 
RES 
UP- 
INe 
PLUG 
PLUG 
PLUG 
PLUG 
PLUG 
PLUG 
A2 
Vee 
Vss 
Vss 
14 
0 
0 
0 
0 • 
• 
0 
0 
0 
0 
14 
Vss 
FERR" 
INe 
Ne 
PLUG 
PLUG 
A7 
A8 
AID 
vss 
13 
0 
0 
0 
0 • 
• 
0 
0 
0 
0 
13 
Vss 
INe 
INe 
SWIAeT- 
PLUG 
PLUG 
A5 
All 
Vss 
Vss 


Pentium 
OverDrive 
Processor 
Socket 
for 


12 
0 
0 
0 
0 
5V 1486 DX2 CPU-bssed 
systems 
0 
0 
0 
0 
12 
Vss 
Vss 
Vee 
INe 
A9 
Vee 
Vss 
Vss 
11 
0 
0 
0 
0 
0 
0 
0 
0 
11 
Vee 
INe 
SWII" 
INe 
AI3 
Vee 
Vss 
Vee 
10 
0 
0 
0 
0 
0 
0 
0 
0 
10 
Vee 
Vss 
Vee 
030 
TOP SIDE VIEW 
AI6 
vee 
vss 
Vee 
9 
0 
0 
0 
0 
237 Contsct 
PGA 
0 
0 
0 
0 
9 
vee 
029 
031 
028 
A20 
Vee 
Vss 
Vee 
8 
0 
0 
0 
0 
0 
0 
0 
0 
8 
Vss 
Vss 
Vee 
026 
A22 
AI5 
AI2 
Vss 
R••. 8/4/93 
7 
0 
0 
0 
0 • 
• 
0 
0 
0 
0 
7 
RES 
024 
025 
027 
PLUG 
PLUG 
A24 
Vee 
Vss 
Vss 
6 
0 
0 
0 
0 • 
• 
0 
0 
0 
0 
6 
RES 
OP3 
Vss 
Vee 
PLUG 
PLUG 
A2 , 
AI8 
AI4 
Vss 
5 
0 
0 
0 
0 
0 • • 
• • • 
0 
0 
0 
0 
5 
vss 
023 
Vss 
Vee 
KEY 
PLUG 
PLUG 
PLUG 
PLUG 
PLUG 
AI9 
Vss 
INe 
Vss 
4 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
4 
vee 
RES 
Vss 
eLK 
017 
DID 
015 
012 
OP2 
016 
0'4 
07 
04 
OPO 
A30 
AI7 
Vee 
A23 
Vee 
3 
• 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
3 
PLUG 
022 
021 
018 
013 
Vee 
08 
Vee 
OJ 
05 
Vee 
06 
Vee 
01 
A29 
Vss 
A25 
A26 
Vss 
2 
• 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
2 


PLUG 
020 
019 
011 
09 
Vss 
OPI 
Vss 
Vss 
Vee 
Vss 
Vss 
Vss 
02 
DO 
A31 
A28 
A27 
RES 
1 
0 • • 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
1 


'{EY 
PLUG 
PLUG 
Vee 
Vss 
RES 
RES 
Vss 
Vee 
Vee 
Vee 
Vss 
RES 
RES 
Vss 
Vee 
Vss 
RES 
RES 


A 
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E 
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S 
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U 
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NOTE: 
All NC and RES pins must remain unconnected 


Figure 12-3. Pentium OverDrive Processor Socket Pinout 
for 5V Intel486 DX2 CPU·Based Systems (Top Side View) 


U 
T 
S 
R 
Q 
P 
N 
M 
L 
K 
J 
H 
G 
F 
E 
0 
C 
B 
A 


19 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
19 
RES 
RES 
VSS 
Vee 
Vss 
RES 
VSS 
Vss 
Vee 
Vee 
Vee 
Vss 
VSS 
INIT 
Vss 
Vee 
Vss 
RES 
NC 


18 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
18 
RES 
ADS- 
INC 
PCHKtf. 
Vss 
W/R- 
Vss 
Vss 
V'S 
PeO 
Vss 
Vss 
BO- 
Vss 
Borr- 
BS16- 
EADS- 
AHOlO 
RES 


17 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
17 
Vs, 
A4 
BlAST- PlOCK- 
Vee 
lot/IO- 
Vee 
Vee 
Vee 
BEll' 
Vee 
Vee 
ROY· 
Vee 
BS8- 
RESET 
RES 
INTR 
V'S 


16 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
16 
Vee 
A6 
A3 
BREQ 
HLOA 
LOCK- 
o/c- 
PWT 
BEO- 
8E2" 
BRDY-STPCLK- 
KEN- 
HOLD 
A20 •••• rLUSH. 
Nlotl 
IGNNE- 
Vee 


15 
0 
0 
0 
0 • • • 
• • • 
0 
0 
0 
0 
15 
Vss 
Vss 
Vee 
A2 
PLUG 
PLUG 
PLUG 
PLUG 
PLUG 
PLUG 
INC 
UP- 
RES 
vss 


14 
0 
0 
0 
0 • 
• 
0 
0 
0 
0 
14 
Vss 
Al0 
A8 
A7 
PLUG 
PLUG 
NC 
INC 
FERR· 
V's 


13 
0 
0 
0 
0 • 
• 
0 
0 
0 
0 
13 
Vss 
Vss 
All 
AS 
PLUG 
PLUG SlotIACT- 
INC 
INC 
V'S 
Pentium 
OverDrive 
Processor 
Socket 
for 
12 
0 
0 
0 
0 
5V 1486 DX2 CPU-based 
systems 
0 
0 
0 
0 
12 
Vss 
Vss 
Vee 
A9 
INC 
Vee 
Vss 
Vs, 


11 
0 
0 
0 
0 
0 
0 
0 
0 
11 
Vee 
v,s 
Vee 
A 13 
INC 
SWI" 
INC 
Vee 


10 
0 
0 
0 
0 
0 
0 
0 
0 
10 
Vee 
Vss 
Vee 
A16 
BOTTOM 
SIDE VIEW 
030 
Vee 
V'S 
Vee 


9 
0 
0 
0 
0 
237 Contact 
PGA 
0 
0 
0 
0 
9 
Vee 
Vss 
Vee 
A20 
028 
031 
029 
vee 


8 
0 
0 
0 
0 
0 
0 
0 
0 
8 
V,s 
A12 
A15 
A22 
026 
Vee 
Vss 
Vss 


R.,. 8/4/93 
7 
0 
0 
0 
0 • 
• 
0 
0 
0 
0 
7 
Vss 
Vss 
Vee 
A24 
PLUG 
PLUG 
027 
025 
024 
RES 


6 
0 
0 
0 
0 • 
• 
0 
0 
0 
0 
6 
Vss 
A14 
A18 
A21 
PLUG 
PLUG 
Vee 
Vss 
OP3 
RES 


5 
0 
0 
0 
0 • • • 
• • 
0 
0 
0 
0 
0 
5 
Vss 
INC 
VS, 
A 19 
PLUG 
PLUG 
PLUG 
PLUG 
PLUG 
KEY 
Vee 
Vss 
023 
vss 


4 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
4 
Vee 
A23 
Vee 
A17 
A30 
OPO 
04 
07 
014 
016 
OP2 
012 
015 
010 
017 
CLK 
Vss 
RES 
vee 
3 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 • 
3 
VS, 
A26 
A25 
V,s 
A29 
01 
Vee 
06 
Vee 
05 
03 
Vee 
08 
Vee 
013 
018 
021 
022 
PLUG 


2 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 • 
2 
RES 
A27 
A28 
A31 
DO 
02 
V'S 
Vss 
Vss 
Vee 
V'S 
Vss 
OPl 
V'S 
09 
011 
019 
020 
PLUG 


1 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 • • 
0 
1 
RES 
RES 
Vss 
Vee 
Vss 
RES 
RES 
Vss 
Vee 
Vee 
Vee 
Vss 
RES 
RES 
Vss 
Vee 
PLUG 
PLUG 
KEY 


U 
T 
S 
R 
Q 
P 
N 
M 
L 
K 
J 
H 
G 
F 
E 
0 
C 
B 
A 


241245-58 


NOTE: 
All NC and RES pins must remain unconnected 


Figure 12-4. Pentium OverDrive Processor Socket 
Pinout for SV Intel486 DX2 CPU·Based System (Bottom Side View) 


• 


intel~ 


Address 
Data 
Control 
Control 
Res(1) 
Vcc 
Vss 


A2 
R15 
00 
02 
A20M# 
E16 
ROY# 
G17 
A6 
A4 
L1 
A5 
M18 
A3' 
516 
01 
P3 
A05# 
T18 
RE5ET 
017 
A7 
A9 
L3 
A8 
M19 
A4 
T17 
02 
P2 
AHOLO 
818 
5MI# 
C11 
A18 
A10 
L17 
A12 
N2 
A5 
R13 
03 
J3 
8EO# 
L16 
5MIACT# 
013 
A19 
A11 
L19 
A13 
N18 
A6 
T16 
04 
N4 
8E1# 
K17 
5TPCLK# 
H16 
84 
A16 
M17 
A14 
N19 
A7 
R14 
05 
K3 
8E2# 
K16 
UP# 
C15 
815 
C8 
N3 
A15 
01 
A8 
514 
D6 
M3 
8E3# 
G18 
W/R# 
P18 
819 
C10 
N17 
A17 
018 
A9 
R12 
07 
M4 
8LA5T# 
517 
C17 
C12 
017 
88 
019 
A10 
T14 
08 
G3 
80FF# 
E18 
F1 
01 
R1 
810 
R3 
A11 
513 
09 
E2 
8ROY# 
J16 
G1 
05 
R19 
812 
51 
A12 
T8 
010 
F4 
8REO 
R16 
N1 
06 
54 
C4 
55 
A13 
R11 
011 
02 
858# 
E17 
P1 
019 
57 
C5 
519 
A14 
T6 
012 
H4 
8516# 
018 
P19 
F3 
59 
C8 
n 
A15 
58 
013 
E3 
CLK 
04 
T1 
F17 
510 
C19 
T9 
A16 
R10 
014 
L4 
O/C# 
N16 
T19 
H3 
511 
E1 
no 
A17 
R4 
015 
G4 
OPO 
P4 
Position 
U1 
H17 
512 
E19 
n1 
A18 
56 
016 
K4 
OP1 
G2 
U2 
J1 
515 
F2 
T12 
A19 
R5 
017 
E4 
OP2 
J4 
KEY 
E5 
U18 
J17 
U4 
F18 
n3 
A20 
R9 
018 
03 
OP3 
86' 
KEY 
A1 
U19 
J19 
U9 
G19 
n5 
A21 
R6 
019 
C2 
EA05# 
C18 
PLUG 
A2 
N/C(1) 
K1 
U10 
H1 
U3 
A22 
R8 
020 
82 
FERR# 
814 
PLUG 
A3 
K2 
U11 
H2 
U5 
A23 
T4 
021 
C3 
FLU5H# 
016 
PLUG 
81 
A19 
K19 
U16 
H18 
U6 
A24 
R7 
022 
83 
HLOA 
016 
PLUG 
C1 
014 
H19 
U7 
A25 
53 
023 
85 
HOLO 
f16 
PLUG 
E6 
INC 
J2 
U8 
A26 
T3 
024 
87 
IGNNE# 
816 
PLUG 
E14 
J18 
U12 
A27 
T2 
025 
C7 
INIT 
F19 
PLUG 
E15 
811 
L2 
U13 
A28 
52 
026 
08 
INTR 
817 
PLUG 
F5 
813 
L18 
U14 
A29 
03 
027 
07 
KEN# 
G16 
PLUG 
F15 
C13 
M1 
U15 
A30 
04 
028 
09 
LOCK# 
P16 
PLUG 
P5 
C14 
M2 
U17 
A31 
R2 
029 
89 
M/IO# 
P17 
PLUG 
P15 
011 


030 
010 
NMI 
C16 
PLUG 
05 
012 


031 
C9 
PCO 
K18 
PLUG 
06 
015 


PCHK# 
R18 
PLUG 
014 
S18 


PLOCK# 
R17 
PLUG 
015 
T5 


PWT 
M16 


NOTE: 
1. All RES pins are reserved 
for later use by Intel. To ensure 
proper 
operation 
of the microprocessor, 
all RES and N/C pins 
should 
be left unconnected. 
Please 
contact 
Intel for design 
information. 


The signal pin descriptions for the Pentium Over- 
Orive Processor are identical to the pin descriptions 
for the Intel486 OX2 Microprocessor except for the 
Upgrade Present pin (UP#) and KEY pin. The pin 
descriptions for these two signals are shown in Ta- 
ble 12.7. 


Many pins in the Pentium OverOrive Processor 
Socket are defined as reserved (RES). The function 
of these pins is documented separately. These sig- 
nals will be used to implement a Write 8ack level 1 
(on-chip) cache protocol. These pins must not be 
connected unless they are used to implement a level 
1 Write 8ack solution using the information available 
separately. To insure proper operation, pins marked 
as NC must be left unconnected as well. 


Symbol 
Type 
Name and Function 


Intel486 
DX2 CPU INTERFACE 


UP# 
0 
The Upgrade Present 
pin is used to signal the Intel486 
DX2 microprocessor 
to float its 
outputs 
and stop driving the bus. It is active low and is never floated. 
UP# 
is driven 
low at power-up 
and remains active for the entire duration 
of the Pentium OverDrive 
Processor 
operation. 


KEYPJN 


KEY 
The Key pin is an electrically 
non-functional 
pin which provides 
backward 
compatibility 
to the OverDrive 
Processor 
for Intel486 SX/lntel486 
OX CPU-based 
systems and is 
used to ensure correct 
orientation 
for 169-pin upgrade products. 
Proper orientation 
of 
the Pentium OverDrive 
Processor 
is insured by the four socket contacts 
which must 
be plugged 
(A2, A3, 81 and C1); the Pentium OverDrive 
Processor 
will not have pins 
in these locations. 


compatible 
to the maximum 
ratings and A.C. Specifi- 
cations 
of the Intel486 
DX2 Microprocessor. 
Table 
12.8 provides 
the D.C. Operating 
Conditions 
for the 
Pentium 
OverDrive 
Processor. 
The electrical 
specifications 
in this section 
represent 
the 
electrical 
interface 
of 
the 
Pentium 
OverDrive 
Processor. 
The Pentium OverDrive 
Processor 
will be 


Symbol 
Parameter 
Mln 
Max 
Unit 
Notes 


VIL 
Input Low Voltage 
-0.3 
+0.8 
V 


VIH 
Input High Voltage 
2.0 
VCC + 0.3 
V 


VOL 
Output low 
Voltage 
0.45 
V 
(Note 2) 


VOH 
Output 
High Voltage 
2.4 
V 
(Note 3) 


Icc 
Power Supply Current 
ClK 
= 25 MHz 
1900 
mA 
ClK 
= 33 MHz 
2500 


III 
Input leakage 
Current 
±15 
p.A 
(Note 4) 


IIH 
Input leakage 
Current 
200 
p.A 
(Note 5) 


IlL 
Input leakage 
Current 
-400 
p.A 
(Note 6) 


ILO 
Output leakage 
Current 
±15 
p.A 


CIN 
Input Capacitance 
13 
pF 
Fc = 1 MHz(7) 


Co 
I/O or Output Capacitance 
17 
pF 
Fc = 1 MHz(7) 


CCLK 
ClK 
Capacitance 
15 
pF 
Fc = 1 MHz(7) 


NOTES: 
1. Functional 
operating 
range: Vcc 
= 5V; Ts = O"C to +80·C. 


2. This parameter 
is measured 
at: 
- 
Address, 
Data, BEn 
4.0 mA 
- 
Definition, 
Control 
5.0 mA 
3. This parameter 
is measured 
at: 
- 
Address, 
Data, BEn 
- 1.0 mA 
- 
Definition, 
Control 
-0.9 
mA 
4. This parameter 
is for inputs without 
pullups 
or pulldowns 
and 0 ~ VIN ~ Vcc. 
5. This parameter 
is for inputs with pulldowns 
and VIH = 2.4V. 


6. This parameter 
is for inputs with pull ups and VIL = 0.45V. 


7. Not 100% 
tested. 


• 


13.0 CONVERTING AN EXISTING 
Intel486™ OX CPU DESIGN 


Converting an Intel486 OX CPU system design to an 
Intel486 OX2 CPU design provides more perform- 
ance for a small difference in cost. Three conversion 
possibilities are available as shown in Table 13.1. 
Migrating from a 33 MHz Intel486 OX CPU to a 50 
MHz Intel486 OX2 CPU could increase performance 
by 35%, and migrating from a 25 MHz Intel486 OX 
CPU to a 50 MHz Intel486 OX2 CPU could increase 
performance by an average of 70%. See the I'}- 
te148(JTMDX2 Microprocessor Performance Bnef 
(Order #241254) for more details on performance. 
Conversion can be as easy as replacing one or two 
devices. 


Table 13.1. Converting 
Intel486TM OX CPU 
Designs 
to Intel486TM DX2 CPU Designs 


Initial 
Converted 
Typical 
Design 
Design 
Performance 
(Inte1486 OX CPU) (Intel486 OX2 CPU) 
Gain 


25 MHz 
50 MHz 
70% 


33 MHz 
50 MHz 
35% 


33 MHz 
66 MHz 
70% 


A few system details should be checked first to be 
sure the design is ready for the Intel486 OX2 CPU. 
Check with your BIOS vendor to be sure any BIOS 
issues have been resolved. The BIOS for the In- 
tel486 OX CPU may have timing loops. Since the 
Intel486 OX2 CPU runs instructions twice as fast as 
the Intel486 OX CPU, timing loops may no longer 
return the required results. Most of the timing loops 
have been removed from a standard BIOS, but there 
may be some versions that need updating. Another 
BIOS issue that may not be critical, is the processor 
identification code. There are different 10 codes in 
the Intel486 OX CPU and the Intel486 OX2 CPU. 
The BIOS may need to be modified to identify the 
Intel486 OX2 CPU properly. Refer to Table 6.3 for 
the component 10 code. 


Other system parameters to watch out for are the 
thermal and power supply specifications. Table 14.2 
details the Power Supply Current information, and 
Table 15.2 outlines the Thermal Resistance. Since 
the processor core runs twice as fast for the same 
input clock, the Intel486 OX2 CPU uses more power 
and generates more heat than the Intel486 OXCPU. 
Be sure that there is adequate cooling and adequate 
power built into the design. A heat sink is a recom- 
mended method to help provide cooling for the In- 
tel486 OX2 CPU. 


The system checks mentioned above are common 
to all conversions from an Intel486 OX CPU to an 
Intel486 OX2 CPU regardless of the speed of the 
processor or system. 


A few system implications exist for converting from 
one frequency to another as shown in Figure 13.1. 
The first case is migrating from a 25 MHz Intel486 
OX CPU to a 50 MHz Intel486 OX2 CPU (the bus 
runs at the same speed for both parts). System 
hardware modifications need not be made to plug in 
the 50 MHz Intel486 OX2 CPU and achieve the de- 
sired performance. When all instructions are running 
out of the on-chip cache, performance increases by 
a maximum of 100%. 


The second case is migrating from a 33 MHz In- 
tel486 OXCPU to a 50 MHz Intel486 OX2 CPU. This 
conversion is a two step process. The first step is to 
change the frequency source for the CPU from 33 
MHz to 25 MHz. The Intel486 OX2 CPU can then be 
inserted into the system. Without any tuning of the 
memory and depending on the application, only a 
modest 
performance 
improvement 
may be 
ob- 
served. For programs running entirely out of the on- 
chip cache, however, performance can increase up 
to 50%. There are many factors which contribute to 
the performance of an application, including whether 
there is a second-level (L2) cache, the cache size if 
present, the memory subsystem design, and many 
other factors beyond the scope of this introduction. 
A comprehensive memory subsystem design guide, 
AP469: Cache and Memory Design Considerations 
for Intel48(JTM DX2 Microprocessor, 
is available 
which includes more detailed information on how 
each of these many factors affects Intel486 OX2 
CPU-based system performance. 


Because the Intel486 OX2 core runs twice as fast as 
its external bus, it is more sensitive to wait states. 
The Intel486 OX2 CPU needs to be fed instructions 
and data quickly. Either a high performance memory 
subsystem is needed or an external cache should be 
added. An external cache benefits the Intel486 OX2 
CPU even more than it benefits the Intel486 OX 
CPU, and helps to hide the effects of a slower mem- 
ory subsystem. The Intel486 OX CPU gains an aver- 
age of 3%-9% 
performance by the addition of a 
second-level cache, but the Intel486 OX2CPU gains 
an average of 20-30% performance by adding a 
second level cache. It should be noted however, 
that an external cache does not preclude the bene- 
fits of tuning the memory subsystem. 


REMOVE 
REMOVE 
Intel486 
OX 
Intel486 
OX 
CPU 
CPU 


REPLACE 


INSTALL 
33 MHz CPU 
CRYSTAL 
NEW BIOS 
WITH 25 MHZ 
VERSION 


INSTALL • 
NEW BIOS 
MODIFY 
VERSION 


/ 
POWER AND 
COOLING 


MODIFY 
INSERT 
POWER AND 


Intel486 
™ 
DX2 
COOLING 
CPU 


INSERT 


Intel486 
™ 
DX2 
241245-92 
CPU 


TUNE 
MEMORY 
SYSTEM 


The graph shown in Figure 13.2 shows a set of 
benchmarks known to have a poor cache hit rate. 
This is shown for purposes of memory tuning and 
not to be taken as absolute perlormance. Please re- 
fer to the /nte/486TM 
DX2 Microprocessor 
Perform- 
ance 
Brief (Order /I 241254) for perlormance de- 
tails. 


With the absence of a second-level cache, the mem- 
ory subsystem becomes critical to gaining perlorm- 
ance when converting from a 33 MHz Intel486 DX 
CPU to a 50 MHz Intel486 DX2 CPU. For slow mem- 
ory systems without tuning, the 50 MHz Intel486 
DX2 CPU can possibly run slower than the 33 MHz 
Intel486 DX CPU (see Figure 13.2). By tuning the 
memory design, the 50 MHz Intel486 DX2 CPU can 
reach equivalent perlormance to the 33 MHz In- 
tel486 DX CPU running applications with low cache 
hit rates, and increase perlormance for applications 
with higher hit rates. Tuning the memory design can 
be done easily by either removing a wait state from 
the memory design (if timing permits), andlor adding 
faster DRAM and removing wait state(s) from the 
memory design. 


Changing the wait state configuration for the system 
is often done by programming the DRAM controller 
in the chip set on the motherboard. Each chip set is 
programmed differently at the BIOS level, requiring a 
BIOS modification. For testing purposes, the chip 
set may be programmed on the fly from a DOS pro- 
gram if the register locations are known. 


intel~ 


A typical ISA chip set with an L2 cache, for example, 
allows 6-4-4-4 bus cycles at 33 MHz with 80 ns 
DRAMs for the Intel486 DX CPU. Without modifying 
the memory subsystem, the 50 MHz Intel486 DX2 
CPU achieved an average of 7%-12% 
improve- 
ment over the 33 MHz Intel486 DX CPU. By reduc- 
ing the bus cycles at 25 MHz to 5-2-2-2 (still with 
80 ns DRAMs), the 50 MHz Intel486 DX2 CPU im- 
proved to achieve an average of 15%-20% 
more 
perlormance than the 33 MHz Intel486 DX CPU. By 
replacing the DRAMs with faster devices (70 ns) bus 
cycles could be reduced to 4-2-2-2 at 25 MHz, im- 
proving the perlormance of the 50 MHz Intel486 
DX2 CPU even greater. 


A typical EISA solution is shown in Figure 13.3, using 
the Intel 82350DT Chip Set, which was specifically 
designed to permit variation in CPU type and fre- 
quency. 


In an 82350DT based design the memory subsys- 
tem is controlled by the combination of a flexible 
Programmable State Tracker (PST) and a highly 
configurable 82359 DRAM Controller. The PST, 
which is typically implemented as a 3 to 5 PLD solu- 
tion, is responsible for converting the CPU's clock- 
dependent handshake protocol into a clock-less 
memory interlace protocol. The 82359 in turn uses 
the clock-less memory interlace protocol to control 
main memory as well as to forward host CPU cycles 
to the EISA bus if needed. As a result of this clock- 


--- 
33 
101Hz 
Inlo'486 ox 
CPU 


-- 
50lolHz 
Inlol4860X2 
CPU 


-<>- 50 101Hz 
Inlol4860X2 
CPU 
wi cache 


~ 
33101Hz 
Inlol486 OX 
CPU 
wi cecho 


~ 
100. DOli 
.. 
E~~.. 
co.... 
>... 


O.OOll 


0000(0) 
3121(6) 
3121(6) 
3222(7) 
3222(8) 
4222(8) 
4333(8) 
6444(9) 
0(0) 
2(6) 
3(6) 
2(6) 
3(6) 
3(6) 
3(6) 
4(7) 


MAIN 


MEMORY 


II 


--€: 


less protocol, the system design becomes indepen- 
dent of the CPU and cache combination being used 
and the speed of the CPU clock. Therefore, whenev- 
er a new CPU and cache combination is to be used 
with an 82350DT based system, the only re-design 
that is necessary is to the CPU subsection, leaving 
the main memory and EISA subsections unchanged. 
Typically, this CPU subsection re-design entails 
modifying only the PST functionality and the pro- 
grammable registers of the 82359. 


There are five steps to determine whether wait 
states can be removed from the main memory de- 
sign of an 82350DT system when converting from a 
33 MHz Intel486 OX CPU to a 50 MHz Intel486 DX2 
CPU. An overview of these five steps is covered 
here; 
the 
system 
designer 
is 
referred 
to 
the 
82350DT EISA Chip Set Design Guide (Order 
# 296911) for detailed design information. 
1. Calculate the 82359 delay line tap values for opti- 
mal 25 MHz operation 
2. Determine all memory cycle lengths for operation 
at 25 MHz 
3. Re-evaluate the 
PST design (deterministic & 
snoop cycle trackers) 
4. Modify the PLD equations for the PST 
5. Update system BIOS to reflect new 82359 pro- 
grammable register values 


Step one is to perform a timing analysis of the main 
memory subsystem to determine the minimum num- 
ber of CPU clocks required for each memory cycle. 
Once the memory cycle lengths are known, the PST 
design can be re-evaluated with the goal of remov- 
ing unnecessary wait states. Before the system de- 
signer can determine the memory cycle lengths, the 
delay line timings of the 82359 must be analyzed. 


The timing for the DRAM control and address sig- 
nals of the 82359 is based on four integrated asyn- 
chronous delay line elements which can be con- 
trolled by the 82359 programmable registers. Once 
the delay line tap values have been verified or modi- 
fied, the system designer should determine the mini- 
mum number of CPU clocks required for the differ- 
ent memory cycles (Le., read page hit, page miss 
write, burst read, etc.). Armed with the delay line tap 
programming values and the number of CPU clocks 
required for each type of memory cycle, the system 
designer can now evaluate the PST design to deter- 
mine if any unneeded wait states can be removed. 


The PST for an 82350DT based system can be sep- 
arated into four primary functions: bus cycle control 
(including arbitration and posted write control), cycle 


length 
tracking, 
CPU "Ready" 
generation, 
and 
cache invalidation control. Although the PST con- 
tains a number of state machines only a few of the 
state machines need to be re-evaluated to deter- 
mine whether any wait states can be removed for 
converting from a 33 MHz Intel486 OX CPU to a 
50 MHz Intel486 DX2 CPU. The state machines that 
need to be re-evaluated are the deterministic cycle 
tracker and the snoop cycle tracker. 


The deterministic cycle tracker is responsible for 
generating ROY or BRDY to the CPU and cache for 
cycles that are deterministic in length. All main mem- 
ory cycles, except locked cycles which require EISA 
arbitration, are deterministic cycles. Once the deter- 
ministic cycle tracker knows that a deterministic cy- 
cle is occurring, it uses the 82359 CYCLN(2:0) and 
PAGEHIT# outputs to determine when to generate 
ROY or BRDY to the CPU. For burst cycles the de- 
terministic cycle tracker also uses the IF(1:0) and 
SPEED(1:0) outputs of the 82359 for generating 
BRDY. After this analysis has been completed the 
system designer can then determine if the determi- 
nistic cycle tracker can be optimized to take advan- 
tage of converting from a 33 MHz Intel486 OX CPU 
to a 50 MHz Intel486 DX2 CPU. 


The other state machine that must be re-evaluated 
for correct system functionality is the snoop cycle 
tracker. The snoop cycle tracker state machine de- 
sign must meet two goals; respond to a SNUPRQ 
with a SNUPACK# within 180 ns (based on an EISA 
burst write cycle), and maintain a snoop cycle fre- 
quency capability that is equal to or faster than the 
fastest system bus master write cycle frequency. 
Due to the change of CPU clock frequency from 
33 MHz to 25 MHz, the system designer must re- 
evaluate the snoop cycle tracker state machine to 
determine if the two design goals are still being met 
in the 50 MHz Intel486 DX2 CPU implementation. 


In conclusion, when converting an 82350DT based 
design from a 33 MHz Intel486 OXCPU to a 50 MHz 
Intel486 DX2 CPU the system designer must re-eval- 
uate the main memory cycle timings to determine 
whether the PST and 82359 programmable registers 
need to be modified to take advantage of the in- 
creased performance benefits of the 50 MHz In- 
tel486 DX2 CPU. Once the system designer has de- 
cided to modify the PST and the 82359 programma- 
ble registers, the conversion from a 33 MHz Intel486 
OX CPU to a 50 MHz Intel486 DX2 CPU is usually 
just as simple as modifying the PLD equations, re- 
programming the PST PLDs, and upgrading the sys- 
tem BIOS to reflect the new 82359 programmable 
register values. 


The following sections describe recommended elec- 
trical connections for the Intel486 DX2 microproces- 
sor, and its electrical specifications. 


The Intel486 DX2 microprocessor is implemented in 
CHMOS V technology and has modest power re- 
quirements. However, its high clock frequency out- 
put buffers can cause power surges as multiple out- 
put buffers drive new signal levels simultaneously. 
For clean on-chip power distribution at high frequen- 
cy, 24 Vcc and 28 Vss pins feed the Intel486 DX2 
microprocessor. 


Power and ground connections must be made to all 
external Vcc and GND pins of the Intel486 DX2 mi- 
croprocessor. On the circuit board, all Vcc pins must 
be connected on a Vcc plane. All Vss pins must be 
likewise connected on a GND plane. 


14.1.2 
POWER 
DECOUPLING 
RECOMMENDATIONS 


Liberal decoupling capacitance should be placed 
near the Intel486 DX2 microprocessor. The Intel486 
DX2 microprocessor driving its 32-bit parallel ad- 
dress and data busses at high frequencies can 
cause transient power surges, particularly when driv- 
ing large capacitive loads. 


Low inductance capacitors and interconnects are 
recommended for best high frequency electrical per- 
formance. Inductance can be reduced by shortening 
circuit board traces between the Intel486 DX2 micro- 
processor and decoupling capacitors as much as 
possible. Capacitors specifically for PGA packages 
are also commercially available. 


14.1.3 
OTHER 
CONNECTION, 


RECOMMENDATIONS 


For reliable operation, always connect unused in- 
puts to an appropriate signal level. Active LOW in- 
puts should be connected to Vcc through a pullup 
resistor. Pullups in the range of 20 KO are recom- 
mended. Active HIGH inputs should be connected to 
GND. 


Table 14.1 is a stress rating only, and functional op- 
eration at the maximums is not guaranteed. Function 
operating conditions are given in 14.3 D.C. Specifi- 
cations and 14.4 A.C. Specifications. 


Extended exposure to the Maximum Ratings may af- 
fect device reliability. Furthermore, although the In- 
tel486 DX2 microprocessor contains protective cir- 
cuitry to resist damage from static electric discharge, 
always take precautions to avoid high static voltages 
or electric fields. 


Case Temperature under Bias 
- 65°C to + 110°C 
Storage Temperature 
-65°C to + 150°C 
Voltage on Any Pin with 
Respect to Ground 
-0.5 to Vcc + 0.5V 
Supply Voltage with 
RespecttoVss 
-0.5Vto 
+6.5V 


14.3 
Intel486 DX2 D.C. Specifications 
Functional 
Operating 
Range: Vcc = 5V ±5%; 
TCASE = O°C to +85°C 


Symbol 
Parameter 
Min 
Max 
Unit 
Notes 


Vil 
Input low 
Voltage 
-0.3 
+0.8 
V 


VIH 
Input High Voltage 
2.0 
VCC +0.3 
V 


Val 
Output low 
Voltage 
0.45 
~ 
V 
(Note 1) 


VOH 
Output High Voltage 
2.4 
. 
.~~~ 
V 
(Note 2) 


Icc 
Power Supply Current 
~ 
:,.~ 


(66 MHz) 
L :~ 


mA 
(Note 3) 
(50 MHz) 


ICCF 
Power Supply Current in 
~~.~d >,,;-"50 
mA 
(Note 8) 
Power Down Mode 
_!J 
~-~ 


III 
Input Leakage Current 
.- 
1'7~',.,~ 
+15 
/LA 
(Note 4) 


IIH 
Input leakage 
Current 
••••~.~ 
200 
/LA 
(Note 5) 
'. 


III 
Input leakage 
Curren~" 
-400 
/LA 
(Note 6) 


ILO 
Output leakage 
Curr~t 
±15 
/LA 


CIN 
Input Capacitance 
13 
pF 
Fc = 1 MHz (Note 7) 


Co 
I/O or Output Capacitance 
17 
pF 
Fc = 1 MHz (Note 7) 


CClK 
ClK 
Capacitance 
15 
pF 
Fc = 1 MHz (Note 7) 


NOTES: 
1. This parameter 
is measured 
at: 


Address, 
Data, BEn 
4.0 mA 
Definition, 
Control 
5.0 mA 
2. This parameter 
is measured 
at: 


Address, 
Data, BEn 
-1.0 
mA 
Definition, 
Control 
-0.9 
mA 
3. Typical 
supply 
current: 


775 mA 
@ 50 MHz 
975 mA 
@ 66 MHz 
4. This parameter 
is for inputs without 
internal 
pullups 
or pull downs 
and 0 ,;; VIN ,;; Vcc. 


5. This parameter 
is for inputs with internal 
pulldowns 
and VIH = 2.4V. 
6. This parameter 
is for inputs with internal 
pullups 
and VIL = 0.45V. 
7. Not 100% 
tested. 
B. The ICCF specification 
in the above 
table is a target 
value. 
It has not been tested. 


the 
voltage 
levels 
indicated 
by Figure 
14.3 when 
A.C. specifications 
are measured. 
Intel486 
DX2 mi- 
croprocessor 
output 
delays 
are specified 
with mini- 
mum and maximum 
limits, measured 
as shown. 
The 
minimum 
Intel486 
DX2 microprocessor 
delay times 
are hold times provided 
to external 
circuitry. 
Intel486 
DX2 microprocessor 
input setup and hold times are 
specified 
as 
minimums, 
defining 
the 
smallest 
ac- 
ceptable 
sampling 
window. 
Within the sampling 
win- 
dow, a synchronous 
input signal 
must be stable 
for 
correct 
Intel486 
DX2 microprocessor 
operation. 


The A.C. specifications, 
given in Table 
14.3, consist 
of output delays, 
input setup requirements 
and input 
hold 
requirements. 
All A.C. specifications 
are rela- 
tive to the rising edge of the ClK 
signal. 


A.C. specifications 
measurement 
is defined 
by Fig- 
ures 14.1-14.7. 
All timings 
are referenced 
to 1.5V 
unless otherwise 
specified. 
Inputs must be driven to 


Table 14.4.1. 50 MHz Intel486 
DX2 Microprocessor 
A.C. Characteristics 


VCC = 5V ± 5%; Tcase = O·C to + 85·C; CI = 50 pF unless otherwise 
specified 


Symbol 
Parameter 
Min 
Max 
Unit 
Figure 
Notes 


Frequency 
8 
25 
MHz 
1X Clock Driven to Intel486 
DX2 


t1 
ClK 
Period 
40 
125 
ns 
14.1 


t1a 
ClK 
Period Stability 
0.1% 
A 
Adjacent 
Clocks 


t2 
ClK 
High Time 
14 
ns 
14.1 
at2V 


t3 
ClK 
low 
Time 
14 
ns 
14.1 
at 0.8V 
~ 
ClK 
Fall Time 
4 
ns 
14.1 
2Vto 
0.8V 


t5 
ClK 
Rise Time 
'4 
ns 
14.1 
0.8V to 2V 


ts 
A2-A31, 
PWT, PCD, BEO-3#, 
3 
19 
ns 
14.5 
M/IO#, 
D/C#, 
W/R#, 
ADS#, 
lOCK#,FERR#,BREQ,HlDA 
Valid Delay 


t7 
A2-A31, 
PWT, PCD, BEO-3#, 
28 
ns 
14.6 
(Note 1) 
M/IO#, 
D/C#, 
W/R#, 
ADS#, 


lOCK # Float Delay 


t8 
PCHK# 
Valid Delay 
3 
24 
ns 
14.4 
(Note 3) 


tea 
BLAST #, PLOCK # Valid Delay 
3 
24 
ns 
14.5 
(Note 3) 


t9 
BLAST #, PLOCK # Float Delay 
28 
ns 
14.6 
(Note 1) 


t10 
DO-D31, 
DPO-3 Write Data Valid 
3 
20 
ns 
14.5 
(Note 3) 
I 


Delay 


t11 
DO-D31, 
DPO-3 Write Data Float 
28 
ns 
14.6 
(Note 1) 
Delay 


t12 
EADS # Setup Time 
8 
ns 
14.2 


t13 
EADS # Hold Time 
3 
ns 
14.2 


t14 
KEN#, 
BS16#, 
BS8# 
Setup Time 
8 
ns 
14.2 


t15 
KEN#, 
BS16#, 
BS8# 
Hold Time 
3 
ns 
14.2 


t16 
RDY #, BRDY # Setup Time 
8 
ns 
14.3 


t1] 
RDY #, BRDY # Hold Time 
3 
ns 
14.3 


t18 
HOLD, AHOlD, 
BOFF # Setup Time 
8 
ns 
14.2 


t19 
HOLD, AHOlD, 
BOFF # Hold Time 
3 
ns 
14.2 


t20 
RESET, FlUSH#, 
A20M#, 
NMI, 
8 
ns 
14.2 
(Note 4) 
INTR, IGNNE# 
Setup Time 


t21 
RESET, FlUSH#, 
A20M#, 
NMI, 
3 
ns 
14.2 
(Note 4) 
INTR, IGNNE# 
Hold Time 


t22 
DO-D31, 
DPO-3, 
A4-A31 
Read 
5 
ns 
14.2,14.3 
Setup Time 


t23 
DO-D31, 
DPO-3, 
A4-A31 
Read 
3 
ns 
14.2,14.3 
Hold Time 


NOTES: 
1. Not 100% tested. Guaranteed by design characterization. 
2, All timing specifications assume CL = 50 pF, Charts 14.4,3 provides the charts that may be used to determine the delay 
due to derating, depending on the lumped capacitive loading, that must be added to these specification values. 
3. The minimum Intel486 DX2 output valid delays are hold times provided to external circuitry. 
4. A reset pulse width of 15 ClK cycles is required for warm resets. Power-up resets require RESET to be asserted for at 
least 1 ms after Vcc and elK 
are stable. 


• 


Table 14.4.266 
MHz Intel486 
DX2 Microprocessor 
A.C. Characteristics 


VCC = 5V ±5%; 
Tcase = O°C to + 85°C; CI = 50 pF unless otherwise 
specified 
(Note 2) 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


Frequency 
8 
33 
MHz 
1X Clock Driven to Intel486 
DX2 


t1 
ClK 
Period 
30 
125 
ns 
14.1 


t1a 
ClK 
Period Stability 
0.1% 
11 
Adjacent 
Clocks 


t2 
ClK 
High Time 
11 
ns 
14.1 
at2V 


t3 
ClK 
low Time 
11 
ns 
14.1 
atO.8V 


t4 
ClK 
Fall Time 
3 
ns 
14.1 
2V to 0.8V 


t5 
ClK 
Rise Time 
3 
ns 
14.1 
0.8Vto~2~ 


t6 
A2-A31, 
PWT, PCD, BEO-3#, 
3 
14 
ns 
14.5 
(Not~- 


M/IO#, 
D/C#, 
W/R#, 
ADS#, 
lOCK#,FERR#,BREQ, 
HlDA 
,.~ 
tr~1 
Valid Delay 


t7 
A2-A31, 
PWT, PCD, BEO-3#, 
201 


~ 


1~S 
t'<f.lote 1) 
M/IO#, 
D/C#, 
W/R#, 
ADS#, 
~ 


lOCK # Float Delay 
\..'< ~. 
~ 


t8 
PCHK# 
Valid Delay 
.a-." ':"\ 
~. 
~,of4.~ 
~~ote3) 


t8a 
BLAST #, PLOCK # Valid Delay 
.(; ~. 
14 
ns 
~ 


(Note 3) 


t9 
BLAST#, 
PlOCK# 
Float Del~y~ 
~~ 
~~ 
~~s 
(Note 1) 


t10 
00-031, 
DPO-3 Write DaJ!/J}ttI' 


~ ~~~: 
~ 


14.5 
(Note 3) 
Delay 
~ 


tll 
00-031, 
DPO-3 wr_))4fa"Flo~~ 
'~ 
~1.·s 
14.6 
(Note 1) 
Delay 
,. 
.•••. 
~. 
'" 


t12 
EADS # Setup Time 
A""'r/' AP\i J' 
ns 
14.2 


t13 
EADS # Hold Time 
~),;~ 
~ 
ns 
14.2 


t14 
KEN#, 
BS16#, 
BS8~P 
~ 
~5 
ns 
14.2 


t15 
KEN#, 
BS16#, 
ilS8# 
t1oldTim1Y 
3 
ns 
14.2 


t16 
RDY#, 
BRD [1"', ~pTime 
5 
ns 
14.3 


t17 
RDY#, 
Bf •.,.. ~ 
old Time 
3 
ns 
14.3 


t18 
HOLD, AHatD, 
Setup Time 
6 
ns 
14.2 


t18a 
BOFF # Setup Time 
7 
ns 
14.2 


t19 
HOLD, AHOlD, 
BOFF # Hold Time 
3 
ns 
14.2 


t20 
RESET, FlUSH#, 
A20M#, 
NMI, 
5 
ns 
14.2 
(Note 4) 
INTR, IGNNE# 
Setup Time 


t21 
RESET, FLUSH#, 
A20M#, 
NMI, 
3 
ns 
14.2 
(Note 4) 
INTR, IGNNE# 
Hold Time 


t22 
00-031, 
DPO-3, 
A4-A31 
Read 
5 
ns 
14.2,14.3 
Setup Time 


t23 
00-031, 
DPO-3, A4-A31 
Read 
3 
ns 
14.2,14.3 
Hold Time 


NOTES: 
1. Not 100% tested. Guaranteed by design characterization. 
2. All timing specifications assume CL = 50 pF. Charts 14.4.3 provides the charts that may be used to determine the delay 
due to derating, depending on the lumped capacitive loading, that must be added to these specification values. 
3. The minimum Intel486 DX2 output valid delays are hold times provided to external circuitry. 
4. A reset pulse width of 15 ClK cycles is required for warm resets. Power-up resets require RESET to be asserted for at 
least 1 ms after Vcc and ClK are stable. 


Vcc = 5V ± 5%, Tcase = O·C to +85·C, Cl = 0 pF 
All Inputs and Outputs are TTL Level (Note 4) 


Symbol 
Parameter 
Min 
Max 
Unit 
Figure 
Notes 


t24 
TCK Frequency 
25 
MHz 
1x Clock 


t25 
TCK Period 
40 
ns 
(Note 2) 


t26 
TCK High Time 
10 
ns 
at2.0V 


t27 
TCKLowTime 
10 
ns 
atO.8V 


t28 
TCK Rise Time 
4 
ns 
(Note 1) 


t29 
TCKFaliTime 
4 
ns 
(Note 1) 


t30 
TDI, TMS Setup Time 
8 
ns 
14.7 
(Note 3) 


t31 
TDI, TMS Hold Time 
7 
ns 
14.7 
(Note 3) 


t32 
TOO Valid Delay 
3 
25 
ns 
14.7 
(Note 3) 


t33 
TOO Float Delay 
TBD 


t34 
All Outputs 
(Non-Test) 
Valid Delay 
3 
25 
ns 
14.7 
(Note 3) 


t35 
All Outputs 
(Non-Test) 
Float Delay 
36 
ns 
14.7 
(Note 3) 


t36 
All Inputs (Non-Test) 
Setup Time 
8 
ns 
14.7 
(Note 3) 


t37 
All Inputs (Non-Test) 
Hold Time 
7 
ns 
14.7 
(Note 3) 


NOTES: 
1. Rise/Fall times are measured between 0.8V and 2.0V. Rise/Fall times can be relaxed by 1 ns per 10 ns increase in TCK 
period. 
2. TCK period ~ ClK period. 
3. Parameter measured from TCK. 
4. Boundary Scan A.C. Specifications in the above table are target values. They have not been characterized. Therefore 
they are subject to change. 
• 


ClK [ 


EADS# [ 


9S8#,9S16#, 
[ 
KEN# 


90FF#, 
AHOlD, [ 
HOLD 


RESET, FlUSH#, [ 
A20M#, 
IGNNE#, 


INTR, 
NMI 
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14.4.1 
TYPICAL 
OUTPUT 
VALID 
DELAY 
VERSUS 
LOAD 
CAPACITANCE 
UNDER 
WORST 
CASE 
CONDITIONS 
FOR THE 50 MHz AND 66 MHz INTEL486 
DX2 CPU 
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C1. (picoforad.) 
NOTE: 
This graph will not be linear outside 
of the CL range 
shown. 


nom = nominal 
value 
given 
in A.C. Characteristics 
table. 


14.4.2 
TYPICAL 
OUTPUT 
RISE TIME 
VERSUS 
LOAD CAPACITANCE 
UNDER 
WORST·CASE 
CONDITIONS 
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NOTE: 
This graph 
will not be linear outside 
of the CL range 
shown. 
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14.4.3.8 
TYPICAL 
LOADING 
DELAY 
VERSUS 
CAPACITIVE 
LOADING 
UNDER 
WORST·CASE 
CONDITIONS 
FOR A HIGH TO LOW TRANSITION 
ON THE INTEL486 
DX2 CPU 


Loading 
Delay 
2 
(no) 


14.4.3.b 
TYPICAL 
LOADING 
DELAY 
VERSUS 
CAPACITIVE 
LOADING 
UNDER 
WORST·CASE 
CONDITIONS 
FOR A LOW TO HIGH TRANSITION 
ON THE INTEL486 
DX2 CPU 


Loading 
Delay 
0 5 
(no) 
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~:~~ REf. 
.50 CHAMfER 


(INDEX CORNER) 


SWAGGED 
PIN 
(. 
PL) 


SEATING 
PLANE 


~B (ALL 
PINS) 
Lt 


SWAGGED 


PIN 


DETAIL 


Family: Ceramic 
Pin Grid Array Package 


Symbol 
Mlllimetera 
Inches 


Mln 
Max 
Notes 
Mln 
Max 
Notes 


A 
3.56 
4.57 
0.140 
0.180 


A1 
0.64 
1.14 
SOLID LID 
0.025 
0.045 
SOLID LID 


A2 
2.8 
3.5 
SOLID LID 
0.110 
0.140 
SOLID LID 


Aa 
1.14 
1.40 
0.045 
0.055 


B 
0.43 
0.51 
0.017 
0.020 


0 
44.07 
44.83 
1.735 
1.765 


01 
40.51 
40.77 
1.595 
1.605 


e1 
2.29 
2.79 
0.090 
0.110 
l 
2.54 
3.30 
0.100 
0.130 


N 
168 
168 


Sl 
1.52 
2.54 
0.060 
0.100 


Letter 
or 
Description 
of Dimensions 
Symbol 


A 
Distance 
from seating plane to highest point of body 


A1 
Distance 
between 
seating plane and base plane (lid) 


A2 
Distance 
from base plane to highest point of body 


A3 
Distance 
from seating plane to bottom of body 


B 
Diameter 
of terminal 
lead pin 


D 
Largest overall package 
dimension 
of length 


D1 
A body length dimension, 
outer lead center to outer lead center 


e1 
Linear spacing 
between 
true lead position 
centerlines 


L 
Distance 
from seating plane to end of lead 


S1 
Other body dimension, 
outer lead center to edge of body 


NOTES: 
1. Controlling dimension: millimeter. 
2. Dimension "e1" ("e") is non-cumulative. 
3. seating plane (standoff) is defined by P.C. board hole size: 0.0415-0.0430 inch. 
4. Dimensions "S", "S1" and "c" 
are nominal. 
5. Details of Pin 1 identifier are optional. 


15.1 Package Thermal Specifications 


The Intel486 
DX2 microprocessor 
is specified 
for op- 
eration when TC (the case temperature) 
is within the 
range of 0'C-85'C. 
TC may be measured 
in any en- 
vironment 
to determine 
whether 
the 
Intel486 
DX2 
microprocessor 
is within 
specified 
operating 
range. 
The 
case 
temperature 
should 
be measured 
at the 
center 
of the top surface 
opposite 
the pins. 


The ambient 
temperature 
(TA) is guaranteed 
as long 
as TC is not violated. 
The ambient 
temperature 
can 
be calculated 
from 
8JC and 8JA from the following 
equations. 


TJ = Te + P' 
IIJe 


TA = TJ - 
P' 
IIJA 


TA = Te - 
(P • ileA> 


Te = TA + P' 
[IIJA - 
IIJcl 


where 
TJ, TA. TC = Junction, 
Ambient 
and Case 
Temperature 
respectively. 
8JC, 8JA = Junction-to- 
Case and Junction-to-Ambient 
Thermal 
Resistance, 
respectively. 


The values 
for 8JA and 8JC are given in Table 
13.2 
for the 1.75 sq. in., 168-pin, 
ceramic 
PGA. 


Table 13.3 shows the TA allowable 
(without 
exceed- 
ing Tc> at various airflows 
and operating 
frequencies 


(fCLK)· 


Note that TA is greatly 
improved 
by attaching 
"fins" 
or a "heat 
sink" 
to the package. 
P (the 
maximum 
power consumption) 
is calculated 
by using the maxi- 
mum Icc at 5V as tabulated 
in the DC Characteris- 
tics of Section 
14. 


8CA va Alrflow-ft/mln 
(m/sec) 


8JC 
0 
200 
400 
600 
800 
1000 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 
(5.07) 
I 
With Heat Sink' 
2.5 
10.5 
7.0 
4.5 
3.5 
3.0 
2.5 
I 
Without 
Heat Sink 
2.0 
16 
14.0 
10.5 
9.0 
8.0 
7.5 
.. 


• 
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00000000000 
00000000000 
00000000000 
00000000000 
00000000000 
00000000000 
00000000000 
00000000000 
00000000000 
00000000000 
00000000000 


I 


I 


I 


I 


I 


I 


0.079,,-1 I- 


1 
rO.063" 
0.060"1 r 
~J' 


Alrflow-ft/mln 
(m/sec) 


0 
200 
400 
600 
800 
1000 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 
(5.07) 


TA with Heat Sink 
50 MHz 
32.6 
50.0 
62.5 
67.5 
70.0 
72.5 


66 MHz 
18.9 
40.9 
56.7 
62.9 
66.1 
69.3 


TA without Heat Sink 
50 MHz 
5.2 
15.1 
32.6 
40.1 
45.1 
47.6 


66 MHz 
-15.8 
-3.2 
18.9 
28.3 
34.6 
37.8 


16.0 SUGGESTED 
SOURCES FOR 
INTEL486™ DX2 ACCESSORIES 


Following are some suggested sources of accesso- 
ries for the Intel486 OX2. They are not an endorse- 
ment of any kind, nor a warranty of the performance 
of any of the listed products and/or companies. 


1. McKenzie Technology 
44370 Old Palmspring Blvd. 
Fremont, CA 
94538 


Tel: (415) 651-2700 
2. E-CAM Technology, Inc. 
14455 North Hayden Rd. 
Suite 208 
Scottsdale, AZ 
85260 


Tel: (602) 443-1949 
3. Augat Inc. (for sockets with decaps) 
Interconnection Products Group 
33 Perry Ave. 
P.O. Box 779 
Attleboro, MA 
02703 
Tel: (508) 222-2202 


Heat Sinks/Fins 


1. AAVIO Engineering, Inc. 


One Kool Path 
P.O. Box 400 
Laconia, NH 
03247 
Tel: (603) 528-3400 


TTL Crystals/Oscillators 


1. NFL Frequency Controls, Inc. 
357 Beloit Street 
Burlington, WI 
53105 
Tel: (414) 763-3591 
2. M-Tron 
P.O. Box 630 
Yankton, SO 
57078 
Tel: (605) 665-9321 


Debugging 
Tower 


1. Emulation Technology 
2344 Walsh Ave., Building F 
Santa Clara, CA 
95051 
Tel: (408) 982-0664 
• 


Revision -003 of the Intel486 DX2 Microprocessor 
Data Book contains many updates and improve- 
ments to the original version. A revision summary of 
major changes is listed below: 


PGA Pin Table Pins B10 and C12 are no-connects 
Section 6.5 
Added clarification for the built in 
self test (BIST) during reset. 


Section 7.2.9 
Added explanation of bus hold and 
hold acknowledge protocol. 


Figure 7.26b 
Added figure to iIIusrate HOLD reo 
quest acknowledge during BOFF(; . 
Section 12.0 
Replaced references to the 
Up- 
grade 
Socket 
with 
the 
Pentium 
OverDrive Processor Socket. 


Section 12.2.2 Modified section to reflect changes 
in physical dimensions of heat sink 
unit. 


Section 12.2.3 Modified End User Easy section. 
Section 12.2.4 Changed section to only show ZIF 
socket vendors. 
Section 12.3 
Modified section to reflect maximum 
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DID 
VCC 
VSS 
E 
0 
0 
0 
0 
0 
0 


0 
BOrr- 
BSS- 
A2011- 
017 
013 
09 
0 
0 
0 
0 
0 
0 
0 


C 
BS16- 
RESET 
rLUSH- 
FERR- 
HC 
HC 
HC 
HC 
030 
02S 
026 
027 
VCC 
VCC 
CLK 
01S 
011 
C 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


B 
EAOS- 
HC 
Hili 
HC 
HC 
HC 
VCC 
HC 
VCC 
031 
VCC 
025 
VSS 
VSS 
VSS 
021 
019 
B 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


A 
AHOLO 
IHTR 
IGHHE- 
HC 
HC 
HC 
vss 
HC 
VSS 
029 
VSS 
024 
OP3 
023 
HC 
022 
020 
A 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


17 
16 
15 
14 
13 
12 
11 
10 
9 
8 
7 
6 
5 
4 
3 
2 
1 
240440-3 


A 
tI 
l,; 
U 
I:. 
~ 
(; 
H 
J 
K 
L 
t.4 
N 
P 
a 
R 
S 


1 
/020 
019 
011 
09 
VSS 
OPI 
VSS 
VSS 
vcc 
vss 
VSS 
VSS 
02 
DO 
A31 
A28 
A27 
1 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


2 
022 
021 
018 
013 
vcc 
08 
vcc 
03 
05 
vcc 
06 
vcc 
01 
A29 
VSS 
A25 
A26 
2 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


3 
TCK 
VSS 
CLK 
017 
010 
015 
012 
OP2 
016 
014 
07 
04 
OPO 
A30 
A17 
VCC 
A23 
3 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


4 
023 
VSS 
vcc 
A19 
VSS 
He 
4 
0 
0 
0 
0 
0 
0 


5 
OP3 
VSS 
vcc 
A21 
A18 
A14 
5 
0 
0 
0 
0 
0 
0 


6 
024 
025 
027 
A24 
vcc 
VSS 
6 
0 
0 
0 
0 
0 
0 


7 
vss 
vcc 
026 
A22 
A15 
A12 
7 
0 
0 
0 
0 
0 
0 


8 
029 
031 
028 
A20 
VCC 
VSS 
8 
0 
0 
0 
Intel486 
Tt.4 t.4ICROPROCESSOR 
0 
0 
0 


9 
vss 
vcc 
030 
A16 
vcc 
VSS 
9 
0 
0 
0 
50 t.4Hz VERSION 
0 
0 
0 


10 
NC 
NC 
NC 
PIN SIDE VIEW 
A13 
vcc 
VSS 
10 
0 
0 
0 
0 
0 
0 


11 
vss 
vcc 
NC 
A9 
VCC 
VSS 
1 1 
0 
0 
0 
0 
0 
0 


12 
NC 
NC 
NC 
AS 
All 
vss 
12 
0 
0 
0 
0 
0 
0 


13 
Ne 
NC 
NC 
A7 
A8 
Al0 
13 
0 
0 
0 
0 
0 
0 


14 
TOt 
T"S 
fERR- 
A2 
VCC 
vss 
14 
0 
0 
0 
0 
0 
0 


15 
IGNNE- 
N••I 
flUSH- 
AlO ••- 
HOLO 
KEN- 
NC 
BROY- 
BE2- 
BEO- 
PWT 
o/c- 
LOCK- 
HLOA 
BREQ 
A3 
A6 
15 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


16 
INTR 
TDO 
RESET 
BS8- 
VCC 
ROY- 
VCC 
VCC 
BEl- 
VCC 
VCC 
VCC 
••/10- 
VCC 
PLOCK- 
BLAST- 
A4 
16 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


17 
AHOLD EADS- 
BS16_ 
BOff- 
VSS 
BE3- 
VSS 
VSS 
PCO 
VSS 
VSS 
VSS 
W/R_ 
VSS 
PCHK_ 
NC 
AOS_ 
17 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


A 
B 
C 
D 
E 
F 
G 
H 
J 
K 
L 
t.4 
N 
P 
a 
R 
S 
240440-85 


•• 


intel~ 


S 
R 
Q 
P 
N 
M 
L 
K 
J 
H 
G 
F 
E 
D 
C 
B 
A 


1 
"27 
"28 
"31 
00 
02 
YSS 
YSS 
YSS 
YCC 
YSS 
YSS 
DPI 
YSS 
D9 
011 
019 
02~ 
1 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


2 
"26 
"25 
YSS 
"29 
01 
YCC 
06 
VCC 
05 
03 
VCC 
08 
YCC 
013 
018 
021 
022 
2 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


3 
"23 
YCC 
"17 
"30 
OPO 
04 
07 
014 
016 
OP2 
012 
015 
DID 
017 
ClK 
VSS 
TCK 
3 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


4 
He 
VSS 
"19 
VCC 
VSS 
023 
4 
0 
0 
0 
0 
0 
0 


5 
A14 
A18 
A21 
VCC 
VSS 
OP3 
5 
0 
0 
0 
0 
0 
0 


6 
vss 
VCC 
A24 
027 
025 
024 
6 
0 
0 
0 
0 
0 
0 


7 
"12 
"15 
A22 
026 
VCC 
VSS 
7 
0 
0 
0 
0 
0 
0 


8 
YSs 
VCC 
A20 
028 
031 
029 
8 
0 
0 
0 
Inte1486Tt.4 
MICROPROCESSOR 
0 
0 
0 


9 
vss 
VCC 
A16 
030 
VCC 
VSS 
9 
0 
0 
0 
50 MHZ VERSION 
0 
0 
0 


10 
vss 
VCC 
A13 
TOP SIDE VIEW 
HC 
HC 
He 
10 
0 
0 
0 
0 
0 
0 


11 
vss 
VCC 
A9 
HC 
VCC 
vss 
1 1 
0 
0 
0 
0 
0 
0 


12 
VSS 
"11 
A5 
HC 
HC 
He 
12 
0 
0 
0 
0 
0 
0 


13 
AID 
"8 
A7 
HC 
HC 
HC 
13 
0 
0 
0 
0 
0 
0 


14 
vss 
YCC 
A2 
FERR" 
Tlls 
TOI 
14 
0 
0 
0 
0 
0 
0 


15 
"S 
A3 
BREQ 
HlOA 
lOCI<" 
o/c" 
PWT 
BED" 
BE2" 
BROY" 
HC 
KEN" 
HOLD 
A20ll" 
FLUSH" 
NIlI 
IGHNE" 
15 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


16 
A4 
II.AST" 
PLOCK" 
VCC 
11/10" 
YCC 
VCC 
VCC 
BEl" 
VCC 
YCC 
ROY" 
YCC 
BsS" 
RESET 
TOO 
IHTR 
16 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


17 
ADS" 
NC 
PCHK" 
VSS 
W/R" 
VSS 
Yss 
YsS 
PCO 
VSS 
VSS 
BE3" 
VSS 
1lOfF" 
8516" 
EADS" 
AHOLD 
17 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


S 
R 
Q 
P 
N 
M 
L 
K 
J 
H 
G 
F 
E 
D 
C 
B 
A 
240440-86 


Address 
Qata 
Control 
Test 
N/C 
Vcc 
Vss 
(50 MHz Only) 


A2 
014 
Do 
P1 
A20M# 
015 
TCK 
A3 
A3(1) 
B7 
A7 
A3 
R15 
01 
N2 
A05# 
517 
TOI 
A14 
A10 
B9 
A9 
At 
516 
02 
N1 
AHOLO 
A17 
TOO 
B16 
A12 
B11 
A11 
A5 
012 
03 
H2 
BEO# 
K15 
TM5 
B14 
A13 
C4 
B3 
A6 
515 
04 
M3 
BE1# 
J16 
A14(1) 
C5 
B4 
A7 
013 
05 
J2 
BE2# 
J15 
B10 
E2 
B5 
A8 
R13 
06 
L2 
BE3# 
F17 
B12 
E16 
E1 
A9 
011 
07 
L3 
BLA5T# 
R16 
B13 
G2 
E17 
AlO 
513 
08 
F2 
BOFF# 
017 
B14(1) 
G16 
G1 
A11 
R12 
09 
01 
BROY# 
H15 
B16(1) 
H16 
G17 
A12 
57 
010 
E3 
BREO 
015 
C10 
J1 
H1 
A13 
010 
011 
C1 
B58# 
016 
C11 
K2 
H17 
A14 
55 
012 
G3 
B516# 
C17 
C12 
K16 
K1 
A15 
R7 
013 
02 
CLK 
C3 
C13 
L16 
K17 
A16 
09 
014 
K3 
O/C# 
M15 
G15 
M2 
L1 
A17 
03 
015 
F3 
OPO 
N3 
R17 
M16 
L17 
A18 
R5 
016 
J3 
OP1 
F1 
54 
P16 
M1 
A19 
04 
017 
03 
OP2 
H3 
R3 
M17 
A20 
08 
018 
C2 
OP3 
A5 
R6 
P17 
A21 
05 
019 
B1 
EA05# 
B17 
R8 
02 
A22 
07 
020 
A1 
FERR# 
C14 
R9 
R4 
A23 
53 
021 
B2 
FLU5H# 
C15 
R10 
56 
A24 
06 
022 
A2 
HLOA 
P15 
R11 
58 
A25 
R2 
023 
A4 
HOLD 
E15 
R14 
59 
A26 
52 
024 
A6 
IGNNE# 
A15 
510 
A27 
51 
025 
B6 
INTR 
A16 
511 
A28 
R1 
026 
C7 
KEN# 
F15 
512 
A29 
P2 
027 
C6 
LOCK # 
N15 
514 
A30 
P3 
028 
C8 
M/IO# 
N16 
A31 
01 
029 
A8 
NMI 
B15 
030 
C9 
PCO 
J17 
031 
B8 
PCHK# 
017 
PWT 
L15 
PLOCK# 
016 
ROY# 
F16 
RE5ET 
C16 
W/R# 
N17 


NOTE: 
1. These 
pins are no longer 
No-Connects 
on the 50 MHz version. 


• 
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Vee 
A2 
ADS- 
BLAST- 
PLOCK- 
LOCK_ 
Vss 
He 
PeHK- 
BRDY- 
8OFF- 
BSIU 
9S8- 
NC 
ROY- 
KEN- 
Vee 
HOLD 
AHOLD 
TCK 
NC 
Vss 
Vee 
Ne 
eLK 
HLDA 


~~R. 
Vee 
BREQ 
BEO- 
BE1* 
BE2- 
VSS 
BE3- 
~5lio- 
ole- 
Vss 
PWT 
Vee 
PeD 
EADS- 
A20W- 
RESET 
FLUSH- 
INTR 
NWI 
Vss 


,. . . 
Pin 
Signal 
Pin 
Signal 
Pin 
Signal 
Pin 
Signal 


1 
VSS 
50 
VSS 
99 
VSS 
148 
VSS 
2 
A21 
51 
021 
100 
NMI 
149 
NC 
3 
A22 
52 
NS 
101 
INTR 
150 
A3 
4 
A23 
53 
022 
102 
FLUSH # 
151 
NC 
5 
A24 
54 
VCC 
103 
RESET 
152 
A4 
6 
Vcc 
55 
023 
104 
A20M# 
153 
NC 
7 
A25 
56 
NC 
105 
EAOS# 
154 
A5 
8 
A26 
57 
OP3 
106 
PCO 
155 
NC 
9 
A27 
58 
VSS 
107 
VCC 
156 
UP# 
10 
A28 
59 
024 
108 
PWT 
157 
NC 
11 
VSS 
60 
NC 
109 
VSS 
158 
As 
12 
A29 
61 
025 
110 
O/C# 
159 
A7 
13 
A30 
62 
VCC 
111 
M/IO# 
160 
NC 
14 
A31 
63 
026 
112 
VCC 
161 
A8 
15 
NC 
64 
NC 
113 
BE3# 
162 
NC 
16 
OPO 
65 
027 
114 
VSS 
163 
A9 
17 
Do 
66 
VSS 
115 
BE2# 
164 
VCC 
18 
01 
67 
028 
116 
BE1# 
165 
Al0 
19 
VCC 
68 
NC 
117 
BEO# 
166 
NC 
20 
02 
69 
029 
118 
BREQ 
167 
VSS 
21 
VSS 
70 
VCC 
119 
VCC 
168 
VSS 
22 
VSS 
71 
030 
120 
W/R# 
169 
NC 
23 
03 
72 
NC 
121 
VSS 
170 
VCC 
24 
VCC 
73 
NC 
122 
HLOA 
171 
NC 
25 
04 
74 
031 
123 
CLK 
172 ' 
All 
26 
05 
75 
NC 
124 
NC 
173 
NC 
27 
06 
76 
NC 
125 
VCC 
174 
A12 
28 
VCC 
77 
IGNNE# 
126 
VSS 
175 
VCC 
29 
07 
78 
NC 
127 
NC 
176 
A13 
30 
OP1 
79 
NC 
128 
TCK 
177 
VSS 
31 
08 
80 
TOO 
129 
AHOLO 
178 
A14 
32 
09 
81 
FERR# 
130 
HOLD 
179 
VCC 
33 
VSS 
82 
NC 
131 
VCC 
180 
A15 
34 
NC 
83 
NC 
132 
KEN# 
181 
A16 
35 
010 
84 
VCC 
133 
ROY# 
182 
VSS 
36 
VCC 
85 
NC 
134 
NC 
183 
A17 
37 
011 
86 
VSS 
135 
BS8# 
184 
VCC 
38 
012 
87 
NC 
136 
BS16# 
185 
TOI 
39 
013 
88 
NC 
137 
BOFF# 
186 
NC 
40 
VSS 
89 
NC 
138 
BROY# 
187 
TMS 
41 
014 
90 
NC 
139 
PCHK# 
188 
NC 
42 
015 
91 
NC 
140 
NC 
189 
A18 
43 
OP2 
92 
NC 
141 
VSS 
190 
NC 
44 
016 
93 
VCC 
142 
LOCK# 
191 
A19 
45 
017 
94 
NC 
143 
PLOCK# 
192 
NC 
46 
018 
95 
Vss 
144 
BLAST # 
193 
A20 
47 
019 
96 
VSS 
145 
AOS# 
194 
VSS 
48 
020 
97 
NC 
146 
A2 
195 
NC 
49 
VCC 
98 
Vcc 
147 
VCC 
196 
VCC 
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Address 
Data 
Control 
HC 
Vcc 
Vss 


A2 
146 
Do 
17 
A20M# 
104 
15 
6 
1 
A3 
150 
01 
18 
AOS# 
145 
34 
19 
11 
A4 
152 
02 
20 
AHOLO 
129 
52 
24 
21 
A5 
154 
03 
23 
BEO# 
117 
56 
28 
22 
Ae 
158 
04 
25 
BE1# 
116 
60 
36 
33 
A7 
159 
05 
26 
BE2# 
115 
64 
49 
40 
A8 
161 
06 
27 
BE3# 
113 
68 
54 
50 
A9 
163 
07 
29 
BLAST # 
144 
72 
62 
58 
AlO 
165 
08 
31 
BOFF# 
137 
73 
70 
66 
A11 
172 
09 
32 
BROY# 
138 
75 
84 
86 
A12 
174 
010 
35 
BREQ 
118 
76 
93 
95 
A13 
176 
011 
37 
BS# 
135 
78 
98 
96 
A14 
178 
012 
38 
BS16# 
136 
79 
107 
99 
A15 
180 
013 
39 
CLK 
123 
82 
112 
109 
A16 
181 
014 
41 
O/C# 
110 
83 
119 
114 
A17 
183 
015 
42 
OPO 
16 
85 
125 
121 
A18 
189 
016 
44 
OP1 
30 
87 
131 
126 
A19 
191 
017 
45 
OP2 
43 
88 
147 
141 
A20 
193 
018 
46 
OP3 
57 
89 
164 
148 
A21 
2 
019 
47 
EAOS# 
105 
90 
:170 
167 
A22 
3 
020 
48 
FERR# 
81 
91 
175 
168 
A23 
4 
021 
51 
FLUSH# 
102 
92 
179 
177 
A24 
5 
022 
53 
HLOA 
122 
94 
184 
182 
A25 
7 
023 
55 
HOLD 
130 
97 
196 
194 
A26 
8 
024 
59 
IGNNE# 
77 
124 
A27 
9 
025 
61 
INTR 
101 
127 
A28 
10 
026 
63 
KEN# 
132 
134 
A29 
12 
027 
65 
LOCK# 
142 
140 
A30 
13 
028 
67 
M/IO# 
111 
149 
A31 
14 
029 
69 
NMI 
100 
151 
030 
71 
PCO 
106 
153 
031 
74 
PHCK# 
139 
155 
PWT 
108 
157 
PLOCK# 
143 
160 
ROY# 
133 
162 
RESET 
103 
166 
TOI 
185 
169 
TOO 
80 
171 
TMS 
187 
173 
W/R# 
120 
186 
188 
190 
192 
195 
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Symbol 
Type 
Name and Function 


CLK 
I 
Clock provides 
the fundamental 
timing and the internal operating 
frequency 
for the 
Intel486 
Microprocessor. 
All external timing parameters 
are specified 
with respect 
to 
the rising edge of CLK. 


ADDRESS 
BUS 


A31-A4 
1/0 
A31-A2 
are the address lines of the microprocessor. 
A31-A2, 
together 
with the byte 
A2-A3 
0 
enables 
BEO # -BE3 #, define the physical 
area of memory or input! output space 
accessed. 
Address 
lines A31-A4 
are used to drive addresses 
into the microprocessor 
to perform 
cache line invalidations. 
Input signals must meet setup and hold times t22 
and t23' A31-A2 
are not driven during bus or address 
hold. 


BEO-3# 
0 
The byte enable signals indicate 
active bytes during read and write cycles. Ouring the 
first cycle of a cache fill, the external system should assume that all byte enables 
are 
active. BE3# 
applies to 024-031, 
BE2# 
applies to 016-023, 
BE1 # applies to 08- 


016 and BEO# applies to 00-07. 
BEO#-BE3# 
are active LOW and are not driven 
during bus hold. 


DATA 
BUS 


031-00 
I/O 
These are the data lines forthe 
Intel486 
Microprocessor. 
Lines 00-07 
define the least 


significant 
byte of the data bus while lines 024-031 
define the most significant 
byte of 
the data bus. These signals must meet setup and hold times t22 and t23 for proper 
operation 
on reads. These pins are driven during the second and subsequent 
clocks of 


write cycles. 


DATA 
PARITY 


OPO-OP3 
I/O 
There is one data parity pin for each byte of the data bus. Oata parity is generated 
on all 
write data cycles with the same timing as the data driven by the Intel486 
Microprocessor. 
Even parity information 
must be driven back into the microprocessor 


on the data parity pins with the same timing as read information 
to insure that the 
correct 
parity check status is indicated 
by the Intel486 
microprocessor. 
The signals 
read on these pins do not affect program 
execution. 
Input signals must meet setup and hold times t22 and t23' OPO-OP3 
should be 
connected 
to Vcc through a pullup resistor 
in systems which do not use parity. 


OPO-OP3 
are active HIGH and are driven during the second and subsequent 
clocks of 
write cycles. 


PCHK# 
0 
Parity Status is driven on the PCHK'If 
pin the clock after ready for read operations. 
The 
parity status is for data sampled 
at the end of the previous 
clock. A parity error is 
indicated 
by PCHK # being LOW. Parity status is only checked 
for enabled 
bytes as 
indicated 
by the byte enable and bus size signals. PCHK # is valid only in the clock 
immediately 
after read data is returned 
to the microprocessor. 
At all other times 


PCHK # is inactive 
(HIGH). PCHK # is never floated. 
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Symbol 
Type 
Name and Function 


BUS CYCLE DEFINITION 


M/IO# 
a 
The memory/input-output, 
datalcontrol 
and write/read 
lines are the primary bus 
O/C# 
a 
definition 
signals. These signals are driven valid as the AOS# 
signal is asserted. 


W/R# 
a 
M/IO# 
D/C# 
W/R# 
Bus Cycle Initiated 


0 
0 
0 
Interrupt Acknowledge 
0 
0 
1 
Halt/Special 
Cycle 
0 
1 
0 
I/O Read 
0 
1 
1 
I/O Write 
1 
0 
0 
Code Read 
1 
0 
1 
Reserved 
1 
1 
0 
Memory Read 
1 
1 
1 
Memory Write 


The bus definition 
signals are not driven during bus hold and follow the timing of the 
address 
bus. Refer to Section 
7.2.11 for a description 
of the special bus cycles. 


LOCK# 
a 
The bus lock pin indicates 
that the current bus cycle is locked. The Intel486 
Microprocessor 
will not allow a bus hold when LOCK # is asserted 
(but address 
holds 
are allowed). 
LOCK # goes active in the first clock of the first locked bus cycle and goes 
inactive after the last clock of the last locked bus cycle. The last locked cycle ends 
when ready is returned. 
LOCK # is active LOW and is not driven during bus hold. Locked 
read cycles will not be transformed 
into cache fill cycles if KEN # is returned 
active. 


PLOCK# 
a 
The pseudo-lock 
pin indicates 
that the current bus transaction 
requires more than one 
bus cycle to complete. 
Examples 
of such operations 
are floating 
point long reads and 
writes (64 bits), segment 
table descriptor 
reads (64 bits), in addition 
to cache line fills 
(128 bits). The Intel486 
Microprocessor 
will drive PLOCK # active until the addresses 
for 
the last bus cycle of the transaction 
have been driven regardless 
of whether 
ROY # or 
BROY # have been returned. 
Normally 
PLOCK # and BLAST # are inverse of each other. However 
during the first bus 
cycle of a 64-bit floating 
point write, both PLOCK # and BLAST # will be asserted. 


PLOCK# 
is a function 
of the BS8#, 
BS16# 
and KEN# 
inputs. PLOCK# 
should be 
sampled 
only in the clock ready is returned. 
PLOCK # is active LOW and is not driven 
during bus hold. 


BUS CONTROL 


AOS# 
a 
The address status output indicates 
that a valid bus cycle definition 
and address are 
available 
on the cycle definition 
lines and address bus. AOS# 
is driven active in the 
same clock as the addresses 
are driven. AOS# 
is active LOW and is not driven during 
bus hold. 


ROY# 
I 
The non-burst 
ready input indicates 
that the current bus cycle is complete. 
ROY # 
indicates 
that the external 
system has presented 
valid data on the data pins in response 
to a read or that the external 
system has accepted 
data from the Intel486 
Microprocessor 
in response 
to a write. ROY # is ignored when the bus is idle and at the 
end of the first clock of the bus cycle. 
ROY # is active during address 
hold. Data can be returned 
to tile processor 
while 
AHOLO is active. 
ROY # is active LOW, and is not provided 
with an internal pullup resistor. 
ROY # must 
satisfy setup and hold times t16 and t17 for proper chip operation. 


Symbol 
Type 
Name and Function 


BURST CONTROL 


BRDY# 
I 
The burst ready input performs 
the same function 
during a burst cycle that ROY # 
performs 
during a non-burst 
cycle. BRDY # indicates 
that the external 
system has 
presented 
valid data in response 
to a read or that the external 
system has accepted 
data 
in response 
to a write. BRDY # is ignored when the bus is idle and at the end of the first 
clock in a bus cycle. 
BRDY # is sampled 
in the second and subsequent 
clocks of a burst cycle. The data 
presented 
on the data bus will be strobed 
into the microprocessor 
when BRDY # is 
sampled 
active. If ROY # is returned 
simultaneously 
with BRDY #, BRDY # is ignored 
and the burst cycle is prematurely 
aborted. 
BRDY # is active LOW and is provided with a small pullup resistor. 
BRDY # must satisfy 
the setup and hold times t16 and tH. 


BLAST# 
0 
The burst last signal indicates 
that the next time BRDY # is returned 
the burst bus cycle 
is complete. 
BLAST # is active for both burst and non-burst 
bus cycles. BLAST # is 
active LOW and is not driven during bus hold. 


INTERRUPTS 


RESET 
I 
The reset input forces the Intel486 
Microprocessor 
to begin execution 
at a known state. 


The microprocessor 
cannot 
begin execution 
of instructions 
until at least 1 ms after Vcc 
and CLK have reached 
their proper DC and AC specifications. 
The RESET pin should 
remain active during this time to insure proper microprocessor 
operation. 
RESET is 
active HIGH. RESET is asynchronous 
but must meet setup and hold times t20 and t21 for 
recognition 
in any specific 
clock. 


INTR 
I 
The maskable 
interrupt 
indicates 
that an external 
interrupt 
has been generated. 
If the 
internal interrupt 
flag is set in EFLAGS, active interrupt 
processing 
will be initiated. The 


Intel486 
Microprocessor 
will generate 
two locked interrupt 
acknowledge 
bus cycles in 
response 
to the INTR pin going active. INTR must remain active until the interrupt 
acknowledges 
have been performed 
to assure that the interrupt 
is recognized. 
INTR is active HIGH and is not provided 
with an internal 
pulldown 
resistor. 
INTR is 
asynchronous, 
but must meet setup and hold times t20 and t21 for recognition 
in any 
specific 
clock. 


NMI 
I 
The non-maskable 
interrupt 
request signal indicates 
that an external 
non-maskable 
interrupt 
has been generated. 
NMI is rising edge sensitive. 
NMI must be held LOW for at 
least four CLK periods before this rising edge. NMI is not provided 
with an internal 
pulldown 
resistor. 
NMI is asynchronous, 
but must meet setup and hold times t20 and t21 
for recognition 
in any specific 
clock. 


BUS ARBITRATION 


BREQ 
0 
The internal 
cycle pending 
signal indicates 
that the Intel486 
Microprocessor 
has 
internally 
generated 
a bus request. 
BREQ is generated 
whether 
or not the Intel486 
Microprocessor 
is driving the bus. BREQ is active HIGH and is never floated. 


HOLD 
I 
The bus hold request 
allows another 
bus master complete 
control 
of the Intel486 
Microprocessor 
bus. In response 
to HOLD going active the Intel486 
Microprocessor 
will 
float most of its output and input/output 
~ins. HLDA will be asserted 
after completing 
the 
current bus cycle, burst cycle or sequence 
of locked cycles. The Intel486 
Microprocessor 
will remain in this state until HOLD is deasserted. 
HOLD is active high 
and is not provided 
with an internal 
pulldown 
resistor. 
HOLD must satisfy setup and hold 
times t1B and t19 for proper operation. 


HLDA 
0 
Hold acknowledge 
goes active in response 
to a hold request presented 
on the HOLD 
pin. HLDA indicates 
that the Intel486 
microprocessor 
has given the bus to another 
local 
bus master. HLDA is driven active in the same clock that the Intel486 
Microprocessor 
floats its bus. HLDA is driven inactive when leaving bus hold. HLDA is active HIGH and 
remains driven during bus hold. 
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Symbol 
Type 
Name and Function 


BUS ARBITRATION 
(Continued) 


BOFF# 
I 
The backoff 
input forces the Intel486 
Microprocessor 
to float its bus in the next clock. 
The microprocessor 
will float all pins normally floated 
during bus hold but HLDA will not 
be asserted 
in response 
to BOFF #. BOFF # has higher priority than RDY # or BRDY #; 


if both are returned 
in the same clock, BOFF # takes effect. The microprocessor 
remains in bus hold until BOFF # is negated. 
If a bus cycle was in progress 
when 
BOFF # was asserted 
the cycle will be restarted. 
BOFF # is active LOW and must meet 
setup and hold times t18 and t19 for proper operation. 


CACHE INVALIDATION 


AHOLD 
I 
The address hold request allows another 
bus master access to the Intel486 
Microprocessor's 
address bus for a cache invalidation 
cycle. The Intel486 
Microprocessor 
will stop driving its address bus in the clock following 
AHOLD going 
active. Only the address 
bus will be floated 
during address 
hold, the remainder 
of the 
bus will remain active. AHOLD is active HIGH and is provided with a small internal 
pulldown 
resistor. 
For proper operation 
AHOLD 
must meet setup and hold times t18 and 
t19· 


EADS# 
I 
This signal indicates 
that a valid external address 
has been driven onto the Intel486 
Microprocessor 
address pins. This address will be used to perform 
an internal cache 
invalidation 
cycle. EADS # is active LOW and is provided with an internal 
pullup resistor. 


EADS# 
must satisfy setup and hold times t12 and t13 for proper operation. 


CACHE CONTROL 


KEN# 
I 
The cache enable pin is used to determine 
whether 
the current cycle is cacheable. 


When the Intel486 microprocessor 
generates 
a cycle that can be cached and KEN # is 
active one clock before RDY # or BRDY # during the first transfer 
of the cycle, the cycle 
will become 
a cache line fill cycle. Returning 
KEN # active one clock before RDY # 
during the last read in the cache line fill will cause the line to be placed in the on-chip 
cache. KEN # is active LOW and is provided with a small internal 
pullup resistor. 
KEN # 
must satisfy setup and hold times t14 and t15 for proper operation. 


FLUSH# 
I 
The cache flush input forces the Intel486 
Microprocessor 
to flush its entire internal 
cache. FLUSH # is active low and need only be asserted 
for one clock. FLUSH # is 
asynchronous 
but setup and hold times t20 and t21 must be met for recognition 
in any 
specific 
clock. FLUSH # being sampled 
low in the clock before the falling edge of 
RESET causes the Intel486 
Microprocessor 
to enter the tri-state test mode. 


PAGE CACHEABILITY 


PWT 
0 
The page write-through 
and page cache disable pins reflect the state of the page 
PCD 
0 
attribute 
bits, PWT and PCD, in the page table entry or page directory 
entry. If paging is 
disabled 
or for cycles that are not paged, PWT and PCD reflect the state of the PWT and 
PCD bits in control 
register 3. PWT and PCD have the same timing as the cycle definition 
pins (M/IO#, 
D/C# 
and W/R#). 
PWT and PCD are active HIGH and are not driven 
during bus hold. PCD is masked 
by the cache disable bit (CD) in Control 
Register 
O. 


NUMERIC 
ERROR REPORTING 


FERR# 
0 
The floating point error pin is driven active when a floating 
point error occurs. 
FERR # is 
similar to the ERROR # pin on the 387 math coprocessor. 
FERR # is included 
for 
compatibility 
with systems 
using DOS type floating 
point error reporting. 
FERR # will not 
go active if FP errors are masked in FPU register. 
FERR # is active LOW, and is not 
floated 
during bus hold. 


Symbol 
Type 
Name and Function 


NUMERIC 
ERROR REPORTING 
(Continued) 


IGNNE# 
I 
When the ignore numeric error pin is asserted 
the Intel486 
Microprocessor 
will ignore a 
numeric error and continue 
executing 
non-control 
floating 
point instructions, 
but FERR # 
will still be activated 
by the Inte1486. When IGNNE # is deasserted 
the Intel486 
microprocessor 
will freeze on a non-control 
floating 
point instruction, 
if a previous 
floating 
point instruction 
caused an error. IGNNE # has no effect when the NE bit in 
control 
register 0 is set. IGNNE # is active LOW and is provided with a small internal 
pullup resistor. 
IGNNE # is asynchronous 
but setup and hold times t20 and t21 must be 
met to insure recognition 
on any specific 
clock. 


BUS SIZE CONTROL 


8S16# 
I 
The bus size 16 and bus size 8 pins (bus sizing pins) cause the Intel486 
Microprocessor 
8S8# 
I 
to run multiple bus cycles to complete 
a request from devices that cannot 
provide or 
accept 32 bits of data in a single cycle. The bus sizing pins are sampled 
every clock. The 
state of these pins in the clock before ready is used by the Intel486 
microprocessor 
to 
determine 
the bus size. These signals are active LOW and are provided 
with internal 
pullup resistors. 
These inputs must satisfy setup and hold times t14 and t15 for proper 


operation. 


ADDRESS 
MASK 


A20M# 
I 
When the address bit 20 mask pin is asserted, 
the Intel486 
Microprocessor 
masks 
physical 
address bit 20 (A20) before performing 
a lookup to the internal cache or driving 
a memory cycle on the bus. A20M # emulates 
the address wraparound 
at one Mbyte 
which occurs on the 8086. A20M # is active LOW and should be asserted 
only when the 
processor 
is in real mode. This pin is asynchronous 
but should meet setup and hold 
times t20 and t21 for recognition 
in any specific 
clock. For proper operation, 
A20M # 
should be sampled 
high at the falling edge of RESET. 


TEST ACCESS 
PORT (50 MHz Version Only) 


TCK 
I 
Test Clock is an input to the Intel486 CPU and provides the clocking 
function 
required 
by 
the JT AG boundary 
scan feature. TCK is used to clock state information 
and data into 
and out of the component. 
State select information 
and data are clocked 
into the 
component 
on the rising edge of TCK on TMS and TOI, respectively. 
Data is clocked 
out 
of the part on the falling edge of TCK on TOO. 


TOI 
I 
Test Data Input is the serial input used to shift JTAG instructions 
and data into the 
component. 
TOI is sampled 
on the rising edge of TCK, during the SHIFT-IR 
and the 
SHIFT-DR 
TAP controller 
states. During all other tap controller 
states, TOI is a "don't 
care". 


TOO 
0 
Test Data Output 
is the serial output used to shift JTAG instructions 
and data out of the 
component. 
TOO is driven on the falling edge of TCK during the SHIFT-IR 
and 
SHIFT-DR 
TAP control,ler states. At all other times TOO is driven to the high impedance 
state. 


TMS 
I 
Test Mode Select 
is decoded 
by the JT AG TAP (Tap Access 
Port) to select the 
operation 
of the test logic. TMS is sampled 
on the rising edge of TCK. To guarantee 
deterministic 
behavior 
of the TAP controller 
TMS is provided with an internal 
pull-up 
resistor. 
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Name 
Active 
When 
Level 
Floated 


BREQ 
HIGH 
HLOA 
HIGH 
BEO#-BE3# 
LOW 
Bus Hold 
PWT,PCO 
HIGH 
Bus Hold 
W/R#, 
O/C#, 
M/IO# 
HIGH 
Bus Hold 
LOCK# 
LOW 
Bus Hold 
PLOCK# 
LOW 
Bus Hold 
AOS# 
LOW 
Bus Hold 
BLAST # 
LOW 
Bus Hold 
PCHK# 
LOW 
FERR# 
LOW 
A2-A3 
HIGH 
Bus, Address 
Hold 


Name 
Active 
Synchronousl 
Level 
Asynchronous 


CLK 
RESET 
HIGH 
Asynchronous 
HOLD 
HIGH 
Synchronous 
AHOLO 
HIGH 
Synchronous 
EAOS# 
LOW 
Synchronous 
BOFF# 
LOW 
Synchronous 
FLUSH# 
LOW 
Asynchronous 
A20M# 
LOW 
Asynchronous 
BS16#, 
BS8# 
LOW 
Synchronous 
KEN# 
LOW 
Synchronous 
ROY# 
LOW 
Synchronous 
BROY# 
LOW 
Synchronous 
INTR 
HIGH 
Asynchronous 
NMI 
HIGH 
Asynchronous 
IGNNE# 
LOW 
Asynchronous 


Name 
Active 
When 
Level 
Floated 


00-031 
HIGH 
Bus Hold 
OPO-OP3 
HIGH 
Bus Hold 
A4-A31 
HIGH 
Bus, Address 
Hold 


Name 
Input or 
Sampledl 
Output 
Driven On 


TCK 
Input 
N/A 


TOI 
Input 
Rising Edge of TCK 


TOO 
Output 
Falling Edge of TCK 


TMS 
Input 
Rising Edge of TCK 


intel486™ 
CPU 
Component 
Revision 
Stepping 
Name 
10 
10 


B3 
04 
01 


B4 
04 
01 


B5 
04 
01 


B6 
04 
01 


CO 
04 
02 


C1 
04 
03 


DO 
04 
04 


cA2 
04 
10 


cA3 
04 
10 


cBO 
04 
11 


cB1 
04 
11 


IntelOverDrlve™ 
Processor 
Stepping 
Name 


A2 
04 
32 


B1 
04 
33 


The Intel486 Microprocessor is a 32-bit architecture 
with on-chip memory management, floating point 
and cache memory units. 


The Intel486 Microprocessor contains all the fea- 
tures of the 386 Microprocessor with enhancements 
to increase performance. The instruction set in- 
cludes the complete 386 microprocessor instruction 
set along with extensions to serve new applications. 
The on-chip memory management unit (MMU) is 
completely compatible with the 386 Microprocessor 
MMU. The Intel486 Microprocessor brings the 387 
math coprocessor on-chip. All software written for 
the 386 microprocessor, 387 math coprocessor and 
previous members of the 86/87 architectural family 
will run on the Intel486 Microprocessor without any 
modifications. 


Several enhancements have been added to the In- 
tel486 Microprocessor to increase performance. On- 
chip cache memory allows frequently used data and 
code to be stored on-chip reducing accesses to the 
external bus. RISC design techniques have been 
used to reduce instruction cycle times. A burst bus 
feature enables fast cache fills. All of these features, 
combined, lead to performance greater than twice 
that of a 386 Microprocessor. 


The memory management unit (MMU) consists of a 
segmentation unit and a paging unit. Segmentation 
allows management of the logical address space by 
providing easy data and code relocatibility and effi- 
cient sharing of global resources. The paging mech- 
anism operates beneath segmentation and is trans- 
parent to the 
segmentation 
process. 
Paging is 
optional and can be disabled by system software. 
Each segment can be divided into one or more 
4 Kbyte segments. To implement a virtual memory 
system, the Intel486 Microprocessor supports full re- 
startability for all page and segment faults. 


Memory is organized into one or more variable 
length segments, each up to four gigabytes (232 
bytes) in size. A segment can have attributes associ- 
ated with it which include its location, size, type (Le., 
stack, code or data), and protection characteristics. 
Each task on an Intel486 Microprocessor can have a 
maximum of 16,381 segments, each up to four giga- 
bytes in size. Thus each task has a maximum of 64 
terabytes (trillion bytes) of virtual memory. 


The segmentation unit provides four-levels of pro- 
tection for isolating and protecting applications and 
the operating system from each other. The hardware 
enforced protection allows the design of systems 
with a high degree of integrity. 


The Intel486 Microprocessor has two modes of op- 
eration: Real Address Mode (Real Mode) and Pro- 
tected 
Mode 
Virtual 
Address 
Mode 
(Protected 
Mode). In Real Mode the Intel486 Microprocessor 
operates as a very fast 8086. Real Mode is required 
primarily to set up the processor for Protected Mode 
operation. Protected Mode provides access to the 
sophisticated memory management paging and priv- 
ilege capabilities of the processor. 


Within Protected Mode, software can perform a task 
switch to enter into tasks designated as Virtual 8086 
Mode tasks. Each virtual 8086 task behaves with 
8086 semantics, allowing 8086 software (an applica- 
tion program or an entire operating system) to exe- 
cute. 


The on-chip floating point unit operates in parallel 
with the arithmetic and logic unit and provides arith- 
metic instructions for a variety of numeric data types. 
It executes numerous built-in transcendental func- 
tions (e.g., tangent, sine, cosine, and log functions). 
The floating point unit fully conforms to the ANSII 
IEEE standard 754-1985 for floating point arithmetic. 


The on-chip cache is 8 Kbytes in size. It is 4-way set 
associative and follows a write-through policy. The 
on-chip cache includes features to provide flexibility 
in external memory system design. Individual pages 
can be designated as cacheable or non-cacheable 
by software or hardware. The cache can also be en- 
abled and disabled by software or hardware. 


Finally the Intel486 Microprocessor has features to 
facilitate high performance hardware designs. The 
1X clock eases high frequency board level designs. 
The burst bus feature enables fast cache fills. These 
features are described beginning in Section 6. 


The Intel486 Microprocessor register set includes all 
the registers contained in the 386 Microprocessor 
and the 387 math coprocessor. The register set can 
be split into the following categories: 


Base Architecture Registers 
General Purpose Registers 
Instruction Pointer 
Flags Register 
Segment Registers 


Systems Level Registers 
Control Registers 
System Address Registers 


Floating Point Registers 
Oata Registers 


Tag Word 
Status Word 
Instruction and Oata Pointers 
Control Word 


The base architecture and floating point registers 
are accessible by the applications program. The sys- 
tem level registers are only accessible at privilege 
level 0 and are used by the systems level program. 
The debug and test registers are also only accessi- 
ble at privilege level O. 


Figure 2.1 shows the Intel486 Microprocessor base 
architecture registers. The contents of these regis- 
ters are task-specific and are automatically loaded 
with a new context upon a task switch operation. 


General 
Purpose 
Registers 


31 
24123 
16 
15 
817 
0 


AH 
AX 
AL 
EAX 


BH 
BX 
BL 
EBX 


CH 
cx 
CL 
ECX 


DH 
OX 
DL 
EDX 


Sl 
ESI 


01 
EDI 


BP 
EBP 


SP 
ESP 


Segment 
Registers 
15 
0 


CS 
Code Segment 


SS 
Stack 
Segment 
") 


ES 
Data Segments 
FS 


GS 


Instruction 
Pointer 
31 
16 
15 
0 
I 
I 
IP 
I EIP 


Flags Register 
I 
I 
FLAGS 
I EFLAGS 


intel~ 


The base architecture includes six directly accessi- 
ble descriptors, each specifying a segment up to 
4 Gbytes in size. The descriptors are indicated by 
the selector values placed in the Intel486 Microproc- 
essor segment registers. Various selector values 
can be loaded as a program executes. 


The selectors are also task-specific, so the segment 
registers are automatically loaded with new context 
upon a task switch operation. 


2.1.1.1 
General 
Purpose 
Registers 


The eight 32-bit general 
purpose registers are 
shown in Figure 2.1. These registers hold data or 
address quantities. The general purpose registers 
can support data operands of 1, 8, 16 and 32 bits, 
and bit fields of 1 to 32 bits. Address operands of 16 
and 32 bits are supported. The 32-bit registers are 
named EAX, EBX, ECX, EOX, ESI, EOI, EBP and 
ESP. 


The least significant 16 bits of the general purpose 
registers can be accessed separately by using the 
16-bit names of the registers AX, BX, CX, OX,SI, 01, 
BP and SP. The upper 16 bits of the register are not 
changed when the lower 16 bits are accessed sepa- 
rately. 


Finally 8-bit operations can individually access the 
lowest byte (bits 0-7) 
and the higher byte (bits 
8-15) of the general purpose registers AX, BX, CX 
and OX.The lowest bytes are named AL, BL, CL and 
OL respectively. The higher bytes are named AH, 
BH, CH and OH respectively. The individual byte ac- 
cessibility offers additional flexibility for data opera- 
tions but is not used for effective address calcula- 
tion. 


The instruction pointer, shown in Figure 2.1, is a 
32-bit register named EIP. EIP holds the offset of the 
next instruction to be executed. The offset is always 
relative to the base of the code segment (CS). The 
lower 16 bits (bits 0-15) of the EIP contain the 16-bit 
instruction pointer named IP, which is used for 16-bit 
addressing. 


2.1.1.3 
Flags Register 


The 
flags 
register 
is 
a 
32-bit 
register 
named 
EFLAGS. The defined bits and bit fields within 
EFLAGS control certain operations and indicate 
status of the Intel486 Microprocessor. The lower 
16 bits (bits 0-15) 
of EFLAGS contain the 16-bit 
register named FLAGS, which is most useful when 
executing 8086 and 80286 code. EFLAGS is shown 
in Figure 2.2. 
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CARRY 
FLAG 
PARITY 
FLAG 
AUXILIARY 
CARRY 
ZERO 
FLAG 
SIGN 
FLAG 
TRAP 
FLAG 


ALIGNlo4ENT 
CHECK 
VIRTUAL 
1o40DE 
RESUlo4E 
FLAG 
NESTED 
TASK 
FLAG 
I/O 
PRIVILEGE 
LEVEL 
OVERFLOW 
DIRECTION 
FLAG 
INTERRUPT 
ENABLE 


NOTE: 
o indicates 
Intel Reserved: 
do not define; 
see Section 
2.1.6. 


EFLAGS bits 1, 3, 5,15 and 19-31 are "undefined". 
When these bits are stored during interrupt process- 
ing or with a PUSHF instruction (push flags on$o 
stack), a one is stored in bit 1 and zeros in bits 3, 5, . 
15 and 19-31. 


to an odd address, a dword access to an ad- 
dress that is not on a dword boundary, or an 
8-byte reference to an address that is not on a 
64-bit word boundary. See Section 7.1.6 for 
more information on operand alignment. 


Alignment faults are only generated by pro- 
grams running at privilege level 3. The AC bit 
setting is ignored at privilege levels 0, 1 and 2. 
Note that references to the descriptor tables 
(for selector loads), or the task state segment 
(TSS), are implicitly level 0 references even if 
the instructions causing the references are 
executed at level 3. Alignment faults are re- 
ported through interrupt 17, with an error code 
of O. Table 2.1 gives the alignment required 
for the Intel486 microprocessor data types. 


The EFLAGS register in the Intel486 Microprocessor 
contains a new bit not previously defined. The new 
bit, AC, is defined in the upper 16 bits of the register 
and it enables faults on accesses to misaligned 
data. 
AC 
(Alignment Check, bit 18) 
The AC bit enables the generation of faults if a 
memory reference is to a misaligned address. 
Alignment faults are enabled when AC is set 
to 1. A mis-aligned address is a word access 


Memory 
Access 
Alignment 
(Byte Boundary) 


Word 
2 
Dword 
4 
Single Precision Real 
4 
Double Precision Real 
8 
Extended Precision Real 
8 
Selector 
2 
48-Bit Segmented Pointer 
4 
32-Bit Flat Pointer 
4 
32-Bit Segmented Pointer 
2 
48-Bit "Pseudo-Descriptor" 
4 
FSTENVIFLDENV Save Area 
4/2 (On Operand Size) 
FSAVE/FRSTOR Save Area 
4/2 (On Operand Size) 
Bit String 
4 


•• 


IMPLEMENTATION 
NOTE: 
Several instructions on the Intel486 Microprocessor 
generate misaligned references, even if their mem- 
ory address is aligned. For example, on the In- 
tel486 Microprocessor, the SGDT/SIDT (store glob- 
al/interrupt 
descriptor 
table) 
instruction 
reads/ 
writes two bytes, and then reads/writes four bytes 
from a "pseudo-descriptor" 
at the given address. 
The Intel486 Microprocessor will generate misa- 
ligned references unless the address is on a 2 mod 
4 boundary. The FSAVE and FRSTOR instructions 
(floating point save and restore state) will generate 
misaligned references for one-half of the register 
save/restore 
cycles. The Intel486 Microprocessor 
will not cause any AC faults if the effective address 
given in the instruction has the proper alignment. 
VM 
(Virtual 8086 Mode, bit 17) 
The VM bit provides Virtual 8086 Mode within 
Protected Mode. If set while the Intel486 Mi- 
croprocessor is in Protected Mode, the In- 
tel486 Microprocessor will switch to Virtual 
8086 operation, handling segment loads as 
the 8086 does, but generating exception 13 
faults on privileged opcodes. The VM bit can 
be set only in Protected Mode, by the IRET 
instruction (if current privilege level = 0) and 
by task switches at any privilege level. The 
VM bit is unaffected by POPF. PUSHF always 
pushes a 0 in this bit, even if executing in Vir- 
tual 8086 Mode. The EFLAGS image pushed 
during interrupt processing or saved during 
task switches will contain a 1 in this bit if the 
interrupted code was executing as a Virtual 
8086 Task. 
RF 
(Resume Flag, bit 16) 
The RF flag is used in conjunction with the 
debug register breakpoints. It is checked at 
instruction boundaries before breakpoint pro- 
cessing. When RF is set, it causes any debug 
fault to be ignored on the next instruction. RF 
is then automatically reset at the successful 
completion of every instruction (no faults are 
signalled) except the IRET instruction, the 
POPF instruction, (and JMP, CALL, and INT 
instructions causing a task switch). These in- 
structions set RF to the value specified by the 
memory image. For example, at the end of the 
breakpoint service routine, the IRET instruc- 
tion can pop an EFLAG image having the RF 
bit set and resume the program's execution at 
the breakpoint address without generating an- 
other breakpoint fault on the same location. 


NT 
(Nested Task, bit 14) 
This flag applies to Protected Mode. NT is set 
to indicate that the execution of this task is 
nested within another task. If set, it indicates 


that the current nested task's Task State Seg- 
ment (TSS) has a valid back link to the previ- 
ous task's TSS. This bit is set or reset by con- 
trol transfers to other tasks. The value of NT 
in EFLAGS is tested by the IRET instruction to 
determine whether to do an inter-task return 
or an intra-task return. A POPF or an IRET 
instruction will affect the setting of this bit ac- 
cording to the image popped, at any privilege 
level. 


10PL (Input/Output Privilege Level, bits 12-13) 


This two-bit field applies to Protected Mode. 
10PL indicates the numerically maximum CPL 
(current privilege level) value permitted to ex- 
ecute I/O instructions without generating an 
exception 13 fault or consulting the I/O Per- 
mission Bitmap. It also indicates the maximum 
CPL value allowing alteration of the IF (INTR 
Enable Flag) bit when new values are popped 
into the EFLAG register. POPF and IRET in- 
struction can alter the 10PLfield when execut- 
ed at CPL = O.Task switches can always al- 
ter the 10PLfield, when the new flag image is 
loaded from the incoming task's TSS. 
OF 
(Overflow Flag, bit 11) 
OF is set if the operation resulted in a signed 
overflow. Signed overflow occurs when the 
operation resulted in carry/borrow 
Into the 
sign bit (high-order bit) of the result but did not 
result in a carry/borrow out of the high-order 
bit, or vice-versa. For 8-, 16-, 32-bit opera- 
tions, OF is set according to overflow at bit 7, 
15, 31, respectively. 


DF 
(Direction Flag, bit 10) 
DF defines whether ESI and/or EDI registers 
postdecrement or postincrement during the 
string instructions. Postincrement occurs if DF 
is reset. Postdecrement occurs if DF is set. 


IF 
(INTR Enable Flag, bit 9) 
The IF flag, when set, allows recognition of 
external interrupts signalled on the INTR pin. 
When IF is reset, external interrupts signalled 
on the INTR are not recognized. 10PL indi- 
cates the maximum CPL value allowing altera- 
tion of the IF bit when new values are popped 
into EFLAGS or FLAGS. 
TF 
(Trap Enable Flag, bit 8) 
TF controls the generation of exception 1 trap 
when single-stepping through code. When TF 
is set, the Intel486 Microprocessor generates 
an exception 1 trap after the next instruction is 
executed. When TF is reset, exception 1 traps 
occur only as a function of the breakpoint ad- 
dresses loaded into debug registers DRO- 
DR3. 


SF 
(Sign Flag, bit 7) 
SF is set if the high-order bit of the result is 
set, it is reset otherwise. For 8-, 16-, 32-bit 
operations, SF reflects the state of bit 7, 15, 
31 respectively. 
ZF 
(Zero Flag, bit 6) 
ZF is set if all bits of the result are O. Other- 
wise it is reset. 


AF 
(Auxiliary Carry Flag, bit 4) 
The Auxiliary Flag is used to simplify the addi- 
tion and subtraction of packed BCD quanti- 
ties. AF is set if the operation resulted in a 
carry out of bit 3 (addition) or a borrow into bit 
3 (subtraction). Otherwise AF is reset. AF is 
affected by carry out of, or borrow into bit 3 
only, regardless of overall operand length: 8, 
16 or 32 bits. 
PF 
(Parity Flags, bit 2) 
PF is set if the low-order eight bits of the oper- 
ation contains an even number of "1's" (even 
parity). PF is reset if the low-order eight bits 
have odd parity. PF is a function of only the 
low-order eight bits, regardless of operand 
size. 
CF 
(Carry Flag, bit 0) 
CF is set if the operation resulted in a carry 
out of (addition), or a borrow into (subtraction) 
the high-order bit. Otherwise CF is reset. For 
8-, 16- or 32-bit operations, CF is set accord- 
ing to carry/borrow at bit 7,15 or 31, respec- 
tively. 


SEGMENT 
REGISTERS 


NOTE: 
In these descriptions, "set" means "set to 1," and 
"reset" means "reset to 0." 


2.1.1.4 Segment 
Registers 


Six 16-bit segment registers hold segment selector 
values identifying the currently addressable memory 
segments. In protected mode, each segment may 
range in size from one byte up to the entire linear 
and physical address space of the machine, 4 
Gbytes (232 bytes). In real address mode, the maxi- 
mum segment size is fixed at 64 Kbytes (216bytes). 


The six addressable segments are defined by the 
segment registers CS, SS, OS, ES, FS and GS. The 
selector in CS indicates the current code segment; 2 
the selector in SS indicates the current stack seg- 
ment; the selectors in OS, ES, FS and GS indicate 
the current data segments. 


2.1.1.5 Segment 
Descriptor 
Cache 
Registers 


The segment descriptor cache registers are not pro- 
grammer visible, yet it is very useful to understand 
their content. A programmer invisible descriptor 
cache register is associated with each programmer- 
visible segment register, as shown by Figure 2.3. 
Each descriptor cache register holds a 32-bit base 
address, a 32-bit segment limit, and the other neces- 
sary segment attributes. 
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When a selector value is loaded into a segment reg- 
ister, the associated descriptor cache register is au- 
tomatically updated with the correct information. In 
Real Address Mode, only the base address is updat- 
ed directly (by shifting the selector value four bits to 
the left), since the segment maximum limit and attri- 
butes are fixed in Real Mode. In Protected Mode, 
the base address, the limit, and the attributes are all 
updated per the contents of the segment descriptor 
indexed by the selector. 


Whenever a memory reference occurs, the segment 
descriptor cache register associated with the seg- 
ment being used is automatically involved with the 
memory reference. The 32-bit segment base ad- 
dress becomes a component of the linear address 
calculation, the 32-bit limit is used for the limit-check 
operation, and the attributes are checked against 
the type of memory reference requested. 


The system level registers, Figure 2.4, control opera- 
tion of the on-chip cache, the on-chip floating point 
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unit (FPU) and the segmentation and paging mecha- 
nisms. These registers are only accessible to pro- 
grams running at privilege level D, the highest privi- 
lege level. 


The system level registers include three control reg- 
isters and four segmentation base registers. The 
three control registers are CRD,CR2 and CR3. CR1 
is reserved for future Intel processors. The four seg- 
mentation base registers are the Global Descriptor 
Table Register (GDTR), the Interrupt Descriptor Ta- 
ble Register (IDTR),the Local Descriptor Table Reg- 
ister (LDTR) and the Task State Segment Register 
(TR). 


2.1.2.1 
Control 
Registers 


Control 
Register 
0 (CRO) 


CRD, shown in Figure 2.5, contains 1D bits for con- 
trol and status purposes. Five of the bits defined in 
the Intel486 Microprocessor's CRD are newly de- 
fined. The new bits are CD, NW, AM, WP and NE. 
The function of the bits in CRD can be categorized 
as follows: 
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See Section 
2.1.6 


Intel486 
Microprocessor 
Operating 
Modes: PG, PE 
(Table 2.2) 


On-Chip 
Cache Control 
Modes: 
CD, NW (Table 2.3) 


On-Floating 
Point Unit Control: 
TS, EM, MP, NE 
(Table 2.4) 


Alignment 
Check 
Control: 
AM 


Supervisor 
Write Protect: 
WP 


Table 2.2. Processor 
Operating 
Modes 


PG 
PE 
Mode 


0 
0 
REAL Mode. Exact 8086 semantics, 
with 32-bit extensions 
available 
with 
prefixes. 


0 
1 
Protected 
Mode. Exact 80286 
semantics, 
plus 32-bit extensions 
through 
both prefixes and "default" 
prefix setting associated 
with code 
segment 
descriptors. 
Also, a sub- 
mode is defined 
to support a virtual 
8086 within the context 
of the 
extended 
80286 protection 
model. 


1 
0 
UNDEFINED. 
Loading CRO with this 
combination 
of PG and PE bits will 
raise a GP fault with error code O. 


1 
1 
Paged Protected 
Mode. All the 
facilities 
of Protected 
mode, with 
paging enabled 
underneath 
segmentation. 


CD 
NW 
Operating 
Mode 


1 
1 
Cache fills disabled, write-through 
and 
invalidates 
disabled. 


1 
0 
Cache fills disabled, 
write-through 
and 
invalidates 
enabled. 


0 
1 
INVALID. 
If CRO is loaded with this 
configuration 
of bits, a GP fault with 
error code is raised. 


0 
0 
Cache fills enabled, write-through 
and 
invalidates 
enabled. 


CROBIT 
Instruction 
Type 


EM 
TS 
MP 
Floating-Point 
Walt 


0 
0 
0 
Execute 
Execute 
0 
0 
1 
Execute 
Execute 
0 
1 
0 
Trap 7 
Execute 
0 
1 
1 
Trap 7 
Trap 7 
1 
0 
0 
Trap 7 
Execute 
1 
0 
1 
Trap 7 
Execute 
1 
1 
0 
Trap 7 
Execute 
1 
1 
1 
Trap 7 
Trap 7 


The low-order 
16 bits of CRO are also known 
as the 
Machine 
Status 
Word 
(MSW), 
for compatibility 
with 
the 80286 protected 
mode. LMSW and SMSW (load 
and 
store 
MSW) 
instructions 
are taken 
as special 
aliases of the load and store CRO operations, 
where 
only the low-order 
16 bits of CRO are involved. 
The 
LMSW and SMSW instructions 
in the Intel486 
micro- 
processor 
work in an identical 
fashion 
to the LMSW 
and SMSW 
instructions 
in the 80286 
(Le., they only 
operate 
on the low-order 
16 bits of CRO and ignores 
the new bits). New Intel486 
Microprocessor 
operat- 
ing systems 
should 
use the MOV CRO, Reg instruc- 
tion. 


The defined 
CRO bits are described 
below. 


PG 
(Paging Enable, 
bit 31) 


The PG bit is used to indicate whether 
paging is 2 


enabled 
(PG = 1) or disabled 
(PG = 0). See Ta- 
ble 2.2. 


CD 
(Cache 
Disable, 
bit 30) 


The CD bit is used to enable the on-chip 
cache. 


When 
CD = 1, the 
cache 
will 
not 
be filled 
on 
cache misses. When CD = 0, cache fills may be 
performed 
on misses. See Table 2.3. 


The state of the CD bit, the cache enable 
input 
pin (KEN#), 
and the relevant 
page cache 
dis- 
able 
(PCD) 
bit determine 
if a line read 
in re- 
sponse 
to a cache 
miss will be installed 
in the 
cache. 
A line is installed 
in the 
cache 
only 
if 
CD = 0 and KEN # and PCD are both zero. The 
relevant 
PCD bit comes 
from 
either 
the 
page 
table entry, page directory 
entry or control 
reg- 
ister 3. Refer to Section 
5.6 for more details on 
page cacheability. 


CD is set to one after RESET. 


NW (Not Write-Through, 
bit 29) 


The 
NW 
bit 
enables 
on-chip 
cache 
write- 
throughs 
and write-invalidate 
cycles 
(NW = 0). 


When 
NW = 0, all writes, 
including 
cache 
hits, 
are sent out to the pins. 
Invalidate 
cycles 
are 
enabled 
when 
NW = O. During an invalidate 
cy- 
cle a line will be removed 
from the cache 
if the 
invalidate 
address 
hits in the cache. 
See Table 
2.3. 


When 
NW = 1, write-throughs 
and 
write-invali- 
date cycles are disabled. 
A write will not be sent 
to the pins if the write 
hits in the cache. 
With 
NW = 1 the only write cycles that reach the ex- 
ternal 
bus 
are 
cache 
misses. 
Write 
hits 
with 
NW = 1 will never update 
main memory. 
Invali- 
date cycles 
are ignored 
when NW = 1. 


AM 
(Alignment 
Mask, bit 18) 


The 
AM 
bit 
controls 
whether 
the 
alignment 
check (AC) bit in the flag register 
(EFLAGS) 
can 
allow 
an alignment 
fault. 
AM = 0 disables 
the 
AC bit. AM = 1 enables 
the AC bit. AM = 0 is the 
386 Microprocessor 
compatible 
mode. 


386 Microprocessor software may load incor- 
rect data into the AC bit in the EFLAGS register. 
Setting AM = 0 will prevent AC faults from oc- 
curring before the Intel486 Microprocessor has 
created the AC interrupt service routine. 
WP (Write Protect, bit 16) 
WP protects read-only pages from supervisor 
write access. The 386 Microprocessor allows a 
read-only page to be written from privilege lev- 
els 0-2. The Intel486 Microprocessor is com- 
patible 
with 
the 
386 
Microprocessor 
when 
WP=O. WP=1 
forces a fault on a write to a 
read-only page from any privilege level. Operat- 
ing systems with Copy-on-Write features can be 
supported with the WP bit. Refer to Section 
4.5.3 for further details on use of the WP bit. 


NE (Numerics Exception, bit 5) 
The NE bit controls whether unmasked floating 
point exceptions (UFPE) are handled through 
interrupt vector 16 (NE= 1) or through an exter- 
nal interrupt (NE= 0). NE= 0 (default at reset) 
supports the DOS operating system error re- 
porting scheme from the 8087, 80287 and 387 
math coprocessor. In DOS systems, math co- 
processor errors are reported via external inter- 
rupt vector 13. DOS uses interrupt vector 16 for 
an operating system call. Refer to Sections 
6.2.13 and 7.2.14 for more information on float- 
ing point error reporting. 
For any UFPE the floating point error output pin 
(FERR#) will be driven active. 
For NE= 0, the Intel486 Microprocessor works 
in conjunction with the ignore numeric error in- 
put (IGNNE#) 
and the FERR# 
output pins. 
When a UFPE occurs and the IGNNE# input is 
inactive, the Intel486 Microprocessor freezes 
immediately before executing the next floating 
point instruction. An external interrupt controller 
will supply an interrupt vector when FERR# is 
driven active. The UFPE is ignored if IGNNE# 
is active and floating point execution continues. 


NOTE: 
The freeze does not take place if the next in- 
struction 
is one of the 
control 
instructions 
FNCLEX, 
FNINIT, 
FNSAVE, 
FNSTENV, 
FNSTCW, FNSTSW, FNSTSW AX, FNENI, 
FNDISI and FNSETPM. The freeze does occur 
if the next instruction is WAIT. 
For NE= 1, any UFPE will result in a software 
interrupt 16, immediately before executing the 
next non-control floating point or WAIT instruc- 
tion. The ignore numeric error input (IGNNE#) 
signal will be ignored. 
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TS 
(Task Switched, bit 3) 
The TS bit is set whenever a task switch opera- 
tion is performed. Execution of a floating point 
instruction with TS= 1 will cause a device not 
available (DNA) fault (trap vector 7). If TS = 1 
and MP= 1 (monitor coprocessor in CRO) a 
WAIT instruction will cause a DNA fault. See 
Table 2.4. 


EM (Emulate Coprocessor, bit 2) 
The EM bit determines whether floating point 
instructions are trapped (EM= 1) or executed. If 
EM= 1, all floating point instructions will cause 
fault 7. 


NOTE: 
WAIT instructions are not affected by the state 
of EM. See Table 2.4. 


MP (Monitor Coprocessor, bit 1) 
The MP bit is used in conjunction with the TS bit 
to determine if WAIT instructions should trap. If 
MP= 1 and TS = 1, WAIT instructions cause 
fault 7. Refer to Table 2.4. The TS bit is set to 1 
on task switches by the Intel486 Microproces- 
sor. Floating point instructions are not affected 
by the state of the MP bit. It is recommended 
that the MP bit be set to one for the normal 
operation of the Intel486 Microprocessor. 
PE (Protection Enable, bit 0) 
The PE bit enables the segment based protec- 
tion mechanism. If PE= 1 protection is enabled. 
When PE= 0 the Intel486 Microprocessor oper- 
ates in REAL mode, with segment based pro- 
tection disabled, and addresses formed as in an 
8086. Refer to Table 2.2. 


All new CRObits added to the 386 and Intel486 Mi- 
croprocessors, except for ET and NE, are upward 
compatible with the 80286 because they are in reg- 
ister bits not defined in the 80286. For strict compati- 
bility with the 80286, the load machine status word 
(LMSW) instruction is defined to not change the ET 
or NE bits. 


CR1 is reserved for use in future Intel microproces- 
sors. 


Control 
Register 
2 (CR2) 


CR2, shown in Figure 2.6, holds the 32-bit linear ad- 
dress that caused the last page fault detected. The 
error code pushed onto the page fault handler's 
stack when it is invoked provides additional status 
information on this page fault. 
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Intel reserved: 
Do not define; 
See Section 
2.1.6. 


CR3, shown in Figure 2.6, contains the physical 
base address of the page directory table. The In- 
tel486 Microprocessor page directory is always page 
aligned (4 Kbyte-aligned). This alignment is enforced 
by only storing bits 20-31 in CR3. 


In the Intel486 Microprocessor CR3 contains two 
new bits, page write-through (PWT) (bit 3) and page 
cache disable (PCO) (bit 4). The page table entry 
(PTE) and page directory entry (POE) also contain 
PWT and PCO bits. PWT and PCO control page 
cacheability. When a page is accessed in external 
memory, the state of PWT and PCO are driven out 
on the PWT and PCO pins. The source of PWT and 
PCO can be CR3, the PTE or the POE. PWT and 
PCO are sourced from CR3 when the POE is being 
updated. When paging is disabled (PG = 0 in CRO), 
PCO and PWT are assumed to be 0, regardless of 
their state in CR3. 


A task switch through a task state segment (TSS) 
which changes the values in CR3, or an explicit load 
into CR3 with any value, will invalidate all cached 
page table entries in the translation lookaside buffer 
(TLB). 


The page directory base address in CR3 is a physi- 
cal address. The page directory can be paged out 
while its associated task is suspended, but the oper- 
ating system must ensure that the page directory is 
resident in physical memory before the task is dis- 
patched. The entry in the TSS for CR3 has a physi- 
cal address, with no provision for a present bit. This 
means that the page directory for a task must be 
resident in physical memory. The CR3 image in a 
TSS must point to this area, before the task can be 
dispatched through its TSS. 


2.1.2.2 System Address Registers 


Four special registers are defined to reference the 
tables or segments supported by the 80286, 386 
and 
Intel486 
Microprocessor 
protection 
model. 


These tables or segments are: 


GOT (Global Descriptor Table) 
lOT (Interrupt Descriptor Table) 
LOT (Local Descriptor Table) 
TSS (Task State Segment) 


The addresses of these tables and segments are 
stored in special registers, the System Address and 
System Segment Registers, illustrated in Figure 2.4. 
These registers are named GOTR, IOTR, LOTR and 
TR respectively. Section 4, Protected Mode Archi- 
tecture, describes the use of these registers. 


System Address Registers: GDTR and IDTR 


The GOTR and IOTR hold the 32-bit linear base ad- 
dress and 16-bit limit of the GOT and lOT, respec- 
tively. 


Since the GOT and lOT segments are global to all 
tasks in the system, the GOT and lOT are defined by 


• 32-bit linear addresses (subject to page translation if 
paging is enabled) and 16-bit limit values. 


System Segment Registers: LDTR and TR 


The LOTR and TR hold the 16-bit selector for the 
LOT descriptor and the TSS descriptor, respectively. 


Since the LOT and TSS segments are task specific 
segments, the LOT and TSS are defined by selector 
values stored in the system segment registers. 


NOTE: 
A programmer-invisible segment descriptor register 
is associated with each system segment register. 


Figure 2.7 shows the floating point register set. The 
on-chip FPU contains eight data registers, a tag 
word, a control register, a status register, an instruc- 
tion pointer and a data pointer. 
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The operation of the Intel486 Microprocessor's on- 
chip floating point unit is exactly the same as the 
387 math coprocessor. Software written for the 387 
math coprocessor will run on the on-chip floating 
point unit (FPU) without any modifications. 


2.1.3.1 Data Registers 


Floating point computations use the Intel486 Micro- 
processor's FPU data registers. These eight 80-bit 
registers provide the equivalent capacity of twenty 
32-bit registers. Each of the eight data registers is 
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divided into "fields" corresponding to the FPU's ex- 
tended-precision data type. 


The FPU's register set can be accessed either as a 
stack, with instructions operating on the top one or 
two stack elements, or as a fixed register set, with 
instructions operating on explicitly designated regis- 
ters. The TOP field in the status word identifies the 
current top-of-stack register. A "push" 
operation 
decrements TOP by one and loads a value into the 
new top register. A "pop" operation stores the value 
from the current top register and then increments 
TOP by one. Like other Intel486 microprocessor 
stacks in memory, the FPU register stack grows 
"down" toward lower-addressed registers. 


Instructions may address the data registers either 
implicitly or explicitly. Many instructions operate on 
the register at the TOP of the stack. These instruc- 
tions implicitly address the register at which TOP 
points. Other instructions allow the programmer to 
explicitly specify which register to use. This explicit 
register addressing is also relative to TOP. 


2.1.3.2 Tag Word 


The tag word marks the content of each numeric 
data register, as shown in Figure 2.8. Each two-bit 
tag represents one of the eight data registers. The 
principal function of the tag word is to optimize the 
FPUs performance and stack handling by making it 
possible to distinguish between empty and nonemp- 
ty register locations. It also enables exception han- 
dlers to check the contents of a stack location with- 
out the need to perform complex decoding of the 
actual data. 


2.1.3.3 Status Word 


The 16-bit status word reflects the overall state of 
the FPU. The status word is shown in Figure 2.9 and 
is located in the status register. 
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TAG (5) 
I 
TAG (4) 
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TAG (3) 
I 
TAG (2) 
I 
TAG (1) 
I 
TAG (0) 


NOTE: 
The index i of tag (i) is not 
top-relative. 
A program 
typically 
uses the "top" 
field of Status 
Word to determine 
which 
tag (i) 
field refers 
to logical 
top of stack. 


TAG VALUES: 
00 = Valid 
01 = Zero 
10 = QNaN, 
SNaN, 
Infinity, 
Denormal 
and Unsupported 
Formats 
11 = Empty 


EXCEPTION fLAGS: 


PRECISION 


UNDERfLOW 


OVERfLOW 


ZERO DIVIDE 
DENORt.4ALlZED OPERAND 


INVALID OPERATION 


ES is set ~ any unmasked exception b~ is se~ cleared otherwise. 
See Teble 2.5 for interpretation of condition code. 
TOP values: 
000 = Register 0 is Top of Stack 
001 = Register 1 is Top of Stack 
. 


111 = Register 7 is Top of Stack 
For definitions of exceptions, refer to the Section entnled 
"Exception Handling". 


The B bit (Busy, bit 15) is included for 8087 compati- 
bility. The B bit reflects the contents of the ES bit (bit 
7 of the status word). 


Bits 13-11 (TOP) point to the FPU register that is 
the current top-of-stack. 


BUSY 


TOP Of STACK POINTER 


CONDmON 
CODE 


The four numeric condition code bits, CO-C3, are 
similar to the flags in EFLAGS. Instructions that per- 
form arithmetic operations update CO-C3 to reflect 
the outcome. The effects of these instructions on 
the condition codes are summarized in Tables 2.5 
through 2.8. 
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Instruction 
CO (5) 
I 
C3 (Z) 
C1 (A) 
C2(C) 


FPREM, FPREM1 
Three least significant 
bits 
Reduction 
(see Table 2.3) 
of quotient 
0= 
complete 
02 
00 
01 
orO/U# 
1 = incomplete 


FCOM, FCOMP, 
FCOMPP, FTST, 
Result of comparison 
Zero 
Operand 
is not 
FUCOM, FUCOMP, 
(see Table 2.7) 
orO/U# 
comparable 
FUCOMPP, 
FICOM, 
(Table 2.7) 
FICOMP 


FXAM 
Operand 
class 
Sign 
Operand 
class 
(see Table 2.8) 
orO/U# 
(Table 2.8) 


FCHS, FABS, FXCH, 
FINCTOP, 
FDECTOP, 
Constant 
loads, 
UNDEFINED 
Zero 
UNDEFINED 
FXTRACT, 
FLD, 
orO/U# 
FILD, FBLD, 
FSTP (ext real) 


FIST, FBSTP, 
FRNDINT, 
FST, 
FSTP, FADD, FMUL, 
FDIV, FDIVR, 
UNDEFINED 
Roundup 
UNDEFINED 
FSUB, FSUBR, 
orO/U# 
FSCALE, 
FSORT, 
FPATAN, 
F2XM1, 
FYL2X, FYL2XP1 


FPTAN, FSIN 
Roundup 
Reduction 
FCOS, FSINCOS 
UNDEFINED 
or O/U#, 
0= 
complete 
undefined 
1 = incomplete 
ifC2 
= 1 


FLDENV, 
FRSTOR 
Each bit loaded from memory 


FINIT 
Clears these bits 


FLDCW, FSTENV, 
FSTCW, FSTSW, 
UNDEFINED 
FCLEX, FSAVE 


O/U# 
When 
both 
IE and SF bits of status 
word 
are set, indicating 
a stack 
exception, 
this 
bit 
distinguishes 
between 
stack overflow 
(C1 = 
1) and underflow 
(C1 = 
0). 


Reduction 
If FPREM 
or FPREM1 
produces 
a remainder 
that 
is less than the 
modulus, 
reduction 
is 
complete. 
When 
reduction 
is incomplete 
the 
value 
at the 
top 
of the 
stack 
is a partial 
remainder, 
which can be used as input to further 
reduction. 
For FPTAN, 
FSIN, FCOS, and 
FSINCOS, 
the reduction 
bit is set if the operand 
at the top of the stack is too large. In this 
case the original 
operand 
remains 
at the top of the stack. 


Roundup 
When the PE bit of the status word is set, this bit indicates 
whether 
the last rounding 
in the 
instruction 
was upward. 


UNDEFINED 
Do not rely on finding 
any specific 
value in these 
bits. 


Condition 
Code 
Interpretation 
after FPREM and FPREM1 
C2 
C3 
C1 
CO 


Incomplete 
Reduction: 


1 
X 
X 
X 
further interaction 
required 
for complete 
reduction 


01 
00 
02 
o MOD8 
, 


0 
0 
0 
0 
0 
1 
0 
1 
Complete 
Reduction: 
1 
0 
0 
2 
0 
1 
1 
0 
3 
CO, C3, C1 contain three least 


0 
0 
1 
4 
significant 
bits of quotient 


0 
1 
1 
5 
1 
0 
1 
6 
1 
1 
1 
7 


Order 
C3 
C2 
CO 


TOP> 
Operand 
0 
0 
0 
TOP < Operand 
0 
0 
1 
TOP = Operand 
1 
0 
0 
Unordered 
1 
1 
1 


C3 
C2 
C1 
CO 
Value at TOP 


0 
0 
0 
0 
+ Unsupported 
0 
0 
0 
1 
+ NaN 
0 
0 
1 
0 
- 
Unsupported 
0 
0 
1 
1 
- 
NaN 
0 
1 
0 
0 
+ Normal 
0 
1 
0 
1 
+ Infinity 
0 
1 
1 
0 
- 
Normal 
0 
1 
1 
1 
- 
Infinity 
1 
0 
0 
0 
+0 
1 
0 
0 
1 
+ Empty 
1 
0 
1 
0 
-0 
1 
0 
1 
1 
- 
Empty 
1 
1 
0 
0 
+ Denormal 
1 
1 
1 
0 
. 
- 
Denormal 
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Bit 7 is the error summary (ES) status bit. The ES bit 
is set if any unmasked exception bit (bits 0-5 in the 
status word) is set; ES is clear otherwise. The 
FERR11 (floating point error) signal is asserted when 
ES is set. 


Bit 6 is the stack flag (SF). This bit is used to distin- 
guish invalid operations due to stack overflow or un· 
derflow. When SF is set, bit 9 (C1) distinguishes be- 
tween 
stack 
overflow 
(C1= 1) 
and 
underflow 
(C1 =0). 


Table 2.9 shows the six exception flags in bits 0-5 
of the status word. Bits 0-5 are set to indicate that 
the FPU has detected an exception while executing 
an instruction. 


The six exception flags in the status word can be 
individually masked by mask bits in the FPU control 
word. Table 2.9 lists the exception conditions, and 
their causes in order of precedence. Table 2.9 also 
shows the action taken by the FPU if the corre- 
sponding exception flag is masked. 


An exception that is not masked by the control word 
will cause three things to happen: the corresponding 
exception flag in the status word will be set, the ES 
bit in the status word will be set and the FERR11 
output signal will be asserted. When the Intel486 Mi- 
croprocessor attempts to execute another floating 
point or WAIT instruction, exception 16 occurs or an 
external interrupt happens if the NE= 1 in control 


register O. The exception condition must be resolved 
via an interrupt service routine. The FPU saves the 
address of the floating point instruction that caused 
the exception and the address of any memory oper- 
and required by that instruction in the instruction and 
data pointers (see Section 2.1.3.4). 


Note that when a new value is loaded into the status 
word 
by 
the 
FLDENV 
(load 
environment) 
or 
FRSTOR (restore state) instruction, the value of ES 
(bit 7) and its reflection in the B bit (bit 15) are not 
derived from the values loaded from memory. The 
values of ES and B are dependent upon the values 
of the exception flags in the status word and their 
corresponding masks in the control word. If ES is set 
in such a case, the FERR1I output of the Intel486 
Microprocessor is activated immediately. 


2.1.3.4 
Instruction 
and Data Pointers 


Because the FPU operates in parallel with the ALU 
(in the Intel486 microprocessor the arithmetic and 
logic unit (ALU) consists of the base architecture 
registers), any errors detected by the FPU may be 
reported after the ALU has executed the floating 
point instruction that caused it. To allow identifica- 
tion of the failing numeric instruction, the Intel486 
Microprocessor contains two pointer registers that 
supply the address of the failing numeric instruction 
and the address of its numeric memory operand (if 
appropriate). 


Exception 
Cause 
Default Action 
(If exception 
Is masked) 


Invalid 
Operation on a signaling NaN, unsupported format, 
Result is a quiet NaN, integer 
Operation 
indeterminate form (0· 00, 0/0, (+ 00) + (- 
00), etc.), or 
indefinite, or BCD indefinite 
stack overflow/underflow (SF is also set). 


Denormalized 
At least one of the operands is denormalized, Le.,it has 
Normal processing 
Operand 
the smallest exponent but a nonzero significand. 
continues 


Zero Divisor 
The divisor is zero while the dividend is a noninfinite, 
Result is 00 
nonzero number. 


Overflow 
The result is too large in magnitude to fit in the specified 
Result is largest finite value 
format. 
or 00 


Underflow 
The true result is nonzero but too small to be 
Result is denormalized or 
represented in the specified format, and, if underflow 
zero 
exception is masked, denormalization causes loss of 
accuracy. 


Inexact 
The true result is not exactly representable in the 
Normal processing 
Result 
specified format (e.g., 1/3); the result is rounded 
continues 
(Precision) 
according to the rounding mode. 


The instruction and data pointers are provided for 
user-written error handlers. These registers are ac- 
cessed 
by 
the 
FLDENV 
(load 
environment), 
FSTENV (store environment), FSAVE (save state) 
and FRSTOR (restore state) instructions. Whenever 
the Intel486 Microprocessor decodes a new floating 
point instruction, it saves the instruction (including 
any prefixes that may be"present), the address of 
the operand (if present) and the opcode. 


The instruction and data pointers appear in one of 
four formats depending on the operating mode of 
the Intel486 Microprocessor (protected mode or 
real-address 
mode) 
and 
depending 
on 
the 


operand-size attribute in effect (32-bit operand or 
16-bit operand). When the Intel486 Microprocessor 
is in the virtual-86 mode, the real address mode for- 
mats are used. The four formats are shown in Fig- 
ures 
2.10-2.13. 
The 
floating 
point 
instructions 
FLDENV, FSTENV, FSAVE and FRSTOR are used 
to transfer these values to and from memory. Note 
that the value of the data pointer is undefined if the 
prior floating point instruction did not have a memory 
operand. 


NOTE: 
The operand size attribute is the 0 bit in a segment 
descriptor. 


32-81T PROTECTED 
MODE FORMAT 


15 


0 • 
4 


8 


C 


10 


14 


18 


RESERVED 
CONTROL 
WORD 


RESERVED 
STATUS 
WORD 


RESERVED 
TAG WORD 


IPOFFSET 


00000 
I 
OPCODE10 
..0 
CSSELECTOR 


DATA OPERAND 
OFFSET 


RESERVED 
OPERAND 
SELECTOR 


32-81T REAL·ADDRESS 
MODE FORMAT 
15 
, 


RESERVED 
CONTRqL 
WORD 


RESERVED 
STATUS 
WORD 


RESERVED 
TAG ":",,ORD 


RESERVED 
INSTRUCTION 
POINTER 
15 ..0 


0000 
I 
INSTRUCTION 
POINTER 
31 ..16 
I 
0 
I 
OP~ODE 
10..0 


RESERVED 
OPERAND 
P~INTER 
15 ..0 


0000 
I 
OPERAND 
POINTER 
31..16 
I 
0000 
00000000 
, 
, 


16-BIT PROTECTED 
MODE FORMAT 
15 
7 
0 


CONTROL 
WORD 


STATUS 
WORD 


TAG WORD 


IPOFFSET 


CSSELECTOR 


OPERAND 
OFFSET 


OPERAND 
SELECTOR 


Figure 
2.12. Protected 
Mode 
FPU 
Instruction 
and Data Pointer 
Image 
In Memory, 
16-Blt Format 
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16-BIT REAL-ADDRESS 
MODE AND 


VIRTUAL-BOB6 
MODE FORMAT 
15 
7 
0 


CONTROL 
WORD 


STATUS 
WORD 


TAG WORD 


INSTRUCTION 
POINTER 
15..0 


IP19.16 
101 
OPCODE 
10 ..0 


OPERAND 
POINTER 
15 ..0 


DP 19.16 I0 I0 
0 
0 
0 
0 
0 
0 
0 
000 


Figure 
2.13. Real Mode 
FPU 
Instruction 
and Data Pointer 


Image 
In Memory, 
16·Blt Format 


The FPU provides 
several 
processing 
options 
that are selected 
by loading 
a control 
word from memory 
into 


the control 
register. 
Figure 2.14 shows the format 
and encoding 
of fields in the control 
word. 


RESERVED 


RESERVED· 


ROUNDING 
CONTROL 


PRECISION 
CONTROL 


EXCEPTION 
MASKS: 


PRECISION 


UNDERFLOW 


OVERFLOW 


ZERO DIVIDE 


DENORMALIZED 
OPERAND 


INVALID 
OPERATION 


Precision Control 
00-24 
bits (single 
precision) 
01-(reserved) 
10-53 
bits (double 
precision) 
11--64 
bits (extended 
precision) 


• "0" 
AFTER RESET OR FINIT; 


CHANGEABLE 
UPON LOADING THE 
CONTROL 
WORD (CW). 
PROGRAMS 
MUST IGNORE THIS BIT. 


Rounding Control 
Oo-Round 
to nearest 
or even 
01-Round 
down 
(toward 
- 00) 
10-Round 
up (toward + 00) 
l1-Ghop 
(truncate 
toward 
zero) 


The low-order byte of the FPU control word config- 
ures the FPU error and exception masking. Bits 0-5 
of the control word contain individual masks for each 
of the six exceptions that the FPU recognizes. 


The high-order byte of the control word configures 
the FPU operating mode, including precision and 
rounding. 
RC (Rounding Control, bits 10-11) 


The RC bits provide for directed rounding and 
true chop, as well as the unbiased round to 
nearest even mode specified in the IEEE stan- 
dard. Rounding control affects only those in- 
structions that perform rounding at the end of 
the operation (and thus can generate a preci- 
sion exception); namely, FST, FSTP, FIST, all 
arithmetic 
instructions 
(except 
FPREM, 
FPREM1, FXTRACT, FABS and FCHS), and all 
transcendental instructions. 
PC (Precision Control, bits 8-9) 
The PC bits can be used to set the FPU internal 
operating precision of the significand at less 
than the default of 64 bits (extended precision). 
This can be useful in providing compatibility with 
early generation arithmetic processors of small- 
er precision. PC affects only the instructions 
ADD, SUB, DIV, MUL, and SORT. For all other 
instructions, either the precision is determined 
by the opcode or extended precision is used. 


2.1.4.1 
Debug 
Registers 


The six programmer accessible debug registers, Fig- 
ure 2.15, provide on-chip support for debugging. De- 
bug registers DRO-3 specify the four linear break- 
points. The Debug control register DR?, is used to 
set the breakpoints and the Debug Status Register, 
DR6, displays the current state of the breakpoints. 
The use of the Debug registers is described in Sec- 
tion 9. 


LINEAR 
BREAKPOINT 
ADDRESS 
0 


LINEAR 
BREAKPOINT 
ADDRESS 
1 


LINEAR 
BREAKPOINT 
ADDRESS 
2 


LINEAR 
BREAKPOINT 
ADDRESS 
3 


Intel Reserved 
Do Not Define 


Intel Reserved 
Do Not Define 


BREAKPOINT 
STATUS 


BREAKPOINT 
CONTROL 


Test Registers 


CACHE TEST DATA 


CACHE TEST STATUS 


CACHE TEST CONTROL 


TLB TEST CONTROL 


TLB TEST STATUS 


TLB = Translation 
Lookaside 
Buffer 


ORO 


DR1 


DR2 


DR3 


DR4 


DR5 


DR6 


DR? 


TR3 


TR4 


TR5 


TR6 


TR? 


2.1.4.2 
Test Registers 


The Intel486 Microprocessor contains five test regis- 
ters. The test registers are shown in Figure 2.15. 
TR6 and TR? are used to control the testing of the 
translation lookaside buffer. TR3, TR4 and TR5 are 
used for testing the on-chip cache. The use of the 
test registers is discussed in Section 8. 


There are a few differences regarding the accessibil- 
ity of the registers in Real and Protected Mode. Ta- 
ble 2.10 summarizes these differences. See Section 
4, Protected Mode Architecture, for further details. 


• 


Use In 
Use In 
Use In 


Register 
Real Mode 
Protected Mode 
Virtual 8086 Mode 


load 
Store 
load 
Store 
load 
Store 


General 
Registers 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


Segment 
Register 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


Flag Register 
Yes 
Yes 
Yes 
Yes 
IOPL 
IOPLo 


Control 
Registers 
Yes 
Yes 
PL = 0 
PL = 0 
No 
Yes 


GDTR 
Yes 
Yes 
PL = 0 
Yes 
No 
Yes 


IDTR 
Yes 
Yes 
PL = 0 
Yes 
No 
Yes 


LDTR 
No 
No 
PL = 0 
Yes 
No 
No 


TR 
No 
No 
PL = 0 
Yes 
No 
No 


FPU Data Registers 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


FPU Control 
Registers 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


FPU Status Registers 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


FPU Instruction 
Pointer 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


FPU Data Pointer 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


Debug Registers 
Yes 
Yes 
PL = 0 
PL = 0 
No 
No 


Test Registers 
Yes 
Yes 
PL = 0 
PL = 0 
No 
No 


NOTES: 
PL = 0: The registers 
can be accessed 
only when the current 
privilege 
level is zero. 


"OPL: 
The PUSHF 
and POPF instructions 
are made 
I/O 
Privilege 
Level sensitive 
in Virtual 
86 Mode. 


2.1.6 COMPATIBiliTY 


VERYIMPORTANTNOTE: 
COMPATIBiliTY WITH FUTUREPROCESSORS 


In the preceding register descriptions, note cer- 
tain Intel486 Microprocessor 
register 
bits are 
Intel reserved. When reserved bits are called 
out, treat them as fully undefined. This Is essen· 
tlal for your software compatibility 
with future 
processorsl Follow the guidelines below: 
1) Do not depend on the states of any unde- 
fined bits when testing the values of defined 
register bits. Mask them out when testing. 
2) Do not depend on the states of any unde- 
fined bits when storing them to memory or 
another register. 


3) Do not depend on the ability to retain Infor- 
mation written Into any undefined bits. 
4) When loading registers always load the unde- 
fined bits as zeros. 
5) However, registers which have been previ- 
ously stored may be reloaded without mask· 
Ing. 


Depending upon the values of undefined regis- 
ter bits will make your software dependent upon 
the unspecified 
Intel486 Microprocessor 
han- 
dling of these bits. Depending on undefined val- 
ues risks making your software 
Incompatible 
with future processors that define usages for 
the 
Intel486 
Microprocessor-undefined 
bits. 
AVOID ANY SOFTWARE DEPENDENCE UPON 
THE STATE OF UNDEFINED Intel486 MICRO· 
PROCESSORREGISTERBITS. 


The Intel486 Microprocessor instruction set can be 
divided into 11 categories of operations: 


Data Transfer 
Arithmetic 
Shift/Rotate 
String Manipulation 
Bit Manipulation 
Control Transfer 
High Level Language Support 
Operating System Support 
Processor Control 
Floating Point 
Floating Point Control 


The Intel486 Microprocessor instructions are listed 
in Section 10. Note that all floating point unit instruc- 
tion mnemonics begin with an F. 


All Intel486 Microprocessor instructions operate on 
either 0, 1, 2 or 3 operands; where an operand re- 
sides in a register, in the instruction itself or in mem- 
ory. Most zero operand instructions (e.g., CLI, STI) 
take only one byte. One operand instructions gener- 
ally are two bytes long. The average instruction is 
3.2 bytes long. Since the Intel486 Microprocessor 
has a 32-byte instruction queue, an average of 10 
instructions will be prefetched. The use of two oper- 
ands permits the following types of common instruc- 
tions: 


Register to Register 
Memory to Register 
Memory to Memory 
Immediate to Register 
Register to Memory 
Immediate to Memory 


The operands can be either 8, 16, or 32 bits long. As 
a general rule, when executing code written for the 
Intel486 or 386 Microprocessors (32-bit code), oper- 
ands are 8 or 32 bits; when executing existing 80286 
or 8086 code (16-bit code), operands are 8 or 16 
bits. Prefixes can be added to all instructions which 
override the default length of the operands (Le., use 
32-bit operands for 16-bit code, or 16-bit operands 
for 32-bit code). 


Memory on the Intel486 Microprocessor is divided 
up into 8-bit quantities (bytes), 16-bit quantities 
(words), and 32-bit quantities (dwords). Words are 
stored in two consecutive bytes in memory with the 
low-order byte at the lowest address, the high order 


byte at the high address. Dwords are stored in four 
consecutive bytes in memory with the low-order byte 
at the lowest address, the high-order byte at the 
highest address. The address of a word or dword is 
the byte address of the low-order byte. 


In addition to these basic data types, the Intel486 
Microprocessor supports two larger units of memory: 
pages and segments. Memory can be divided up 
into one or more variable length segments, which 
can be swapped to disk or shared between pro- 
grams. Memory can also be organized into one or 
more 4 Kbyte pages. Finally, both segmentation and 
paging can be combined, gaining the advantages of 
both systems. The Intel486 Microprocessor sup- 
ports both pages and segments in order to provide 
maximum flexibility to the system designer. Segmen- 
tation and paging are complementary. Segmentation 
is useful for organizing memory in logical modules, 
and as such is a tool for the application programmer, 
while pages are useful for the system programmer 
for managing the physical memory of a system. 


The Intel486 Microprocessor has three distinct ad- 
dress spaces: logical, 
linear, and physical. 
A logi- 
cal address (also known as a virtual address) con- 
sists of a selector and an offset. A selector is the 
contents of a segment register. An offset is formed 
by summing all of the addressing components 
(BASE, INDEX, DISPLACEMENT)discussed in Sec- 
tion 2.5.3 Memory 
Addressing 
Modes 
into an ef- 
fective address. Since each task on the Intel486 Mi- 
croprocessor has a maximum of 16K (214 -1) 
se- 
lectors, and offsets can be 4 gigabytes, (232 bits) 
this gives a total of 246 bits or 64 terabytes of logI- 
cal address space per task. The programmer sees 
this virtual address space. 


The segmentation unit translates the logical 
ad- 
dress space into a 32-bit linear address space. If the 
paging unit is not enabled then the 32-bit linear ad- 
dress corresponds to the physical 
address. The 
paging unit translates the linear address space into 
the physical address space. The physical 
address 
is what appears on the address pins. 


The primary difference between Real Mode and Pro- 
tected Mode is how the segmentation unit performs 
the translation of the logical address into the linear 
address. In Real Mode, the segmentation unit shifts 
the selector left four bits and adds the result to the 
offset to form the linear address. While in Protected 
Mode every selector has a linear base address as- 
sociated with it. The linear base address is stored in 
one of two operating system tables (Le., the Local 
Descriptor Table or Global Descriptor Table). The 
selector's linear base address is added to the offset 
to form the final linear address. 
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EFFECTIVE 


ADDRESS 
3 
2 D 
LOGICAL OR 
SEGWENTATION 


VIRTUAL 
ADDRESS 
UNIT 
13 


DESCRIPTOR 
INDEX 


32 
PAGING UNIT 


LINEAR 
(OPTIONAL 
USE) 


ADDRESS 


32 


PHYSICAL 
ADDRESS 


Figure 2.16. Address Translation 


Figure 2.16 shows the relationship between the vari- 
and create a system with a four gigabyte Iinea~ad- 
ous address spaces. 
dress space. This creates a system where the virtual 
address space is the same as the Ii~ear addre~s 
space. Further details of segmentation are dls- 
2.3.2 
SEGMENT 
REGISTER 
USAGE 
cussed in Section 4.1. 


The main data structure used to organize memory is 
the segment. On the Intel486 Microprocessor, seg- 
ments are variable sized blocks of linear addresses 
which have certain attributes associated with them. 
There are two main types of segments: code and 
data the segments are of variable size and can be 
as s;"all as 1 byte or as large as 4 gigabytes (232 
bytes). 


In order to provide compact instruction encoding, 
and increase processor performance, instructions 
do not need to explicitly specify which segment reg- 
ister is used. A default segment register is automati- 
cally chosen according to the rules of Table 2.11 
(Segment Register Selection Rules). In general, data 
references use the selector contained in the OS reg- 
ister; Stack references use the SS register and In- 
struction fetches use the CS register. The contents 
of the Instruction Pointer provide the offset. Special 
segment override prefixes allow the ~xplicit u~e o! ~ 
given segment register, and over~lde th~ Implicit 
rules listed in Table 2.11. The overnde prefixes also 
allow the use of the ES, FS and GS segment regis- 
ters. 


There are no restrictions regarding the overlapping 
of the base addresses of any segments. Thus, all 6 
segments could have the base address set to zero 


The Intel486 Microprocessor has two distinct physi- 
cal address spaces: Memory and I/O. Generally, pe- 
ripherals are placed in I/O space although the In- 
tel486 
Microprocessor 
also 
supports 
memory- 


mapped peripherals. The I/O 
space consists of 
64 Kbytes, it can be divided into 64K 8-bit po~s, 3.2K 
16-bit ports, or 16K 32-bit ports, or any combination 
of ports which add up to less than 64 Kbytes. The 
64K I/O address space refers to physical memory 
rather than linear address since I/O instructions do 
not go through the segmentation or paging hard- 
ware. The M/IO# 
pin acts as an additional address 
line thus allowing the system designer to easily de- 
termine which address space the processor is ac- 
cessing. 


The I/O ports are accessed via the IN and OUT I/O 
instructions, with the port address supplied as an 
immediate 8-bit constant in the instruction or in the 
OXregister. All 8- and 16-bit port addresses are zero 
extended on the upper address lines. The I/O in- 
structions cause the M/IO# 
pin to be driven low. 


I/O port addresses OOF8Hthrough OOFFHare re- 
served for use by Intel. 
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Type 
of 
Implied (Default) 
Segment 
Override 
Memory 
Reference 
Segment 
Use 
Prefixes 
Possible 


Code Fetch 
CS 
None 


Destination 
of PUSH, PUSHF, INT, 
SS 
None 
CALL, PUSHA Instructions 


Source of POP, POPA, POPF, 
SS 
None 
IRET, RET instructions 


Destination 
of STOS, MOVS, REP 
ES 
None 
STOS, REP MOVS Instructions 
(DI is Base Register) 


Other Data References, 
with 
Effective 
Address 
Using Base 
Register of: 


[EAX) 
DS 
[EBX) 
DS 
[ECX) 
DS 
[EDX) 
OS 
All 
[ESI) 
DS 
[EDI) 
DS 
[EBP) 
SS 
[ESP) 
SS 


The Intel486 
Microprocessor 
provides 
a total of 11 
addressing 
modes 
for instructions 
to specify 
oper- 
ands. The addressing 
modes 
are optimized 
to allow 
the efficient 
execution 
of high level languages 
such 
as C and FORTRAN, 
and they cover the vast majori- 
ty 
of 
data 
references 
needed 
by 
high-level 
lan- 
guages. 


Two 
of the 
addressing 
modes 
provide 
for instruc- 
tions 
that 
operate 
on 
register 
or immediate 
oper- 
ands: 


Register 
Operand 
Mode: The operand 
is located 
in 
one of the 8-, 16- or 32-bit general 
registers. 


Immediate 
Operand 
Mode: The operand 
is includ- 
ed in the instruction 
as part of the opcode. 


The 
remaining 
9 modes 
provide 
a mechanism 
for 
specifying 
the effective 
address 
of an operand. 
The 
linear address 
consists 
of two components: 
the seg- 
ment 
base 
address 
and an effective 
address. 
The 
effective 
address 
is calculated 
by using 
combina- 
tions of the following 
four address 
elements: 


DISPLACEMENT: 
An 8-, or 32-bit 
immediate 
value, 


following 
the instruction. 


BASE: The contents 
of any general 
purpose 
regis- 
ter. The base registers 
are generally 
used by compil- 
ers to point to the start of the local variable 
area. 


INDEX: The contents 
of any general 
purpose 
regis- 


ter except 
for ESP. The index registers 
are used to 
access 
the elements 
of an array, or a string of char- 


acters. 


SCALE: The index register's 
value can be multiplied 
by a scale 
factor, 
either 
1, 2, 4 or 8. Scaled 
index 


• 


mode is especially useful for accessing arrays or 
structures. 


Combinations of these 4 components make up the 9 
additional addressing modes. There is no perform- 
ance penalty for using any of these addressing com- 
binations, since the effective address calculation is 
pipelined with the execution of other instructions. 
The one exception is the simultaneous use of Base 
and Index components which requires one addition- 
al clock. 


As shown in Figure 2.17, the effective address (EA) 
of an operand is calculated according to the follow- 
ing formula. 


Direct Mode: The operand's offset is contained as 
part of the instruction as an 8-, 16- or 32-bit dis- 
placement. 
EXAMPLE: 
INC Word 
PTR [500] 


Register Indirect Mode: A BASE register contains 
the address of the operand. 
EXAMPLE: 
MOY 
[ECX], 
EDX 


Based Mode: A BASE register's contents is added 
to a DISPLACEMENT to form the operand's offset. 
EXAMPLE: 
MOY ECX, [EAX + 24] 


Index Mode: An INDEX register's contents is added 
to a DISPLACEMENT to form the operand's offset. 
EXAMPLE: 
ADD EAX, TABLE[ESI] 


Scaled Index Mode: An INDEX register's contents is 
multiplied by a scaling factor which is added to a 
DISPLACEMENT to form the operand's offset. 
EXAMPLE: 
IMUL EBX, TABLE[ESI·4],7 


Based Index Mode: The contents of a BASE register 
is added to the contents of an INDEX register to 
form the effective address of an operand. 
EXAMPLE: 
MOY EAX, [ESI] 
[EBX] 


Based Scaled Index Mode: The contents of an IN- 
DEX register is multiplied by a SCALING factor and 
the result is added to the contents of a BASE regis- 
ter to obtain the operand's offset. 
EXAMPLE: 
MOY ECX, [EDX·S] 
[EAX] 
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fS 
ES 
os 
-cs 
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GS 
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os 
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,/ 
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'" 
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------~ 
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Based Index Mode with Displacement: The contents 
of an INDEX Register and a BASE register's con- 
tents and a DISPLACEMENT are all summed to- 
gether to form the operand offset. 
EXAMPLE: 
ADD EDX, [ESI] 
[EBP + OOFFFFFOH] 


Based Scaled Index Mode with Displacement: The 
contents of an INDEX register are multiplied by a 
SCALING factor, the result is added to the contents 
of a BASE register and a DISPLACEMENT to form 
the operand's offset. 
EXAMPLE: 
MOV 
EAX, 
LOCALTABLE[EDI·4] 
[EBP+80] 


2.5.4 
DIFFERENCES 
BETWEEN 
16- AND 32-BIT 
ADDRESSES 


In order to provide software compatibility with the 
80286 and the 8086, the Intel486 Microprocessor 
can execute 16-bit instructions in Real and Protect- 
ed Modes. The processor determines the size of the 
instructions it is executing by examining the 0 bit in 
the CS segment Descriptor. If the 0 bit is 0 then all 
operand lengths and effective addresses are as- 
sumed to be 16 bits long. If the 0 bit is 1 then the 
default length for operands and addresses is 32 bits. 
In Real Mode the default size for operands and ad- 
dresses is 16-bits. 


Regardless of the default precision of the operands 
or addresses, the Intel486 Microprocessor is able to 
execute either 16- or 32·bit instructions. This is 
specified via the use of override prefixes. Two prefix- 
es, the 
Operand 
Size 
Prefix 
and the Address 
Length 
Prefix, override the value of the 0 bit on an 
individual instruction basis. These prefixes are auto- 
matically added by Intel assemblers. 


Example: The processor is executing in Real Mode 
and the programmer needs to access the EAX regis- 
ters. The assembler code for this might be MOV 
EAX, 32-bit MEMORYOP, ASM486 Macro Assem- 
bler automatically determines that an Operand Size 
Prefix is needed and generates it. 


Address Length Prefix since, with 0 =0, the default 
addressing mode is 16-bits. 


Example: The 0 bit is 1, and the program wants to 
store a 16-bit quantity. The Operand Length Prefix is 
used to specify only a 16-bit value; MOV MEM16, 
OX. 


The OPERAND LENGTH and Address Length Pre· 
fixes can be applied separately or in combination to 
any instruction. The Address Length Prefix does not 
allow addresses over 64 Kbytes to be accessed in 
Real Mode. A memory address which exceeds 
FFFFH will result in a General Protection Fault. An 
Address Length Prefix only allows the use of the ad- 
ditional Intel486 Microprocessor addressing modes. 


When executing 32-bit code, the Intel486 Microproc- 
essor uses either 8-, or 32-bit displacements, and 
any register can be used as base or index registers. 
When executing 16-bit code, the displacements are 
either 8, or 16 bits, and the base and index register 
conform to the 80286 model. Table 2.12 illustrates 
the differences. 


The Intel486 Microprocessor can support a wide va- 
riety of data types. In the following descriptions, the 
on-chip floating point unit (FPU) consists of the float- 
ing point registers. The central processing 
unit 
(CPU) consists of the base architecture registers. 


2.6.1.1 
Unsigned 
Data Types 


The FPU does not support unsigned data types. Re- 
fer to Table 2.13. 
'Byte: 
Unsigned 8-bit quantity 
Word: 
Unsigned 16-bit quantity 
Dword: Unsigned 32-bit quantity 


The least significant bit (LSB) in a byte is bit 0, and 
the most significant bit is 7. 


Example: The 0 bit is 0, and the programmer wishes 
to use Scaled Index addressing mode to access an 
array. The Address Length Prefix allows the use of 
MOV OX, TABLE[ESI·2]. 
The assembler uses an 


Table 2.12. BASE and INDEX Registers 
for 16· and 32·BI~Addresses 


16·Blt Addressing 
32·Blt Addressing 


BASE REGISTER 
BX,BP 
Any 32-bit GP Register 
INDEX REGISTER 
SI,DI 
Any 32-bit GP Register 
Except ESP 
SCALE FACTOR 
none 
1,2,4,8 
DISPLACEMENT 
0,8,16 bits 
0,8,32 bits 


• 


2.6.1.2 
Signed 
Data Types 


All signed data types assume 2's complement nota- 
tion. The signed data types contain two fields, a sign 
bit and a magnitude. The sign bit is the most signifi- 
cant bit (MSB). The number is negative if the sign bit 
is 1. If the sign bit is 0, the number is positive. The 
magnitude field consists of the remaining bits in the 
number. Refer to Table 2.13. 
8-bit Integer: 
Signed 8-bit quantity 
16-bit Integer: Signed 16-bit quantity 


32-bit Integer: Signed 32-bit quantity 
64·bit Integer: Signed 64-bit quantity 


The FPU only supports 16-, 32- and 64-bit integers. 
The CPU only supports 8-, 16- and 32·bit integers. 


2.6.1.3 
Floating 
Point Data Types 


Floating point data type in the Intel486 Microproces- 
sor contain three fields, sign, significand and expo- 
nent. The sign field is one bit and is the MSB of the 
floating point number. The number is negative if the 
sign bit is 1. If the sign bit is 0, the number is posi- 
tive. The significand gives the significant bits of the 
number. The exponent field contains the power of 2 
needed to scale the significand. Refer to Table 2.13. 


Only the FPU supports floating point data types. 
Single Precision Real: 
23-bit significand and 8- 
bit exponent. 32 bits total. 


Double Precision Real: 
52-bit significand and 11- 
bit exponent. 64 bits total. 


Extended Precision Real: 64-bit significand and 15- 
bit exponent. 80 bits total. 
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2.6.1.4 
BCD Data Types 


The Intel486 Microprocessor supports packed and 
unpacked binary coded decimal (BCD) data types. A 
packed BCD data type contains two digits per byte, 
the lower digit is in bits 0-3 
and the upper digit in 
bits 4-7. 
An unpacked BCD data type contains 1 
digit per byte stored in bits 0-3. 


The CPU supports 8-bit packed and unpacked BCD 
data types. The FPU only supports 80-bit packed 
BCD data types. Refer to Table 2.13. 


2.6.1.5 
String Data Types 


A string data type is a contiguous sequence of bits, 
bytes, words or dwords. A string may contain be- 
tween 1 byte and 4 Gbytes. Refer to Table 2.14. 


String data types are only supported by the CPU. 


Dword String: Contiguous sequence of dwords. 


Bit String: A set of contiguous bits. In the Intel486 
Microprocessor bit strings can be up to 4 gigabits 
long. 


2.6.1.6 
ASCII Data Types 


The Intel486 Microprocessor supports ASCII (Ameri- 
can Standard Code for Information Interchange) 
strings and can perform arithmetic operations (such 
as addition and division) on ASCII data. Refer to Ta- 
ble 2.14. 
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Table 2.13. Intel486TM Microprocessor 
Data Types 


~by~by 
_Reglom. 
FPU 
.j, .j, 


Data Format 
Range 
Precision 
7 
017 
017 
017 
017 
017 
017 
017 
017 
017 
0 


Byte 
X 
0-255 
8 bits 
C 


15 
0 


Word 
X 
0-64K 
16 bits 
I 


31 
0 


Dword 
X 
0-4G 
32 bits 
I 


8-Bit Integer 
X 
102 
8 bits 
Two'. 
.~ 
Compfement 


Sign B~ f 


15 
0 


16-Bit Integer 
X X 
104 
16 bits 
Two'. 
I I 
Complement 


Sign~ 
f 


31 
0 


32-Bit Integer 
X X 
109 
32 bits 
~tl! 
Sign B~ f 


63 
0 


64-Bit Integer 
X 
1019 
64 bits 
Two'. 
I I 
Complement 


Sign B~ f 


7 
0 


8-Bit Unpacked 
BCD 
X 
0-9 
1 Digit 
One BCD Digit per Byte' 


7 
0 


8-Bit Packed 
BCD 
X 
0-9 
2 Digits 
Two BCD Dig~. per Byte! 


79 
72 
0 


80-Bit Packed 
BCD 
X 
±10±18 
18 Digits 
!9'O<ed ! 
f 
Sign Bit 


31 
23 
0 


Single Precision 
Real 
X 
± 10±38 
24 Bits 
IIB~I 
Signiflcand 


Sign B~ f 


63 
52 
0 


Double 
Precision 
Real 
X 
± 1O±308 
53 Bits 
I!B~I 
Significand 


Sign B~ t 


79 
63 
0 


Extended 
Precision 
Real 
X 
± 10±4932 
64 Bits 
I 


Biased 
11 
Signilicand 
Exp. 


t 
Sign B~ 


• 


A+ND 


A+l 
A 
I"1TOI 
~ 


A+4N+3 
A+4N+2 
A+4N+1 


Dwordl 
~ 
String 31 


A+4N 


01 


A+2N+l 
A+2N 
A+3 
A+2 
A+l 
A 


115 


I 


01 
115 
0115 
01 
N 


A+7 
A+6 
A+5 
AH 
A+3 
A+2 
A+l 
A 


131 
0131 
01 


A+ 268,435,455 
J- 


Bit~}} 
String~(( 
7 


, t 
+ 2,147,483.647 


A- 268,435,456 


J- 
ollEy 


-2,147,483,848 
D 


2.6.1.7 
Pointer 
Data Types 


A pointer 
data type contains 
a value that gives the 
address 
of a piece of data. The Intel486 
Microproc- 
essor supports 
two types of pointers. 
Refer to Table 
2.15. 


Table 2.15. Pointer 
Data Types 


Leu. 
611 Byte 
J- 


2.6.2 
LITTLE 
ENDIAN 
vs BIG ENDIAN 
DATA 
FORMATS 


The Intel486 Microprocessor, as well as all other 
members of the 86 architecture use the "Iittle-endi- 
an" method for storing data types that are larger 
than one byte. Words are stored in two consecutive 
bytes in memory with the low-order byte at the low- 
est address and the high order byte at the high ad- 
dress. Dwords are stored in four consecutive bytes 
in memory with the low-order byte at the lowest ad- 
dress and the high order byte at the highest address. 
The address of a word or dword data item is the byte 
address of the low-order byte. 


Figure 2.18 illustrates the differences between the 
big-endian and Iittle-endian formats for dwords. The 
32 bits of data are shown with the low order bit num- 
bered bit 0 and the high order bit numbered 32. Big- 
endian data is stored with the high-order bits at the 
lowest addressed byte. Little-endian data is stored 
with the high-order bits in the highest addressed 
byte. 


The Intel486 Microprocessor has two instructions 
which can convert 16- or 32-bit data between the 
two byte orderings. BSWAP (byte swap) handles 
four byte values and XCHG (exchange) handles two 
byte values. 


Interrupts and exceptions alter the normal program 
flow, in order to handle external events, to report 
errors or exceptional conditions. The difference be- 
tween interrupts and exceptions is that interrupts are 
used to handle asynchronous external events while 
exceptions handle instruction faults. Although a pro- 
gram can generate a software interrupt via an INT N 
instruction, the processor treats software interrupts 
as exceptions. 


Haraware Interrupts occur as me resUlt Of an exter- 
nal event and are classified into two types: maskable 
or non-maskable. Interrupts are serviced after the 
execution of the current instruction. After the inter· 
rupt handler is finished servicing the interrupt, exe- 
cution proceeds with the instruction immediately af· 
ter the interrupted instruction. Sections 2.7.3 and 
2.7.4 discuss the differences between Maskable and 
Non-Maskable interrupts. 


Exceptions are classified as faults, traps, or aborts 
depending on the way they are reported, and wheth- 
er or not restart of the instruction causing the excep- 
tion is supported. Faults are exceptions that are de- 
tected and serviced before 
the execution of the 
faulting instruction. A fault would occur in a virtual 
memory system, when the processor referenced a 
page or a segment which was not present. The oper- 
ating system would fetch the page or segment from 
disk, and then the Intel486 Microprocessor would re- 
start the instruction. Traps are exceptions that are 
reported immediately after the execution of the in- 
struction which caused the problem. User defined 
interrupts are examples of traps. Aborts are excep- 
tions which do not permit the precise location of the 
instruction causing the exception to be determined. 
Aborts are used to report severe errors, such as a 
hardware error, or illegal values in system tables. 


Thus, when an interrupt service routine has been 
completed, execution proceeds from the instruction 
immediately following the interrupted instruction. On 
the other hand, the return address from an excep- 
tion fault routine will always point at the instruction 
causing the exception and include any leading in- 
struction prefixes. Table 2.16 summarizes the possi- 
ble interrupts for the Intel486 Microprocessor and 
shows where the return address points. 


The Intel486 Microprocessor has the ability to han- 
dle up to 256 different interrupts/exceptions. In or- 
der to service the interrupts, a table with up to 256 
interrupt vectors must be defined. The interrupt vec- 
tors are simply pointers to the appropriate interrupt 
service routine. In Real Mode (see Section 3.1), the 
vectors are 4 byte quantities, a Code Segment plus 
a 16-bit offset; in Protected Mode, the interrupt vec- 
tors are 8 byte quantities, which are put in an Inter- 
rupt Descriptor Table (see Section 4.3.3.4). Of the 
256 possible interrupts, 32 are reserved for use by 
Intel, the remaining 224 are free to be used by the 
system designer. 


When an interrupt occurs the following actions hap- 
pen. First, the current program address and the 
Flags are saved on the stack to allow resumption of 
the interrupted program. Next, an 8-bit vector is sup- 


plied to the Intel486 
Microprocessor 
which identifies 
the appropriate 
entry in the interrupt 
table. The table 
contains 
the starting 
address 
of the interrupt 
service 
routine. 
Then, 
the 
user 
supplied 
interrupt 
service 
routine 
is executed. 
Finally, 
when 
an IRET instruc- 
tion is executed 
the old processor 
state is restored 
and program 
execution 
resumes 
at the appropriate 
instruction. 


The 8-bit interrupt 
vector 
is supplied 
to the Intel486 
Microprocessor 
in several different 
ways: exceptions 
supply 
the 
interrupt 
vector 
internally; 
software 
INT 
instructions 
contain 
or imply 
the 
vector; 
maskable 
hardware 
interrupts 
supply 
the 8-bit vector 
via the 


interrupt 
acknowledge 
bus sequence. 
Non-Maska- 
ble 
hardware 
interrupts 
are 
assigned 
to 
interrupt 
vector 
2. 
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Maskable 
interrupts 
are the most common 
way used 
by the Intel486 
Microprocessor 
to respond 
to asyn- 
chronous 
external 
hardware 
events. 
A hardware 
in- 
terrupt 
occurs 
when 
the 
INTR 
is pulled 
high 
and 
the Interrupt 
Flag bit (IF) is enabled. 
The processor 
only 
responds 
to 
interrupts 
between 
instructions, 


(REPeat 
String 
instructions,· 
have an "interrupt 
win- 
dow", 
between 
memory 
moves, 
which 
allows 
inter- 
rupts during 
long string 
moves). 
When 
an interrupt 
occurs 
the processor 
reads an 8-bit vector 
supplied 
by the hardware 
which 
identifies 
the source 
of the 
interrupt, 
(one of 224 user defined 
interrupts). 
The 
exact nature of the interrupt 
sequence 
is discussed 
in Section 
7.2.10. 


Instruction 
Which 
Return Address 


Function 
Interrupt 
Can Cause 
Points 
to 
Type 
Number 
Faulting 
Exception 
Instruction 


Divide Error 
0 
DIV,IDIV 
YES 
FAULT 


Debug Exception 
1 
Any Instruction 
YES 
TRAP' 


NMI Interrupt 
2 
INT 20r 
NMI 
NO 
NMI 


One Byte Interrupt 
3 
INT 
NO 
TRAP 


Interrupt 
on Overflow 
4 
INTO 
NO 
TRAP 


Array Bounds Check 
5 
BOUND 
YES 
FAULT 


Invalid OP-Code 
6 
Any Illegal Instruction 
YES 
FAULT 


Device Not Available 
7 
ESC, WAIT 
YES 
FAULT 


Double Fault 
8 
Any Instruction 
That Can 
ABORT 


Generate 
an Exception 


Intel Reserved 
9 


InvalidTSS 
10 
JMP, CALL, IRET, INT 
YES 
FAULT 


Segment 
Not Present 
11 
Segment 
Register 
Instructions 
YES 
FAULT 


Stack Fault 
12 
Stack References 
YES 
FAULT 


General 
Protection 
Fault 
13 
Any Memory 
Reference 
YES 
FAULT 


Page Fault 
14 
Any Memory Access 
or Code Fetch 
YES 
FAULT 


Intel Reserved 
15 


Floating 
Point Error 
16 
Floating Point, WAIT 
YES 
FAULT 


Alignment 
Check Interrupt 
17 
Unaligned 
Memory Access 
YES 
FAULT 


Intel Reserved 
18-31 


Two Byte Interrupt 
0-255 
INTn 
NO 
TRAP 


The IF bit in the EFLAG registers is reset when an 
interrupt is being serviced. This effectively disables 
servicing additional interrupts during an interrupt 
service routine. However, the IF may be set explicitly 
by the interrupt handler, to allow the nesting of inter- 
rupts. When an IRET instruction is executed the 
original state of the IF is restored. 


Non-maskable interrupts provide a method of servic- 
ing very high priority interrupts. A common example 
of the use of a non-maskable interrupt (NMI) would 
be to activate a power failure routine. When the NMI 
input is pulled high it causes an interrupt with an 
internally supplied vector value of 2. Unlike a normal 
hardware interrupt, no interrupt acknowledgment se- 
quence is performed for an NMI. 


While executing the NMI servicing procedure, the In- 
tel486 Microprocessor will not service further NMI 
requests until an interrupt return (IRET) instruction is 
executed or the processor is reset. If NMI occurs 
while currently servicing an NMI, its presence will be 
saved for servicing after executing the first IRET in- 
struction. The IF bit is cleared at the beginning of an 
NMI interrupt to inhibit further INTR interrupts. 


A third type of interrupt/exception for the Intel486 
Microprocessor is the software interrupt. An INT n 
instruction causes the processor to execute the in- 
terrupt service routine pointed to by the nth vector in 
the interrupt table. 


A special case of the two byte software interrupt INT 
n is the one byte INT 3, or breakpoint interrupt. By 
inserting this one byte instruction in a program, the 
user can set breakpoints in his program as a debug- 
ging tool. 


A final type of software interrupt is the single step 
interrupt. It is discussed in Section 9.2. 


2.7.6 INTERRUPT AND EXCEPTION 
PRIORITIES 


Interrupts are externally-generated events. Maska- 
ble Interrupts (on the INTR input) and Non-Maskable 
Interrupts (on the NMI input) are recognized at in- 
struction 
boundaries. When 
NMI and 
maskable 
INTR are both recognized at the same instruction 
boundary, the Intel486 Microprocessor invokes the 
NMI service routine first. If, after the NMI service 
routine has been invoked, maskable interrupts are 
still enabled, then the Intel486 Microprocessor will 
invoke the appropriate interrupt service routine. 


Table 2.17a. Intel486TM Microprocessor 
Priority 
for Invoking Service Routines In Case 


_ 


__ 
O_f_S_I_m_U_lt_a_n_e_O_U_S_E_x_t_er_n_a_I_ln_t_e_rr_u_p_tS 
I_ 
1. NMI 
2.INTR 


Exceptions are internally-generated events. Excep- 
tions are detected by the Intel486 Microprocessor if, 
in the course of executing an instruction, the In- 
tel486 Microprocessor detects a problematic condi- 
tion. The Intel486 Microprocessor then immediately 
invokes the appropriate exception service routine. 
The state of the Intel486 Microprocessor is such 
that the instruction causing the exception can be re- 
started. If the exception service routine has taken 
care of the problematic condition, the instruction will 
execute without causing the same exception. 


It is possible for a single instruction to generate sev- 
eral exceptions (for example, transferring a single 
operand could generate two page faults if the oper- 
and location spans two "not present" pages). How- 
ever, only one exception is generated upon each at- 
tempt to execute the instruction. Each exception 
service routine should correct its corresponding ex- 
ception, and restart the instruction. In this manner, 
exceptions are serviced until the instruction exe- 
cutes successfully. 


As the Intel486 Microprocessor executes instruc- 
tions, it follows a consistent cycle in checking for 
exceptions, as shown in Table 2.17b. This cycle is 
repeated as each instruction is executed, and oc- 
curs in parallel with instruction decoding and execu- 
tion. 


Consider the case of the Intel486 Microproces- 
sor having just completed an instruction. It then 
performs the following checks before reaching 
the point where the next instruction is completed: 
1. Check for Exception 1 Traps from the instruc- 
tion just completed (single-step via Trap Flag, 
or Data Breakpoints set in the Debug Regis- 
ters). 


2. Check for Exception 1 Faults in the next in- 
struction 
(Instruction 
Execution Breakpoint 
set in the Debug Registers for the next in- 
struction). 
3. Check for external NMI and INTR. 


4. Check for Segmentation Faults that prevent- 
ed fetching the entire next instruction (excep- 
tions 11 or 13). 


5. Check for Page Faults that prevented fetching 
the entire next instruction (exception 14). 
6. Check for Faults decoding the next instruction 
(exception 6 if illegal opcode; exception 6 if in 
Real Mode or in Virtual 8086 Mode and at- 
tempting to execute an instruction for Protect- 
ed Mode only (see Section 4.6.4); or excep- 
tion 13 if instruction is longer than 15 bytes, or 
privilege violation in Protected Mode (Le., not 
at IOPL or at CPL= 0). 
7. If WAIT opcode, check if TS= 1 and MP= 1 
(exception 7 if both are 1). 
8. If opcode for Floating Point Unit, check if 
EM= 1 or TS = 1 (exception 7 if either are 1). 
9. If opcode for Floating Point Unit (FPU), check 
FPU error status (exception 16 if error status 
is asserted). 
10. Check in the following order for each meme- 
ry reference required by the instruction: 
a. Check for Segmentation Faults that pre- 
vent transferring the entire memory quan- 
tity (exceptions 11, 12, 13). 


b. Check for Page Faults that prevent trans- 
ferring the entire memory quantity (ex- 
ception 14). 


NOTE: 
The order stated supports the concept of the 
paging mechanism being "underneath" the seg- 
mentation mechanism. Therefore, for any given 
code or data reference in memory, segmenta- 
tion exceptions are generated.before paging ex- 
ceptions are generated. 


The Intel486 Microprocessor fully supports restart- 
ing all instructions after faults. If an exception is de- 
tected in the instruction to be executed (exception 
categories 4 through 10 in Table 2.17b), the Intel486 
Microprocessor invokes the appropriate exception 
service routine. The Intel486 Microprocessor is in a 
state that permits restart of the instruction, for all 
cases but those in Table 2.17c. Note that all such 
cases are easily avoided by proper design of the 
operating system. 


Table 2.17c. Conditions 
Preventing 
Instruction 
Restart 


An instruction causes a task switch to a task 
whose Task State Segment is partially 
"not 
present". (An entirely "not present" TSS is re- 
startable.) Partially present TSS's can be avoid- 
ed either by keeping the TSS's of such tasks 
present in memory, or by aligning TSS segments 
to reside entirely within a single 4K page (for TSS 
segments of 4 Kbytes or less). 


NOTE: 
These conditions are avoided by using the oper- 
ating system designs mentioned in this table. 


A Double Fault (exception 8) results when the proc- 
essor attempts to invoke an exception service rou- 
tine for the segment exceptions (10, 11, 12 or 13), 
but in the process of doing so, detects an exception 
other than a Page Fault (exception 14). 


A Double Fault (exception 8) will also be generated 
when the processor attempts to invoke the Page 
Fault (exception 14) service routine, and detects an 
exception other than a second Page Fault. In any 
functional system, the entire Page Fault service rou- 
tine must remain "present" in memory. 


When a Double Fault occurs, the Intel486 Micro- 
processor invokes the exception service routine for 
exception 8. 


Several interrupt vectors of the Intel486 Microproc- 
essor are used to report exceptional conditions 
while executing numeric programs in either real or 
protected mode. Table 2.18 shows these interrupts 
and their causes. 
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Interrupt 
Cause of Interrupt 
Number 


7 
A Floating 
Point instruction 
was encountered 
when EM or TS of the Intel486™ 
Processor 
control 
register zero (CRO) was set. EM = 1 indicates 
that software 
emulation 
of the 
instruction 
is required. When TS is set, either a Floating 
Point or WAIT instruction 
causes 
interrupt 
7. This indicates 
that the current FPU context 
may not belong to the current task. 


13 
The first word or doubleword 
of a numeric operand 
is not entirely within the limit of its 
segment. 
The return address pushed onto the stack of the exception 
handler points at the 
Floating 
Point instruction 
that caused the exception, 
including 
any prefixes. The FPU has 
not executed 
this instruction; 
the instruction 
pointer and data pointer register refer to a 
previous, 
correctly 
executed 
instruction. 


16 
The previous 
numerics 
instruction 
caused an unmasked 
exception. 
The address of the 
faulty instruction 
and the address of its operand 
are stored in the instruction 
pointer and 
data pointer registers. 
Only Floating 
Point and WAIT instructions 
can cause this interrupt. 


The Intel486TM Processor 
return address 
pushed onto the stack of the exception 
handler 
points to a WAIT or Floating 
Point instruction 
(including 
prefixes). This instruction 
can be 
restarted 
after clearing the exception 
condition 
in the FPU. The FNINIT, FNCLEX, 


FNSTSW, 
FNSTENV, 
and FNSAVE instructions 
cannot cause this interrupt. • 


When the processor is reset or powered up it is ini- 
tialized in Real Mode. Real Mode has the same base 
architecture as the 8086, but allows access to the 
32·bit register set of the Intel486 Microprocessor. 
The addressing mechanism, memory size, interrupt 
handling, are all identical to the Real Mode on the 
80286. 


All of the Intel486 Microprocessor instructions are 
available in Real Mode (except those instructions 
listed in Section 4.6.4). The default operand size in 
Real Mode is 16 bits, just like the 8086. In order to 
use the 32-bit registers and addressing modes, over- 
ride prefixes must be used. In addition, the segment 
size on the Intel486 Microprocessor in Real Mode is 
64 Kbytes so 32-bit effective addresses must have a 
value less the OOOOFFFFH.The primary purpose of 
Real Mode is to set up the processor for Protected 
Mode Operation. 


The LOCK prefix on the Intel486 Microprocessor, 
even in Real.Mode, is more restrictive than on the 
80286. This is due to the addition of paging on the 
Intel486 Microprocessor in Protected Mode and Vir- 
tual 8086 Mode. Paging makes it impossible to guar- 
antee 
that 
repeated string 
instructions 
can 
be 
LOCKed. The Intel486 Microprocessor can't require 
that all pages holding the string be physically pres- 
ent in memory. Hence, a Page Fault (exception 14) 
might have to be taken during the repeated string 
instruction. Therefore the LOCK prefix can't be sup- 
ported during repeated string instructions. 


intel~ 


These are the only instruction forms where the 
LOCK prefix is legal on the Intel486 Microprocessor: 


Opcode 
Operands 
(Dest, Source) 


BITTestand 
Mem,Reg/immed 
SET/RESET/COMPLEMENT 
XCHG 
Reg,Mem 
XCHG 
Mem,Reg 
ADD,OR,ADC,SBB, 
Mem,Reg/immed 
AND,SUB,XOR 
NOT,NEG,INC,DEC 
Mem 
CMPXCHG,XADD 
Mem,Reg 


An exception 6 will be generated if a LOCK prefix is 
placed before any instruction form or opcode not 
listed above. The LOCK prefix allows indivisible 
read/modify/write 
operations on memory operands 
using the instructions above. For example, even the 
ADD Reg, Mem is not LOCKable, because the Mem 
operand is not the destination (and therefore no 
memory read/modify/operation is being performed). 


Since, on the Intel486 Microprocessor, repeated 
string instructions are not LOCKable, it is not possi- 
ble to LOCK the bus for a long period of time. There· 
fore, the LOCK prefix is not IOPL-sensitive on the 
Intel486 Microprocessor. The LOCK prefix can be 
used at any privilege level, but only on the instruc- 
tion forms listed above. 


3.2 
Memory Addressing 


In Real Mode the maximum memory size is limited to 
1 megabyte. Thus, only address lines A2-A19 
are 
active. (Exception, after RESET address lines A20- 
A31 are high during CS-relative memory cycles until 
an intersegment jump or call is executed (see Sec- 
tion 6.5». 


MAX LIMIT 
FIXED AT 64K 
IN 
REAL MODE 


Since paging is not allowed in Real Mode the linear 
addresses are the same as physical addresses. 
Physical addresses are formed in Real Mode by 
adding the contents of the appropriate segment reg- 
ister which is shifted left by four bits to an effective 
address. This addition results in a physical address 
from OOOOOOOOH 
to 0010FFEFH. This is compatible 
with 80286 Real Mode. Since segment registers are 
shifted left by 4 bits, Real Mode segments always 
start on 16 byte boundaries. 


All segments in Real Mode are exactly 64 Kbytes 
long, and may be read, written, or executed. The 
Intel486 Microprocessor will generate an exception 
13 if a data operand or instruction fetch occurs past 
the end of a segment (Le., if an operand has an 
offset greater than FFFFH, for example a word with 
a low byte at FFFFH and the high byte at OOOOH). 


Segments may be overlapped in Real Mode. Thus, if 
a particular segment does not use all 64 Kbytes an- 
other segment can be overlayed on top of the un- 
used portion of the previous segment. This allows 
the programmer to minimize the amount of physical 
memory needed for a program. 


There are two fixed areas in memory which are re- 
served in Real address mode: system initialization 
area and the interrupt table area. Locations OOOOOH 
through 003FFH are reserved for interrupt vectors. 
Each one of the 256 possible interrupts has a 4-byte 
jump vector reserved for it. Locations FFFFFFFOH 
through FFFFFFFFH are reserved for system initiali- 
zation. 


Many of the exceptions shown in Table 2.16 and 
discussed in Section 2.7 are not applicable to Real 
Mode operation, in particular exceptions 10, 11, 14, 
17, will not happen in Real Mode. Other exceptions 
have slightly different meanings in Real Mode; Table 
3.1 identifies these exceptions. 


The HLT instruction stops program execution and 
prevents the processor from using the local bus until 
restarted. Either NMI, INTR with interrupts enabled 
(IF= 1), or RESET will force the Intel486 Microproc- 
essor out of halt. If interrupted, the saved CS:IP will 
point to the next instruction after the HLT. 


As in the case in protected mode, the shutdown will 
occur when a severe error is detected that prevents 
further processing. In Real Mode, shutdown can oc- 
cur under two conditions: 


An interrupt or an exception occur (exceptions 8 or 
13) and the interrupt vector is larger than the Inter- 
rupt Descriptor Table (Le., there is not an interrupt 
handler for the interrupt). 


A CALL, INT or PUSH instruction attempts to wrap 
around the stack segment when SP is not even (Le., 
pushing a value on the stack when SP = 0001 re- 
sulting in a stack segment greater than FFFFH). 


An NMI input can bring the processor out of shut- 
down if the Interrupt Descriptor Table limit is large 
enough to contain the NMI interrupt vector (at least 
0017H) and the stack has enough room to contain 
the vector and flag information (Le., SP is greater 
than 0005H). If these conditions are not met, the 
Intel486 CPU is unable to execute the NMI and exe- 
cutes another shutdown cycle. In this case, the proc- 
essor remains in the shutdown and can only exit via 
the RESET input. 


Table 3.1. Exceptions with Different Meanings in Real Mode (see Table 2.16) 
I 
Function 
Interrupt 
Related 
Return 
Number 
Instructions 
Address Location 


Interrupt table limit too small 
8 
INT Vector is not 
Before 
within table limit 
Instruction 


CS, OS,ES, FS, GS 
13 
Word memory reference 
Before 
Segment overrun exception 
beyond offset = FFFFH. 
Instruction 
An attempt to execute 
past the end of CS segment. 


SS Segment overrun exception 
12 
Stack Reference 
Before 
beyond offset = FFFFH 
Instruction 


• 


4.0 
PROTECTED 
MODE 
ARCHITECTURE 


The complete capabilities of the Intel486 Microproc- 
essor are unlocked when the processor operates in 
Protected Virtual Address Mode (Protected Mode). 
Protected Mode vastly increases the linear address 
space to four gigabytes (232 bytes) and allows the 
running of virtual memory programs of almost unlim- 
ited size (64 terabytes or 246 bytes). In addition Pro- 
tected Mode allows the Intel486 Microprocessor to 
run all of the existing 8086, 80286 and 386 micro- 
processor software, while providing a sophisticated 
memory management and a hardware-assisted pro- 
tection mechanism. Protected Mode allows the use 
of additional instructions especially optimized for 
supporting multitasking operating systems. The base 
architecture of the Intel486 Microprocessor remains 
the same, the registers, instructions, and addressing 
modes described in the previous sections are re- 
tained. The 
main difference 
between 
Protected 


Mode, and Real Mode from a programmer's view is 
the increased address space, and a different ad- 
dressing mechanism. 


Like Real Mode, Protected Mode uses two compo- 
nents to form the logical address, a 16-bit selector is 
used to determine the linear base address of a seg- 
ment, the base address is added to a 32-bit effective 
address to form a 32-bit linear address. The linear 
address is then either used as the 32-bit physical 
address, or if paging is enabled the paging mecha- 
nism maps the 32-bit linear address into a 32-bit 
physical address. 


The difference between the two modes lies in calcu- 
lating the base address. In Protected Mode the se- 
lector is used to specify an index into an operating 
system defined table (see Figure 4.1). The table 
contains the 32-bit base address of a given seg- 
ment. The physical address is formed by adding the 
base address obtained from the table to the offset. 


Paging provides an additional memory management 
mechanism which operates only in Protected Mode. 
Paging provides a means of managing the very large 
segments of the Intel486 Microprocessor. As such, 
paging operates beneath segmentation. The paging 
mechanism translates the protected linear address 
which comes from the segmentation unit into a 
physical address. Figure 4.2 shows the complete In- 
tel486 Microprocessor addressing mechanism with 
paging enabled. 
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Segmentation is one method of memory manage- 
ment. Segmentation provides the basis for protec- 
tion. Segments are used to encapsulate regions of 
memory which have common attributes. For exam- 
ple, all of the code of a given program could be con- 
tained in a segment, or an operating system table 
may reside in a segment. All information about a 
segment is stored in an 8 byte data structure called 
a descriptor. All of the descriptors in a system are 
contained in tables recognized by hardware. 


The following terms are used throughout the discus- 
sion of descriptors, privilege levels and protection: 


PL: Privilege Level-one 
of the four hierarchical 
privilege levels. Level 0 is the most privileged level 
and level 3 is the least privileged. More privileged 
levels are numerically smaller than less privileged 
levels. 


RPL: Requestor Privilege Level-The 
privilege level 
of the original supplier of the selector. RPL is deter- 
mined by the least two significant bits of a selector. 


DPL: 
Descriptor Privilege Level-This 
is the least 
privileged level at which a task may access that de- 
scriptor (and the segment associated with that de- 
scriptor). Descriptor Privilege Level is determined by 
bits 6:5 in the Access Right Byte of a descriptor. 


CPL: Current Privilege Level-The 
privilege level at 
which a task is currently executing, which equals the 
privilege level of the code segment being executed. 
CPL can also be determined by examining the low- 
est 2 bits of the CS register, except for conforming 
code segments. 


EPL: Effective Privilege Level-The 
effective privi- 
lege level is the least privileged of the RPL and DPL. 
Since smaller privilege level values indicate greater 
privilege, EPL is the numerical maximum of RPL and 
DPL. 


Task: One instance of the execution of a program. 
Tasks are also referred to as processes. 


4.3.3.1 
Descriptor 
Tables 
Introduction 


The descriptor tables define all of the segments 
which are used in an Intel486 Microprocessor sys- 
tem. There are three types of tables on the Intel486 


Microprocessor which hold descriptors: the Global 
Descriptor Table, Local Descriptor Table, and the In- 
terrupt Descriptor Table. All of the tables are vari- 
able length memory arrays. They can range in size 
between 8 bytes and 64 Kbytes. Each table can hold 
up to 8192 8-byte descriptors. The upper 13 bits of a 
selector are used as an index into the descriptor ta- 
ble. The tables have registers associated with them 
which hold the 32-bit linear base address, and the 
16-bit limit of each table. 


Each of the tables has a register associated with it, 
the GDTR, LDTR, and the IDTR (see Figure 4.3). 
The LGDT, LLDT, and L1DT instructions, load the 
base and limit of the Global, Local, and Interrupt De- 
scriptor Tables, respectively, into the appropriate 
register. The SGDT, SLDT, and SIDT store the base 
and limit values. These tables are manipulated by 
the operating system. Therefore, the load descriptor 
table instructions are privileged instructions. 
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4.3.3.2 
Global Descriptor Table 


The Global Descriptor Table (GDT) contains de- 
scriptors which are possibly available to all of the 
tasks in a system. The GDT can contain any type of 
segment descriptor except for descriptors which are 
used for servicing interrupts (Le., interrupt and trap 
descriptors). Every Intel486 Microprocessor system 
contains a GDT. Generally the GDT contains code 
and data segments used by the operating systems 
and task state segments, and descriptors for the 
LDTs in a system. 


The first slot of the Global Descriptor Table corre- 
sponds to the null selector and is not used. The null 
selector defines a null pointer value. 


4.3.3.3 
Local Descriptor Table 


LDTs contain descriptors which are associated with 
a given task. Generally, operating systems are de- 
signed so that each task has a separate LDT. The 
LDT may contain only code, data, stack, task gate, 
and call gate descriptors. LDTs provide a mecha- 
nism for isolating a given task's code and data seg- 
ments from the rest of the operating system, while 
the GDT contains descriptors for segments which 
are common to all tasks. A segment cannot be ac- 
cessed by a task if its segment descriptor does not 
exist in either the current LDT or the GDT. This pro- 
vides both isolation and protection for a task's seg- 
ments, while still allowing global data to be shared 
among tasks. 


Unlike the 6 byte GDT or IDT registers which contain 
a base address and limit, the visible portion of the 
LDT register contains only a 16-bit selector. This se- 
lector refers to a Local Descriptor Table descriptor in 
the GDT. 


4.3.3.4 
Interrupt Descriptor Table 


The third table needed for Intel486 Microprocessor 
systems is the Interrupt Descriptor Table. (See Fig- 
ure 4.4.) The IDT contains the descriptors which 
point to the location of up to 256 interrupt service 
routines. The IDT may contain only task gates, inter- 
rupt gates, and trap gates. The IDT should be at 
least 256 bytes in size in order to hold the descrip- 
tors for the 32 Intel Reserved Interrupts. Every inter- 
rupt used by a system must have an entry in the IDT. 
The IDT entries are referenced via INT instructions, 
external interrupt vectors, and exceptions. (See Sec- 
tion 2.7 Interrupts). 
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Figure 4.4. Interrupt Descriptor 
Table Register Use 
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4.3.4.1 Descriptor 
Attribute 
Bits 


The object to which the segment selector points to 
is called a descriptor. Descriptors are eight byte 
quantities which contain attributes about a given re- 
gion of linear address space (Le.,a segment). These 
attributes include the 32-bit base linear address of 
the segment, the 20-bit length and granularity of the 
segment, the protection level, read, write or execute 
privileges, the default size of the operands (16-bit or 
32-bit), and the type of segment. All of the attribute 
information about a segment is contained in 12 bits 
in the segment descriptor. Figure 4.5 shows the gen- 
eral format of a descriptor. All segments on the In- 
tel486 Microprocessor have three attribute fields in 
common: the P bit, the DPL bit, and the 8 bit. The 
Present P bit is 1 if the segment is loaded in physical 
memory, if P=0 then any attempt to access this 


segment causes a not present exception (exception 
11). The Descriptor Privilege Level DPL is a two-bit 
field which specifies the protection level 0-3 associ- 
ated with a segment. 


The Intel486 Microprocessor has two main catego- 
ries of segments: system segments and non-system 
segments (for code and data). The segment 8 bit in 
the segment descriptor determines if a given seg- 
ment is a system segment or a code or data seg- 
ment. If the S bit is 1 then the segment is either a 
code or data segment, if it is 0 then the segment is a 
system segment. 


4.3.4.2 Intel486TM CPU Code, Data Descriptors 
(8= 1) 


Figure 4.6 shows the general format of a code and 
data descriptor and Table 4.1 illustrates how the bits 
in the Access Rights Byte are interpreted. 


o 
BYTE 
ADDRESS 
o 


BASE 
23 ... 16 


BASE 
Base Address of the segment 
LIMIT 
The length of the segment 
P 
PresentBit 
1= Present 0= Not Present 
DPL 
DescriptorPrivilegeLevel 0-3 
5 
SegmentDescriptor 0= SystemDescriptor 1= Code or Data SegmentDescriptor 
TYPE 
Type of Segment 
A 
Accessed Bit 
G 
GranularityBit 
1= Segmentlength is page granular 
0= Segmentlength is byte granular 
D 
DefaultOperationSize (recognizedin code segmentdescriptorsonly) 
1= 32-bit segment 0= 16-bitsegment 
o 
Bit must be zero (0) for compatibilitywith future processors 
AVL 
Availablefield for user or as 


NOTE: 
In a maximum-sizesegment (i.e., a segment with G= 1 and segment limit 19...0= FFFFFH),the lowest 12 bits of the 
segment base should be zero (i.e.,segmentbase 11...000= OOOH). 


SEGMENT BASE 15 ... 0 
SEGMENT LIMIT 15 ... 0 


LIMIT 
ACCESS 
BASE 
BASE 31 ... 24 
G 
D 
0 
AVL 
19 ... 16 
RIGHTS 
23 ... 16 
BYTE 


D/B 
1= Default InstructionAttribu1esare 32-Bits 
0= Default InstructionAttributesare 16-Bits 
AVL 
Availablefield for user or as 
G 
GranularityBit 
1= Segmentlength is page granular 
0= Segmentlength is byte granular 
o 
Bit must be zero (0) for compatibilitywith future processors 
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Bit 
Name 
Function 
Position 


7 
Present (P) 
P = 1 
Segment 
is mapped 
into physical 
memory. 
P=O 
No mapping to physical 
memory exits, base and limit are 
not used. 
6-5 
Descriptor 
Privilege 
Segment 
privilege 
attribute 
used in privilege tests. 


Level (DPL) 
4 
Segment 
Descrip- 
S = 1 
Code or Data (includes 
stacks) 
segment 
descriptor. 
tor (S) 
S=O 
System Segment 
Descriptor 
or Gate Descriptor. 


3 
Executable 
(E) 
E - 0 
0'''';010' tyO';, d,ta ~gme"t 
} It 
2 
Expansion 
Direc- 
ED = 0 Expand up segment, 
offsets 
must be ::;;limit. 
Data 
tion (ED) 
ED = 1 Expand down segment, 
offsets 
must be > limit. 
Segment 
1 
Writeable 
(W) 
W = 0 
Data segment 
may not be written into. 
(S = 1, 
W = 
1 
Data segment 
may be written into. 
E = 0) 


3 
Executable 
(E) 
E = 
1 
Descriptor 
type is code segment: 
r 


2 
Conforming 
(C) 
C=1 
Code segment 
may only be executed 
Code 
when CPL 
:2: DPL and CPL 
Segment 
remains unchanged. 
(S = 1, 
1 
Readable 
(R) 
R=O 
Code segment 
may not be read. 
E = 
1) 
R = 1 
Code segment 
may be read. 


0 
Accessed 
(A) 
A=O 
Segment 
has not been accessed. 
A = 1 
Segment 
selector 
has been loaded into segment 
register 
or used by selector 
test instructions. 


Type 
Field 
Definition 


Code 
and 
data 
segments 
have 
several 
descriptor 
fields in common. 
The accessed 
A bit is set whenev- 
er the processor 
accesses 
a descriptor. 
The A bit is 
used by operating 
systems 
to keep usage statistics 
on a given 
segment. 
The 
G bit, or granularity 
bit, 
specifies 
if a segment 
length 
is byte-granular 
or 
page-granular. 
Intel486 
Microprocessor 
segments 
can 
be 
one 
megabyte 
long 
with 
byte 
granularity 
(G = 0) 
or 
four 
gigabytes 
with 
page 
granularity 
(G = 1), (Le., 220 pages 
each 
page 
is 4 Kbytes 
in 
length). The granularity 
is totally unrelated 
to paging. 
A Intel486 
Microprocessor 
system 
can 
consist 
of 
segments 
with byte granularity, 
and page granularity, 
whether 
or not paging is enabled. 


The executable 
E bit tells if a segment 
is a code or 
data segment. 
A code segment 
(E = 1, S = 1) may be 
execute-only 
or execute/read 
as determined 
by the 
Read 
R bit. 
Code 
segments 
are 
execute 
only 
if 
R=O, 
and 
execute/read 
if R=1. 
Code 
segments 
may never be written 
into. 


NOTE: 
Code segments 
may be modified 
via aliases. 
Alias- 
es are writeable 
data 
segments 
which 
occupy 
the 
same 
range 
of linear 
address 
space 
as the 
code 
segment. 


The D bit indicates 
the default 
length 
for operands 
and effective 
addresses. 
If D = 1 then 
32-bit 
oper- 
ands and 32-bit 
addressing 
modes 
are assumed. 
If 
D = 0 then 
16·bit 
operands 
and 
16-bit 
addressing 
modes 
are assumed. 
Therefore 
all existing 
80286 
code 
segments 
will execute 
on the Intel486 
Micro- 
processor 
assuming 
the D bit is set O. 


Another 
attribute 
of code segments 
is determined 
by 
the conforming 
C bit. Conforming 
segments, 
C = 1, 
can be executed 
and shared 
by programs 
at differ- 
ent privilege 
levels. (See Section 
4.4 Protection.) 


Segments 
identified 
as data segments 
(E = 0, S = 1) 
are used for two types 
of Intel486 
Microprocessor 
segments: 
stack and data segments. 
The expansion 
direction 
(ED) 
bit specifies 
if a segment 
expands 
downward 
(stack) or upward (data). If a segment 
is a 
stack 
segment 
all offsets 
must be greater 
than the 
segment 
limit. On a data segment 
all offsets 
must be 
less than or equal to the limit. In other words, 
stack 
segments 
start at the base 
linear address 
plus the 
maximum 
segment 
limit and grow down to the base 
linear address 
plus the limit. On the other hand, data 
segments 
start 
at the base 
linear 
address 
and ex- 
pand to the base linear address 
plus limit. 
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The write W bit controls the ability to write into a 
segment. Data segments are read-only if W=O. The 
stack segment must have W= 1. 


The B bit controls the size of the stack pointer regis- 
ter. If B= 1, then PUSHes, POPs, and CALLs all use 
the 32-bit ESP register for stack references and as- 
sume an upper limit of FFFFFFFFH. If B= 0, stack 
instructions all use the 16-bit SP register and as- 
sume an upper limit of FFFFH. 


4.3.4.3 System Descriptor 
Formats 


System segments describe information about oper- 
ating system tables, tasks, and gates. Figure 4.7 
shows the general format of system segment de- 
scriptors, and the various types of system segments. 
Intel486 Microprocessor system descriptors contain 
a 32-bit base linear address and a 20-bit segment 
limit. 80286 system descriptors have a 24-bit base 
address and a 16-bit segment limit. 80286 system 
descriptors are identified by the upper 16 bits being 
all zero. 


4.3.4.4 LOT Descriptors 
(S= 0, TYPE= 2) 


LOT descriptors (S= 0, TYPE= 2) contain informa- 
tion about Local Descriptor Tables. LOTs contain a 
table of segment descriptors, unique to a particular 
task. Since the instruction to load the LDTR is only 
available at privilege level 0, the DPL field is ignored. 
LOT descriptors are only allowed in the Global De- 
scriptor Table (GOT). 
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SEGMENT BASE 15 ... 0 


LIMIT 
19 ... 16 


Type 
o 
1 
2 
3 
4 
5 
6 
7 


Dellnea 


Invalid 
Available 
60286 
TSS 
LOT 
Busy 60286 
TSS 
80286 
Call Gate 
Task 
Gate 
(for 80286 
or Intel486TM 
CPU Task) 
80288 
Interrupt 
Gate 
80286 
Trap 
Gate 


4.3.4.5 TSS Descriptors 
(S=0, 
TYPE= 1,3,9, 
B) 


A Task State Segment (TSS) descriptor contains in- 
formation about the location, size, and privilege level 
of a Task State Segment (TSS). A TSS in turn is a 
special fixed format segment which contains all the 
state information for a task and a linkage field to 
permit nesting tasks. The TYPE field is used to indi- 
cate whether the task is currently BUSY (Le., on a 
chain of active tasks) or the TSS is available. The 
TYPE field also indicates if the segment contains a 
80286 or an Intel486 Microprocessor TSS. The Task 
Register (TR) contains the selector which points to 
the current Task State Segment. 


4.3.4.6 Gate Descriptors 
(S=0, 
TYPE= 4-7, C, F) 


Gates are used to control access to entry points 
within the target code segment. The various types of 
gate descriptors are call gates, task gates, Inter- 
rupt gates, and trap gates. Gates provide a level of 
indirection between the source and destination of 
the control transfer. This indirection allows the proc- 
essor to automatically perform protection checks. It 
also allows system designers to control entry points 
to the operating system. Call gates are used to 
change privilege levels (see Section 4.4 Protec- 
tion), task gates are used to perform a task switch, 
and interrupt and trap gates are used to specify in- 
terrupt service routines. 


BASE 
23 ... 
16 


Type 
8 
9 
A 
B 
C 
o 
E 
F 


Dellnea 


Invalid 
Available 
Intel488™ 
CPU T55 
Undefined 
(Intel 
Reserved) 
Busy 
Intel486TM 
CPU TSS 
Intel486TM 
CPU Call Gate 
Undefined 
(Intel 
Reserved) 
Intel486™ 
CPU Interrupt 
Gate 
Intel486™ 
CPU Trep 
Gate 
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Figure 4.8 shows the format of the four types of gate 
descriptors. Call gates are primarily used to transfer 
program control to a more privileged level. The call 
gate descriptor consists of three fields: the access 
byte, a long pointer (selector and offset) which 
points to the start of a routine and a word count 
which specifies how many parameters are to be cop- 
ied from the caller's stack to the stack of the called 
routine. The word count field is only used by call 
gates when there is a change in the privilege level, 
other types of gates ignore the word count field. 


Interrupt and trap gates use the destination selector 
and destination offset fields of the gate descriptor as 
a pointer to the start of the interrupt or trap handler 
routines. The difference between'interrupt gates and 
trap gates is that the interrupt gate disables inter- 
rupts (resets the IF bit) while the trap gate does not. 


Task gates are used to switch tasks. Task gates 
may only refer to a task state segment (see Section 
4.4.6 Task Switching) therefore only the destination 
selector portion of a task gate descriptor is used, 
and the destination offset is ignored. 


Exception 13 is generated when a destination selec- 
tor does not refer to a correct descriptor type, Le., a 
code segment for an interrupt, trap or call gate, a 
TSS for a task gate. 
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The access byte format is the same for all gate de- 
scriptors. P= 1 indicates that the gate contents are 
valid. P=0 indicates the contents are not valid and 
causes exception 11 if referenced. DPL is the de- 
scriptor privilege level and specifies when this de- 
scriptor may be used by a task (see Section 4.4 Pro- 
tection). The S field, bit 4 of the access rights byte, 
must be 0 to indicate a system control descriptor. 
The type field specifies the descriptor type as indi- 
cated in Figure 4.8. 


4.3.4.7 Differences 
Between Intel486TM 
Microprocessor 
and 80286 Descriptors 


In order to provide operating system compatibility 
between the 80286 and Intel486 Microprocessor, 
the Intel486 Microprocessor supports all of the 
80286 segment descriptors. Figure 4.9 shows the 
general format of an 80286 system segment de- 
scriptor. The only differences between 80286 and 
Intel486 Microprocessor descriptor formats are that 
the values of the type fields, and the limit and base 
address fields have been expanded for the Intel486 
Microprocessor. The 80286 system segment de- 
scriptors contained a 24·bit base address and 16-bit 
limit, while the Intel486 Microprocessor system seg- 
ment descriptors have a 32-bit base address, a 20- 
bit limit field, and a granularity bit. 


SELECTOR 
OFFSET 15 ... 0 


WORD 
OFFSET 31 ... 16 
P 
DPL 
0 
TYPE 
0 
0 
0 
COUNT 


I 
4 ... 0 


Value 
4 
5 
6 
7 
C 
E 
F 
o 
1 


Description 
80286 
call gate 
Task 
gate 
(lor 80286 
or Inte1486T'" 
CPU task) 
80286 
interrupt 
gate 
80286 
trap gate 
Inte1486"" 
CPU call gate 
Inte1486"" 
CPU interrupt 
gate 
Inte1486T'" 
CPU trap gate 
Descriptor 
contents 
are not valid 
Descriptor 
contents 
are valid 


DPL--Ieast 
privileged 
level at which 
a task may access 
the gate. WORD 
COUNT 
0-31-the 
number 
01 parameters 
to copy from caller's 
stack 
to the called 
procedure's 
stack. 
The 
parameters 
are 32-bit 
quantities 
lor Inte1486"" 
CPU gates. 
and 
16-bit 
quant~ies 
lor 80286 
gates. 


DESTINATION 
16-bit 
Selector 
to the target 
code 
segment 
SELECTOR 
selector 
or 
Selector 
to the target 
task 
state 
segment 
lor task 
gate 


offset 
Entry 
point 
within 
the target 
code 
segment 
16-bit 
80286 
32-bi\ 
Intel486"" 
CPU 
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By supporting 80286 system segments the Intel486 
Microprocessor is able to execute 80286 application 
programs on an Intel486 Microprocessor operating 
system. This is possible because the processor au- 
tomatically 
understands 
which 
descriptors 
are 
80286-style descriptors and which descriptors are 
Intel486 Microprocessor-style descriptors. In partic- 
ular, if the upper word of a descriptor is zero, then 
that descriptor is a 80286-style descriptor. 


The only other differences between 80286-style de- 
scriptors and Intel486 Microprocessor descriptors is 
the interpretation of the word count field of call gates 
and the B bit. The word count field specifies the 
number of 16-bit quantities to copy for 80286 call 
gates and 32-bit quantities for Intel486 Microproces- 
sor call gates. The B bit controls the size of PUSHes 
when using a call gate; if B= 0 PUSHes are 16 bits, 
if B= 1 PUSHes are 32 bits. 


4.3.4.8 selector Fields 


A selector in Protected Mode has three fields: Local 
or Global Descriptor Table Indicator (TI), Descriptor 


31 


SEGMENT BASE 15 ... 0 


Intel Reserved 
Setto 0 


BASE 
LIMIT 
P 


Base Address of the segment 
The length of the segment 
Present B~ 
1= Present 
0= Not Present 


Entry Index (Index), and Requestor (the selector's) 
Privilege Level (RPL) as shown in Figure 4.10. The 
TI bits select one of two memory-based tables of 
descriptors (the Global Descriptor Table or the Local 
Descriptor Table). The Index selects one of 8K de- 
scriptors in the appropriate descriptor table. The 
RPL bits allow high speed testing of the selector's 
privilege attributes. 


4.3.4.9 Segment Descriptor Cache 


In addition to the selector value, every segment reg- 
ister has a segment descriptor cache register asso- 
ciated with it. Whenever a segment register's con- 
tents are changed, the 8-byte descriptor associated 
with that selector is automatically loaded (cached) 
on the chip. Once loaded, all references to that seg- • 
ment use the cached descriptor information instead 
of reaccessing the descriptor. The contents of the 
descriptor cache are not visible to the programmer. 
Since descriptor caches only change when a seg- 
ment register is changed, programs which modify 
the descriptor tables must reload the appropriate 
segment registers after changing a descriptor's val- 
ue. 


o 


BASE 
+4 
23 ... 16 


Descriptor Privilege Level 0-3 
System Descriptor 
0 =System 
1= User 
Type of Segment 


TABLE 
INDICATOR 


TI=1 


LOCAL 
DESCRIPTOR 
TABLE 


N 


DESCRIPTOR 
NUMBER 


6 


5 


4 


3 


2 


0 
NULL 


GLOBAL 
DESCRIPTOR 
TABLE 


6 


5 


4 
3p¢$~IM~ 


2 


4.3.4.10 
Segment 
Descriptor 
Register 
Settings 


The contents 
of the segment 
descriptor 
cache vary 
depending 
on the mode the Intel486 
Microprocessor 
is operating 
in. When 
operating 
in Real 
Address 
Mode, the segment 
base, limit, and other attributes 
within 
the segment 
cache 
registers 
are defined 
as 
shown in Figure 4.11. For compatibility 
with the 8086 


architecture, 
the base is set to sixteen times the cur- 
rent selector 
value, the limit is fixed at OOOOFFFFH, 
and the attributes 
are fixed so as to indicate 
the seg- 
ment 
is present 
and fully usable. 
In Real Address 
Mode, the internal 
"privilege 
level" 
is always fixed to 
the highest level, level 0, so I/O and other privileged 
opcodes 
may be executed. 


32 - BIT L1toliT 


(FIXED) 


OTHER 
ATTRIBUTES 


(FIXED) 


32- 
BIT BASE 


(UPDATED 
DURING 
SELECTOR 
LOAD 
INTO 
SEGtolENT REGISTER) 


CONFORtollNG 
PRIVILEGE 
---------------------., 


STACK 
SIZE 
-------------------, 
EXECUTABLE 
------------------., 
WRITEABLE 
--------------'----., 
READABLE 
-----------------, 


EXPANSION 
DIRECTION ----------------1 
GRANULARITY 
1 
ACCESSED 
1 


~:!~~~~E 
_L~E~ 
~A-~-~-_-_-_-_-_-_-_-_-_-_-_-~-I~-I!-_-_-_-t-J 
__ 
_ 
__ 


CS 
16X 
CURRENT 
CS SELECTOR" 
OOOOFFFFH 
Y 
0 
Y 
B 
U 
Y 
Y 
Y 
- 
N 


SS 
16X 
CURRENT 
SS SELECTOR 
OOOOFFFFH 
Y 
0 
Y 
B 
U 
Y 
Y 
N 
W 
- 


OS 
16X 
CURRENT 
OS SELECTOR 
OOOOFFFFH 
Y 
0 
Y 
B 
U 
Y 
Y 
N - - 
ES 
16X 
CURRENT 
ES SELECTOR 
OOOOFFFFH 
Y 
0 
Y 
B 
U 
Y 
Y 
N - - 
FS 
16X 
CURRENT 
FS SELECTOR 
OOOOFFFFH 
Y 
0 
Y 
B 
U 
Y 
Y 
N - - 
GS 
16X 
CURRENT 
GS SELECTOR 
OOOOFFFFH 
Y 
0 
Y 
B 
U 
Y 
Y 
N - - 


'Except 
the 
32-biI 
CS 
base 
is initialized 
to 
FFFFFOOOH 
alter 
reset 
until 
first 
intersegment 
control 
transfer 
(i.e., 
intersegment 
CALL, 
or 
intersegment 
JMP, or INT). (See 
Figure 
4.13 
Example.) 
Key: 
Y 
= yes 
N 
= no 
o 
~ 
privilege 
level 
0 
1 
= privilege 
level 
1 
2 
= privilege 
level 
2 
3 
= privilege 
level 
3 
U 
= expand 
up 


o ~ expand 
down 
B 
= byte granularity 
P 
= page 
granularity 
W 
~ 
push/pop 
16-bit 
words 
F 
~ 
push/pop 
32-bit 
dwords 
- 
~ 
does 
not apply 
to that 
segment 
cache 
register 


Figure 4.11. Segment Descriptor Caches for Real Address Mode 
(Segment Limit and Attributes are Fixed) 


When operating in Protected Mode, the segment 
base, limit, and other attributes within the segment 
cache registers are defined as shown in Figure 4.12. 
In Protected Mode, each of these fields are defined 


according to the contents of the segment descriptor 
indexed by the selector value loaded into the seg- 
ment register. 
• 
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SEGMENT 


32 - 
BIT BASE 


(UPDATED 
DURING 
SELECTOR 
LOAD 
INTO 
SEGMENT 
REGISTER) 


CONfORMING 
PRIVILEGE 
---------------------.., 


STACK 
SIZE 
-------------------, 
EXECUTABLE 
------------------, 
WRITEABLE 
-----------------.., 
READABLE 
-----------------, 


EXPANSION 
DIRECTION ---------------1 
GRANULARITY 
1 
ACCESSED 
1 


~:!~~~~E_L~E~ 
~~s~_-_-_-_-_-_-_-_-_-_-_-_-L1-~-I!-_-_-_-_-_-_-~-J 
_ 


DESCRIPTOR 
CACHE 
REGISTER 
CONTENTS 


32 - 
BIT LIMIT 
OTHER 
ATIRIBUTES 


(UPDATED 
DURING 
(UPDATED 
DURING 
SELECTOR 
LOAD 
INTO 
SELECTOR 
LOAD 
INTO 
SEGMENT 
REGISTER) 
SEGMENT 
REGISTER) 


CS 
BASE 
PER SEG DESCR 
LIMIT 
PER SEG DESCR 
P 
d 
d 
d 
d 
d 
N 
y 
- 
d 


SS 
BASE 
PER SEG DESCR 
LIMIT 
PER SEG DESCR 
P 
d 
d 
d 
d 
r 
w 
N 
d - 


OS 
BASE 
PER SEG DESCR 
LIMIT 
PER SEG DESCR 
P 
d 
d 
d 
d 
d 
d 
N - - 


ES 
BASE 
PER SEG DESCR 
LIMIT 
PER SEG DESCR 
P 
d 
d 
d 
d 
d 
d 
N - - 


fS 
BASE 
PER SEG DESCR 
LIMIT 
PER SEG DESCR 
P 
d 
d 
d 
d 
d 
d 
N - - 


GS 
BASE 
PER SEG DESCR 
LIMIT 
PER SEG DESCR 
P 
d 
d 
d 
d 
d 
d 
N - - 


Key: 
Y = fixed 
yes 
N = fixed 
no 
d = per segment 
descriptor 
p ~ 
per segment 
descriptor; 
descriptor 
must 
indicate 
"present" 
to avoid 
exception 
11 
(exception 
12 in case 
of 85) 
r = per segment 
descriptor. 
but descriptor 
must 
indicate 
"readable" 
to avoid 
exception 
13 
(special 
case 
for 85) 
w ~ 
per segment 
descriptor, 
but descriptor 
must 
indicate 
"writable" 
to avoid 
exception 
13 
(special 
case 
for SS) 
- = does 
not apply 
to that 
segment 
cache 
register 


When operating in a Virtual 8086 Mode within the 
Protected Mode, the segment base, limit, and other 
attributes within the segment cache registers are de- 
fined as shown in Figure 4.13. For compatibility with 
the 8086 architecture, the base is set to sixteen 
times the current selector value, the limit is fixed at 


OOOOFFFFH,and the attributes are fixed so as to 
indicate the segment is present and fully usable. The 
virtual program executes at lowest privilege level, 
level 3, to allow trapping of all IOPL-sensitive in- 
structions and level-O-onlyinstructions. 
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OTHER ATIRIBUTES 


(FIXED) 


32- BIT BASE 


(UPDATED 
DURING SELECTOR 
LOAD INTO SEGMENT REGISTER) 


CONfORMING 
PRIVILEGE --------------------., 
STACK SIZE 
------ 
,-------------., 
EXECUTABLE 
WRITEABLE 
READABLE 


EXPANSION 
DIRECTION 
1 
GRANULARITY 
1 
ACCESSED 
1 


~:!~~~~E_L~E~ 
B~-:-E_-_-_-_-_-_-_-_-_-_-_-_-~-I~-I-!-_-_-_-~-J 
_ _ 
_ 
__ 
_ _ 
_ 


CS 
16X CURRENT CS SELECTOR 
OOOOffffH 
Y 
3 
Y 
B 
U 
Y 
Y 
Y 
- 
N 


SS 
16X CURRENT SS SELECTOR 
OOOOffffH 
Y 
3 
Y 
B 
U 
Y 
Y 
N 
W 
- 
OS 
16X CURRENT OS SELECTOR 
OOOOffffH 
Y 
3 
Y 
B 
U 
Y 
Y 
N - - 
ES 
16X CURRENT ES SELECTOR 
OOOOffffH 
Y 
3 
Y 
B 
U 
Y 
Y 
N - - 
fS 
16X CURRENT fS 
SELECTOR 
OOOOffffH 
Y 
3 
Y 
B 
U 
Y 
Y 
N - - 
GS 
16X CURRENT GS SELECTOR 
OOOOffffH 
Y 
3 
Y 
B 
U 
Y 
Y 
N - - 


Key: 
Y 
= yes 
N 
= no 
o 
= privilege level 0 
1 
= privilege level 1 
2 
= privilege level 2 
3 
= privilege level 3 
U 
= expand up 


o 
= expand down 
8 
= byte granularity 
P 
= page granularity 
W 
= push/pop 16-bil words 
F 
= push/pop 32-bil dwords 
- 
= does not apply to that segment cache register 


Figure 4.13. Segment 
Descriptor 
Caches for Virtual 8086 Mode within Protected 
Mode 
(Segment 
Limit and Attributes 
are Fixed) 


HIGH SPEED 
OPERATtMG 
SYSTEM 
wrEA'ACE 


CPU 
ENFORCE.D 
SOFTWARE 
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The Intel486 Microprocessor has four levels of pro- 
tection which are optimized to support the need!?of 
a multi-tasking operating system to isolate and pro- 
tect user programs from each other and the operat- 
ing system. The privilege levels control the use of 
privileged instructions, I/O instructions, and access 
to segments and segment descriptors. Unlike tradi- 
tional 
microprocessor-based systems where this 
protection is achieved only through the use of com- 
plex external hardware and software the Intel486 Mi- 
croprocessor provides the protection as part of its 
integrated Memory Management Unit. The Intel486 
Microprocessor offers an additional type of protec- 
tion on a page basis, when paging is enabled (See 
Section 4.5.3 Page Level Protection). 


The four-level hierarchical privilege system is illus- 
trated in Figure 4-14, It is an extension of the user/ 
supervisor privilege mode commonly used by mini- 
computers and, in fact, the user/supervisor mode is 
fully supported by the Intel486 Microprocessor pag- 


• 


ing mechanism. The privilege levels (PL) are num- 
bered 0 through 3. Level 0 is the most privileged or 
trusted level. 


The Intel486 Microprocessor controls access to 
both data and procedures between levels of a task, 
according to the following rules. 
• Data stored in a segment with privilege level p 


can be accessed only by code executing at a 
privilege level at least as privileged as p. 


• A code segment/procedure with privilege level p 


can only be called by a task executing at the 
same or a lesser privilege level than p. 


4.4.3.1 Task Privilege 


At any point in time, a task on the Intel486 Micro- 
processor always executes at one of the four privi- 
lege levels. The Current Privilege Level (CPL) speci- 
fies the task's privilege level. A task's CPL may only 
be changed by control transfers through gate de- 
scriptors to a code segment with a different privilege 
level. (See Section 4.4.4 Privilege Level Transfers) 
Thus, an application program running at PL = 3 may 
call an operating system routine at PL = 1 (via a 
gate) which would cause the task's CPL to be set to 
1 until the operating system routine was finished. 


4.4.3.2 Selector Privilege (RPL) 


The privilege level of a selector is specified by the 
RPL field. The RPL is the two least significant bits of 
the selector. The selector's RPL is only used to es- 
tablish a less trusted privilege level than the current 
privilege level for the use of a segment. This level is 
called the task's effective privilege level (EPL). The 
EPL is defined as being the least privileged (Le. nu- 
merically larger) level of a task's CPL and a selec- 
tor's RPL. Thus, if selector's RPL = 0 then the CPL 
always specifies the privilege level for making an ac- 
cess using the selector. On the other hand if RPL = 
3 then a selector can only access segments at level 
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3 regardless of the task's CPL. The RPL is most 
commonly used to verify that pointers passed to an 
operating system procedure do not access data that 
is of higher privilege than the procedure that origi- 
nated the pointer. Since the originator of a selector 
can specify any RPL value, the Adjust RPL (ARPL) 
instruction is provided to force the RPL bits to the 
originator's CPL. 


4.4.3.3 110 Privilege and 1/0 Permission Bltmap 


The I/O privilege level (IOPL, a 2-bit field in the 
EFLAG register) defines the least privileged level at 
which I/O instructions can be unconditionally per- 
formed. I/O instructions can be unconditionally per- 
formed when CPL ~ 10PL.(The I/O instructions are 
IN, OUT, INS, OUTS, REP INS, and REP OUTS.) 
When CPL > 10PL,and the current task is associat- 
ed with a 286 TSS, attempted I/O instructions cause 
an exception 13 fault. When CPL > 10PL, and the 
current task is associated with an Intel486 Micro- 
processor TSS. the I/O Permission Bitmap (part of 
an Intel486 Microprocessor TSS) is consulted on 
whether I/O to the port is allowed, or an exception 
13 fault is to be generated instead. For diagrams of 
the I/O Permission Bitmap, refer to Figures 4.15a 
and 4.15b. For further information on how the I/O 
Permission Bitmap is used in Protected Mode or in 
Virtual 8086 Mode, refer to Section 4.6.4 Protection 
and I/O Permission Bitmap. 


The I/O privilege level (IOPL) also affects whether 
several other instructions can be executed or cause 
an exception 13 fault instead. These instructions are 
called "IOPL-sensitive" 
instructions and they are 


CLI and STI. (Note that the LOCK prefix is not 10PL- 
sensitive on the Intel486 Microprocessor.) 


The 10PL also aff.ectswhether the IF (interrupts en- 
able flag) bit can be changed by loading a value into 
the EFLAGS register. When CPL ~ 10PL, then the 
IF bit can be changed by loading a new value into 
the EFLAGS register. When CPL > 10PL,the IF bit 
cannot be changed by a new value POP'ed into (or 
otherwise loaded into) the EFLAGS register; the IF 
bit merely remains unchanged and no exception is 
generated. 


Instruction 
Operands 
Function 


ARPL 
Selector, 
Adjust Requested Privi- 
Register 
lege Level: adjusts the 
RPL of the selector to the 
numeric maximum of 
current selector RPL value 
and the RPL value in the 
register. Set zero flag if 
selector RPL was 
changed. 


VERR 
Selector 
VERify for Read: sets the 
zero flag if the segment 
referred to by the selector 
can be read. 


VERW 
Selector 
VERify for Write: sets the 
zero flag if the segment 
referred to by the selector 
can be written. 


LSL 
Register, 
Load Segment Limit: reads 
Selector 
the segment limit into the 
register if privilege rules 
and descriptor type allow. 
Set zero flag if successful. 


LAR 
Register, 
Load Access Rights: reads 
Selector 
the descriptor access 
rights byte into the register 
if privilege rules allow. Set 
zero flag if successful. 


4.4.3.4 Privilege Validation 


The Intel486 Microprocessor provides several in- 
structions to speed pointer testing and help maintain 
system integrity by verifying that the selector value 
refers to an appropriate segment. Table 4.2 summa- 
rizes the selector validation procedures available for 
the Intel486 Microprocessor. 


This pointer verification prevents the common prob- 
lem of an application at PL = 3 calling a operating 
systems routine at PL = 0 and passing the operat- 
ing system routine a "bad" pointer which corrupts a 
data structure belonging to the operating system. If 
the operating system routine uses the ARPL instruc- 


tion to ensure that the RPL of the selector has no 
greater privilege than that of the caller, then this 
problem can be avoided. 


4.4.3.5 Descriptor Access 


There are basically two types of segment accesses: 
those involving code segments such as control 
transfers, and those involving data accesses. Oeter- 
mining the ability of a task to access a segment in- 
volves the type of segment to be accessed, the in- 
struction used, the type of descriptor used and CPL, 
RPL, and OPL as described above. 


Any time an instruction loads data segment registers 
(OS, ES, FS, GS) the 
Intel486 Microprocessor 
makes protection validation checks. Selectors load- 2 
ed in the OS, ES, FS, GS registers must refer only to 
data segments or readable code segments. The 
data access rules are specified in Section 4.4.2 
Rules of Privilege. 
The only exception to those 
rules is readable conforming code segments which 
can be accessed at any privilege level. 


Finally the privilege validation checks are performed. 
The CPL is compared to the EPL and if the EPL is 
more privileged than the CPL an exception 13 (gen- 
eral protection fault) is generated. 


The rules regarding the stack segment are slightly 
different than those involving data segments. In- 
structions that load selectors into SS must refer to 
data segment descriptors for writeable data seg- 
ments. The OPL and RPL must equal the CPL. All 
other descriptor types or a privilege level violation 
will cause exception 13. A stack not present fault 
causes exception 12. Note that an exception 11 is 
used for a not-present code or data segment. 


Inter-segment control transfers occur when a selec- 
tor is loaded in the CS register. For a typical system 
most of these transfers are simply the result of a call 
or a jump to another routine. There are five types of 
control transfers which are summarized in Table 4.3. 
Many of these transfers result in a privilege level 
transfer. Changing privilege levels is done only via 
control transfers, by using gates, task switches, and 
interrupt or trap gates. 
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Control Transfer Types 
Operation Types 
Descriptor 
Descriptor 
Referenced 
Table 


Intersegment within the same privilege level 
JMP, CALL, RET, IREP 
Code Segment 
GOT/LOT 


Intersegment to the same or higher privilege level 
CALL 
Call Gate 
GOT/LOT 
Interrupt within task may change CPL 
Interrupt Instruction, 
Trap or 
lOT 
Exception, External 
Interrupt 
Interrupt 
Gate 


Intersegment to a lower privilege level 
RET,IREP 
Code Segment 
GOT/LOT 
(changes task CPL) 


CALL, JMP 
Task State 
GOT 
Segment 


Task Switch 
CALL, JMP 
Task Gate 
GOT/LOT 


IREP" 
Task Gate 
lOT 
Interrupt Instruction, 
Exception, External 
Interrupt 


·NT 
(Nested 
Task bit of flag register) 
= 0 
•• NT (Nested 
Task bit of flag register) 
= 1 


Control transfers can only occur if the operation 
which loaded the selector references the correct de- 
scriptor type. Any violation of these descriptor usage 
rules will cause an exception 13 (e.g. JMP through a 
call gate, or IRET from a normal subroutine call). 


In order to provide further system security, all control 
transfers are also subject to the privilege rules. 


The privilege rules require that: 
- 
Privilege level transitions can only occur via 
gates. 


- 
JMPs can be made to a non-conforming code 
segment with the same privilege or to a conform- 
ing code segment with greater or equal privilege. 
- 
CALLs can be made to a non-conforming code 
segment with the same privilege or via a gate to 
a more privileged level. 


- 
Interrupts handled within the task obey the same 
privilege rules as CALLs. 
- 
Conforming Code segments are accessible by 
privilege levels which are the same or less privi- 
leged than the conforming-code segment's OPL. 


- 
Both the requested privilege level (RPL) in the 
selector pointing to the gate and the task's CPL 
must be of equal or greater privilege than the 
gate's OPL. 


- 
The code segment selected in the gate must be 
the same or more privileged than the task's CPL. 


- 
Return instructions that do not switch tasks can 
only return control to a code segment with same 
or less privilege. 
- 
Task switches can be performed by a CALL, 
JMP, or INT which references either a task gate 
or task state segment who's OPL is less privi- 
leged or the same privilege as the old task's CPL. 


Any control transfer that changes CPL within a task 
causes a change of stacks as a result of the privi- 
lege level change. The initial values of SS:ESP for 
privilege levels 0, 1, and 2 are retained in the task 
state segment (see Section 4.4.6 Task Switching). 
During a JMP or CALL control transfer, the new 
stack pointer is loaded into the SS and ESP regis- 
ters and the previous stack pointer is pushed onto 
the new stack. 


When RETurning to the original privilege level, use 
of the lower-privileged stack is restored as part of 
the RET or IRET instruction operation. For subrou- 
tine calls that pass parameters on the stack and 
cross privilege levels, a fixed number of words (as 
specified in the gate's word count field) are copied 
from the previous stack to the current stack. The 
inter-segment RET instruction with a stack adjust- 
ment value will correctly restore the previous stack 
pointer upon return. 
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Figure 4.15b. sample 
1/0 Permission 
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address space, and a link to the previous task), 
loads a new execution state, performs protection 
checks, and commences execution in the new task, 
in about 10 microseconds. Like transfer of control 
via gates, the task switch operation is invoked by 
executing an inter-segment JMP or CALL instruction 
which refers to a Task State Segment (TSS), or a 
task gate descriptor in the GOT or LOT. An INT n 
instruction, exception, trap, or external interrupt may 
also invoke the task switch operation if there is a 
task gate descriptor in the associated lOT descriptor 
slot. 


Gates provide protected, indirect CALLs. One of the 
major uses of gates is to provide a secure method of 
privilege transfers within a task. Since the operating 
system defines all of the gates in a system, it can 
ensure that all gates only allow entry into a few trust- 
ed procedures (such as those which allocate memo- 
ry, or perform 110). 


Gate descriptors follow the data access rules of priv- 
ilege; that is, gates can be accessed by a task if the 
EPL, is equal to or more privileged than the gate 
descriptor's OPL. Gates follow the control transfer 
rules of privilege and therefore may only transfer 
control to a more privileged level. 


Call Gates are accessed via a CALL instruction and 
are syntactically identical to calling a normal subrou- 
tine. When an inter-level Intel486 Microprocessor 
call gate is activated, the following actions occur. 
1. Load CS:EIP from gate check for validity 
2. SS is pushed zero-extended to 32 bits 
3. ESP is pushed 
4. Copy Word Count 32-bit parameters from the 
old stack to the new stack 
5. Push Return address on stack 


The procedure is identical for 80286 Call gates, ex- 
cept that 16·bit parameters are copied and 16-bit 
registers are pushed. 


Interrupt Gates and Trap gates work in a similar 
fashion as the call gates, except there is no copying 
of parameters. The only difference between Trap 
and Interrupt gates is that control transfers through 
an Interrupt gate disable further interrupts (Le.the IF 
bit is set to 0), and Trap gates leave the interrupt 
status unchanged. 


A very important attribute of any multi-tasking/multi- 
user operating systems is its ability to rapidly switch 
between tasks or processes. The Intel486 Micro- 
processor directly supports this operation by provid- 
ing a task switch instruction in hardware. The In- 
tel486 Microprocessor task switch operation saves 
the entire state of the machine (all of the registers, 


The TSS descriptor points to a segment (see Figure 
4.15) containing the entire Intel486 Microprocessor 
execution state while a task gate descriptor contains 
a TSS selector. The Intel486 Microprocessor sup- 
ports both 80286 and Intel486 Microprocessor style 
TSSs. Figure 4.16 shows a 80286 TSS. The limit of 
an Intel486 Microprocessor TSS must be greater 
than 0064H (002BH for a 80286 TSS), and can be 
as large as 4 Gigabytes. In the additional TSS 
space, the operating system is free to store addition- 
al information such as the reason the task is inac- 
tive, time the task has spent running, and open files 
belong to the task. 


Each task must have a TSS associated with it. The 
current TSS is identified by a special register in the 
Intel486 Microprocessor called the Task State Seg- 
ment Register (TR). This register contains a selector 
referring to the task state segment descriptor that 
defines the current TSS. A hidden base and limit 
register associated with TR are loaded whenever TR 
is loaded with a new selector. Returning from a task 
is accomplished by the IRET instruction. When IRET 
is executed, control is returned to the task which 
was interrupted. The current executing task's state 
is saved in the TSS and the old task state is restored 
from its TSS. 


Several bits in the flag register and machine status 
word (CRO) give information about the state of a 
task which are useful to the operating system. The 
Nested Task (NT) (bit 14 in EFLAGS) controls the 
function of the IRET instruction. If NT = 0, the IRET 
instruction performs the regular return; when NT = 
1, IRET performs a task switch operation back to the 
previous task. The NT bit is set or reset in the follow- 
ing fashion: 
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When a CALL or INT instruction initiates a task 
switch, the new TSS will be marked busy and the 
back link field of the new TSS set to the old TSS 
selector. The NT bit of the new task is set by CALL 
or INT initiated task switches. An interrupt that does 
not cause a task switch will clear NT. (The NT bit will 
be restored after execution of the interrupt handler) 
NT may also be set or c1earerj by POPF or IRET 
instructions. 


The Intel486 Microprocessor task state segment is 
marked busy by changing the descriptor type field 
from TYPE 9H to TYPE BH. An 80286 TSS is 
marked busy by changing the descriptor type field 
from TYPE 1 to TYPE 3. Use of a selector that refer- 
ences a busy task state segment causes an excep- 
tion 13. 


The Virtual Mode (VM) bit 17 is used to indicate if a 
task, is a virtual 8086 task. If VM = 1, then the tasks 
will use the Real Mode addressing mechanism. The 
virtual 8086 environment is only entered and exited 
via a task switch (see Section 4.6 Virtual Mode). 


The FPU's state is not automatically saved when a 
task switch occurs, because the incoming task may 
not use the FPU. The Task Switched (TS) Bit (bit 3 in 
the CRO)helps deal with the FPU's state in a multi- 
tasking environment. Whenever the Intel486 Micro- 


processor switches tasks, it sets the TS bit. The In- 
tel486 Microprocessor detects the first use of a 
processor extension instruction after a task switch 
and causes the processor extension not available 
exception 7. The exception handler for exception 7 
may then decide whether to save the state of the 
FPU. A processor extension not present exception 
(7) will occur when attempting to execute a Floating 
Point or WAIT instruction if the Task Switched and 
Monitor coprocessor extension bits are both set (Le. 
TS = 1 and MP = 1). 


The T bit in the Intel486 Microprocessor TSS indi- 
cates that the processor should generate a debug 
exception when switching to a task. If T = 1 then 
upon entry to a new task a debug exception 1 will be 
generated. 
• 


4.4.7 
INITIALIZATION 
AND TRANSITION 
TO 
PROTECTED 
MODE 


Since the Intel486 Microprocessor begins executing 
in Real Mode immediately after RESET it is neces- 
sary to initialize the system tables and registers with 
the appropriate values. 


The GOT and lOT registers must refer to a valid GOT 
and lOT. The lOT should be at least 256 bytes long, 
and GOT must contain descriptors for the initial 
code, and data segments. Figure 4.17 shows the ta- 
bles and Figure 4.18 the descriptors needed for a 
simple Protected Mode Intel486 Microprocessor 
system. It has a single code and single data/stack 
segment each four gigabytes long and a single privi- 
lege level PL = O. 


The actual method of enabling Protected Mode is to 
load CROwith the PE bit set, via the MOV CRO,R/M 
instruction. This puts the Intel486 Microprocessor in 
Protected Mode. 


After enabling Protected Mode, the next instruction 
should execute an intersegment JMP to load the CS 
register and flush the instruction decode queue. The 
final step is to load all of the data segment registers 
with the initial selector values. 


An alternate approach to entering Protected Mode 
which is especially appropriate for multi-tasking op- 
erating systems, is to use the built in task-switch to 
load all of the registers. In this case the GOT would 
contain two TSS descriptors in addition to the code 
and data descriptors needed for the first task. The 
first JMP instruction in Protected Mode would jump 
to the TSS causing a task switch and loading all of 
the registers with the values stored in the TSS. The 
Task State Segment Register should be initialized to 
point to a valid TSS descriptor since a task switch 
saves the state of the current task in a task state 
segment. 
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4.4.8 
TOOLS 
FOR BUILDING 
PROTECTED 
SYSTEMS 


In order to simplify the design of a protected multi· 
tasking system, Intel provides a tool which allows 
the system designer an easy method of constructing 
the data structures needed for a Protected Mode 
Intel486 Microprocessor system. This tool is the 
builder BLD-386. BLD-386 lets the operating system 
writer specify all of the segment descriptors dis- 
cussed in the previous sections (LOTs, lOTs, GDTs, 
Gates, and TSSs) in a high-level language. 


Paging is another type of memory management 
useful for virtual memory multitasking operating sys- 
tems. Unlike segmentation which modularizes pro- 
grams and data into variable length segments, pag- 
ing divides programs into multiple uniform size 
pages. Pages bear no direct relation to the logical 
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structure of a program. While segment selectors can 
be considered the logical "name" 
of a program 
module or data structure, a page most likely corre- 
sponds to only a portion of a module or data struc- 
ture. 


By taking advantage of the locality of reference dis- 
played by most programs, only a small number of 
pages from each active task need be in memory at 
anyone moment. 


4.5.2.1 
Page Mechanism 


The Intel486 Microprocessor uses two levels of ta- 
bles to translate the linear address (from the seg- 
mentation unit) into a physical address. There are 
three components to the paging mechanism of the 
Intel486 Microprocessor: the page directory, the 
page tables, and the page itself (page frame). All 
memory-resident elements of the Intel486 Micro- 
processor paging mechanism are the same size, 
namely, 4 Kbytes. A uniform size for all of the ele- 
ments simplifies memory allocation and reallocation 
schemes, since there is no problem with memory 
fragmentation. Figure 4.19 shows how the paging 
mechanism works. 
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4.5.2.2 
Page Descriptor 
Base Register 


CR2 is the Page Fault Linear Address register. It 
holds the 32-bit linear address which caused the last 
page fault detected. 


CR3 is the Page Directory Physical Base Address 
Register. It contains the physical starting address of 
the Page Directory. The lower 12 bits of CR3 are 
always zero to ensure that the Page Directory is al- 
ways page aligned. Loading it via a MOV CR3, reg 
instruction causes the Page Table Entry cache to be 
flushed, as will a task switch through a TSS which 
changes 
the value of CRO.(See 4.5.5 Translation 
Lookaslde 
Buffer). 


4.5.2.3 
Page Directory 


The Page Directory is 4 Kbytes long and allows up to 
1024 Page Directory Entries. Each Page Directory 
Entry contains the address of the next level of ta- 
bles, the Page Tables and information about the 
page table. The contents of a Page Directory Entry 
are shown in Figure 4.20. The upper 10 bits of the 
linear address (A22-A31) are used as an index to 
select the correct Page Directory Entry. 
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Figure 4.21. Page Table Entry (Points to Page) 


The (User/Supervisor) U/S bit 2 and the (Read/ 
Write) R/W bit 1 are used to provide protection attri- 
butes for individual pages. 
Each Page Table is 4 Kbytes and holds up to 1024 
Page Table Entries. Page Table Entries contain the 
starting address of the page frame and statistical 
information about the page (see Figure 4.21). Ad- 
dress bits A12-A21 are used as an index to select 
one of the 1024 Page Table Entries. The 20 upper- 
bit page frame address is concatenated with the 
lower 12 bits of the linear address to form the physi- 
cal address. Page tables can be shared between 
tasks and swapped to disks. 


4.5.2.5 Page Directory/Table 
Entries 


The lower 12 bits of the Page Table Entries and 
Page Directory Entries contain statistical information 
about pages and page tables respectively. The P 
(Present) bit 0 indicates if a Page Directory or Page 
Table entry can be used in address translation. If 
P = 1 the entry can be used for address translation 
if P = 0 the entry can not be used for translation, 
and all of the other bits are available for use by the 
software. For example the remaining 31 bits could 
be used to indicate where on the disk the page is 
stored. 


The A (Accessed) bit 5, is set by the Intel486 Micro- 
processor for both types of entries before a read or 
write access occurs to an address covered by the 
entry. The D (Dirty) bit 6 is set to 1 before a write to 
an address covered by that page table entry occurs. 
The 0 bit is undefined for Page Directory Entries. 
When the P, A and 0 bits are updated by the In- 
tel486 Microprocessor, the processor generates a 
Read-Modify-Write cycle which locks the bus and 
prevents conflicts with other processors or perpheri- 
als. Software which modifies these bits should use 
the LOCK prefix to ensure the integrity of the page 
tables in multi-master systems. 


The 3 bits marked OS Reserved in Figure 4.20 and 
Figure 4.21 (bits 9-11) are software definable. OSs 
are free to use these bits for whatever purpose they 
wish. An example use of the OS Reserved 
bits 
would be to store information about page aging. By 
keeping track of how long a page has been in mem- 
ory since being accessed, an operating system can 
implement a page replacement algorithm like Least 
Recently Used. 


4.5.3 PAGE LEVEL PROTECTION 
(R/W, U/S BITS) 


The Intel486 Microprocessor provides a set of pro- 
tection attributes for paging systems. The paging 
mechanism distinguishes between two levels of pro- 
tection: User which corresponds to level 3 of the 
segmentation 
based 
pr6tection, 
and 
supervisor 
which encompasses all of the other protection levels 
(0,1,2). 


The R/W and U/S bits are used in conjunction with 
the WP bit in the flags register (EFLAGS). The 386 
Microprocessor does not contain the WP bit. The 
WP bit has been added to the Intel486 Microproces- 
sor to protect read-only pages from supervisor write 
accesses. The 386 Microprocessor allows a read- 
only page to be written from protection levels 0, 1 or 
2. WP= 0 is the 386 Microprocessor compatible 
mode. When WP= 0 the supervisor can write to a 
read-only page as defined by the U/S and R/W bits. 
When WP= 1 supervisor access to a read-only page 
(R/W = 0) will cause a page fault (exception 14). 


Table 4.4 shows the affect of the WP, U/S and R/W 
bits on accessing memory. When WP=O, the super- 
visor can write to pages regardless of the state of 
the R/W bit. When WP= 1 and R/W = 0 the supervi- 
sor cannot write to a read-only page. A user attempt 
to access a supervisor only page (U/S=O), or write 
to a read only page will cause a page fault (excep- 
tion 14). 


The R/W and U/S bits provide protection from user 
access on a page by page basis since the bits are 
contained in the Page Table Entry and the Page Di- 
rectory Table. The U/S and R/W bits in the first level 
Page Directory Table apply to all entries in the page 
table pointed to by that directory entry. The U/S and 
R/W bits in the second level Page Table Entry apply 
only to the page described by that entry. The most 
restrictive of the U/S and R/W bits from the Page 
Directory Table and the Page Table Entry are used 
to address a page. 


Example: If the U/S and R/W bits for the Page Di- 
rectory entry were 10 (user read/execute) and the 


U/S and R/W bits for the Page Table Entry were 01 
(no user access at all), the access rights for the 
page would be 01, the numerically smaller of the 
two. 


Note that a given segment can be easily made read- 
only for level 0, 1 or 2 via use of segmented protec- 
tion mechanisms. (Section 4.4 Protection). 


4.5.4 PAGE CACHEABILITY 
(PWT AND PCD BITS) 


PWT (page write through) and PCD (page cache dis- 
able) are two new bits defined in entries in both lev- 
els of the page table structure, the Page Directory 
Table and the Page Table Entry. PCD and PWT con- 
trol page cacheability and write policy. 


PWT controls write policy. PWT= 1 defines a write- 
through policy for the current page. PWT=0 allows 
the possibility of write-back. PWT is ignored internal- 
ly because the Intel486 microprocessor has a write- 
through cache. PWT can be used to control the write 
policy of a second level cache. 


PCD controls cacheability. PCD= 0 enables caching 
in the on-chip cache. PCD alone does not enable 
caching, it must be conditioned by the KEN# (cache 
enable) input signal and the state of the CD (cache 
disable bit) and NW (no write-through) bits in control 
register 0 (CRO).When PCD=1, caching is disabled 
regardless of the state of KEN#, CD and NW. (See 
Section 5.0, On-Chip Cache). 


The state of the PCDand PWT bits are driven out on 
the PCD and PWT pins during a memory access. 


The PWT and PCD bits for a bus cycle are obtained 
either from control register 3 (CR3), the Page Direc- 
tory Entry or the Page Table Entry, depending on the 
type of cycle run. However, when paging is disabled 
(PG = 0 in CRO)or for cycles which bypass paging 
(Le., I/O (input/output) references, INTR (interrupt 
request) and HALT cycles), the PCD and PWT bits 
of CR3 are ignored. The Intel486 CPU assumes PCD 
= 0 and PWT = 0 and drives these values on the 
PCD and PWT pins. 


When paging is enabled (PG= 1 in CRO), the bits 
from the page table entry are cached in the transla- 
tion lookaside buffer (TLB), and are driven any time 
the page mapped by the TLB entry is referenced. 
For normal memory cycles run with paging enabled, 
the PWT and PCD bits are taken from the Page Ta- 
ble Entry. During TLB refresh cycles when the Page 
Directory and Page Table entries are read, the PWT 
and PCD bits must be obtained elsewhere. The bits 
are taken from CR3 when a Page Directory Entry is 
being read. The bits are taken from the Page Direc- 
tory Entry when the Page Table Entry is being updat- 
ed. 


The PCDor PWT bits in CR3 are initialized to zero at 
reset, but can be set to any value by level 0 soft- 
ware. 


The Intel486 Microprocessor paging hardware is de- 
signed to support demand paged virtual memory 
systems. However, performance 
would 
degrade 
substantially if the processor was required to access 
two levels of tables for every memory reference. To 
solve this problem, the Intel486 Microprocessor 
keeps a cache of the most recently accessed pages, 
this cache is called the Translation Lookaside Buffer 
(TLB). The TLB is a four-way set associative 32-en- 
try page table cache. It automatically keeps the most 
commonly used Page Table Entries in the proces- 
sor. The 32-entry TLB coupled with a 4K page size, 
results in coverage of 128 Kbytes of memory ad- 
dresses. For many common multi-tasking systems, 
the TLB will have a hit rate of about 98%. This 
means that the processor will only have to access 
the two-level page structure on 2% of all memory 
references. Figure 4.22 illustrates how the TLB com- 
plements 
the 
Intel486 
Microprocessor's 
paging 
mechanism. 


Reading a new entry into the TLB (TLB refresh) is a 
two step process handled by the Intel486 microproc- 
essor hardware. The sequence of data cycles to per- 
form a TLB refresh are: 


U/S 
R/W 
WP 
User Access 
Supervisor Access 


0 
0 
0 
None 
Read/Write/Execute 
0 
1 
0 
None 
Read/Write/ Execute 
1 
0 
0 
Read/Execute 
Read/Write/ Execute 
1 
1 
0 
Read/Write/ Execute 
Read/Write/Execute 
0 
0 
1 
None 
Read/Execute 
0 
1 
1 
None 
Read/Write/Execute 
1 
0 
1 
Read/Execute 
Read/Execute 
1 
1 
1 
Read/Write/Execute 
Read/Write/Execute 
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1. 
Read the correct Page Directory Entry, as point- 
ed to by the page base register and the upper 
10 bits of the linear address. The page base 
register is in control register 3. 
1a. Optionally perform a locked read/write to set 
the accessed bit in the directory entry. The di- 
rectory entry will actually get read twice if the 
Intel486 Microprocessor needs to set any of the 
bits in the entry. If the page directory entry 
changes between the first and second reads, 
the data returned for the second read will be 
used. 
2. 
Read the correct entry in the Page Table and 
place the entry in the TLB. 
2a. Optionally perform a locked read/write to set 
the accessed and/or dirty bit in the page table 
entry. Again, note that the page table entry will 
actually get read twice if the Intel486 Microproc- 
essor needs to set any of the bits in the entry. 
Like the directory entry, if the data changes be- 
tween the first and second read the data re- 
turned for the second read will be used. 


Note that the directory entry must always be read 
into the processor, since directory entries are never 
placed in the paging TLB. Page faults can be sig- 
naled from either the page directory read or the 
page table read. Page directory and page table en- 
tries may be placed in the Intel486 on-chip cache 
just like normal data. 
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Figure 4.22. Translation 
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The paging hardware operates in the following fash- 
ion. The paging unit hardware receives a 32-bit lin- 
ear address from the segmentation unit. The upper 
20 linear address bits are compared with all 32 en- 
tries in the TLB to determine if there is a match. If 
there is a match (Le., a TLB hit), then the 32-bit 
physical address is calculated and will be placed on 
the address bus. 


However, if the page table entry is not in the TLB, 
the Intel486 Microprocessor will read the appropri- 
ate Page Directory Entry. If P = 1 on the Page Di- 
rectory Entry indicating that the page table is in 
memory, then the Intel486 Microprocessor will read 
the appropriate Page Table Entry and set the Ac- 
cess bit. If P = 1 on the Page Table Entry indicating 
that the page is in memory, the Intel486 Microproc- 
essor will update the Access and Dirty bits as need- 
ed and fetch the operand. The upper 20 bits of the 
linear address, read from the page table, will be 
stored in the TLB for future accesses. However, if 
P = 0 for either the Page Directory Entry or the 
Page Table Entry, then the processor will generate a 
page fault, an Exception 14. 


The processor will also generate an exception 14 
page fault, if the memory reference violated the 
page protection attributes (Le., U/S or R/W) (e.g., 
trying to write to a read-only page). CR2 will hold the 
linear address which caused the page fault. If a sec- 
ond page fault occurs, while the processor is at- 
tempting to enter the service routine for the first, 
then the processor will invoke the page fault (excep- 
tion 14) handler a second time, rather than the dou- 
ble fault (exception 8) handler. Since Exception 14 is 
classified as a fault, CS: EIP will point to the instruc- 
tion causing the page fault. The 16-bit error code 
pushed as part of the page fault handler will contain 
status bits which indicate the cause of the page 
fault. 


The 16-bit error code is used by the operating sys- 
tem to determine how to handle the page fault. Fig- 
ure 4.23a shows the format of the page-fault error 
code and the interpretation of the bits. 


NOTE: 
Even though the bits in the error code (U/S, W/R, 
and P) have similar names as the bits in the Page 
Directory/Table Entries, the interpretation of the er- 
ror code bits is different. Figure 4.23b indicates 
what type of access caused the page fault. 


U 
U U U U U U U U U U U U U U 
W P 
S R 


Figure 4.23a. Page Fault Error Code Format 


U/S: The U/S bit indicates whether the access 
causing the fault occurred when the processor was 
executing in User Mode (U/S = 1) or in Supervisor 
mode (U/S = 0). 


W/R: The W/R bit indicates whether the access 
causing the fault was a Read (W/R = 0) or a Write 
(W/R = 1). 
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P: The P bit indicates whether a page fault was 
caused by a not-present page (P = 0), or by a page 
level protection violation (P = 1). 


U/S 
W/R 
Access Type 


0 
0 
Supervisoro Read 
0 
1 
Supervisor Write 
1 
0 
User Read 
1 
1 
User Write 


*Descriptor table access will fault with U/S 
~ 0, even if the program 
is executing at level 3. 


Figure 4.23b. Type of Access 
Causing Page Fault 


The Intel486 Microprocessor takes care of the page 
address translation process, relieving the burden 
from an operating system in a demand-paged sys- 
tem. The operating system is responsible for setting 
up the initial page tables, and handling any page 
faults. The operating system also is required to inval- 
idate (i.e., flush) the TLB when any changes are 
made to any of the page table entries. The operating 
system must reload CR3 to cause the TLB to be 
flushed. 


Setting up the tables is simply a matter of loading 
CR3 with the address of the Page Directory, and 
allocating space for the Page Directory and the 
Page Tables. The primary responsibility of the oper- 
ating system is to implement a swapping policy and 
handle all of the page faults. 


A final concern of the operating system is to ensure 
that the TLB cache matches the information in the 
paging tables. In particular, any time the operating 
system sets the P present bit of page table entry to 
zero, the TLB must be flushed. Operating systems 
may want to take advantage of the fact that CR3 is 
stored as part of a TSS, to give every task or group 
of tasks its own set of page tables. 


The Intel486 Microprocessor allows the execution of 
8086 application programs in both Real Mode and in 
the Virtual 8086 Mode (Virtual Mode). Of the two 
methods, Virtual 8086 Mode offers the system de- 
signer the most flexibility. The Virtual 8086 Mode al- 
lows the execution of 8086 applications, while still 
allowing the system designer to take full advantage 
of the Intel486 Microprocessor protection mecha- 


nism. In particular, the Intel486 Microprocessor al- 
lows the simultaneous execution of 8086 operating 
systems and its applications, and an Intel486 Micro- 
processor operating system and both 80286 and In- 
tel486 Microprocessor applications. Thus, in a multi- 
user Intel486 Microprocessor computer, one person 
could be running an MS-DOS spreadsheet, another 
person using MS-DOS, and a third person could be 
running multiple Unix utilities and applications. Each 
person in this scenario would believe that he had the 
computer completely to himself. Figure 4.24 illus- 
trates this concept. 


4.6.2 VIRTUAL 8086 MODE ADDRESSING 
MECHANISM 


One of the major differences between Intel486 Mi- 
croprocessor Real and Protected modes is how the 
segment selectors are interpreted. When the proc- 
essor is executing in Virtual 8086 Mode the segment 
registers are used in an identical fashion to Real 
Mode. The contents of the segment register is shift- 
ed left 4 bits and added to the offset to form the 
segment base linear address. 


The Intel486 Microprocessor allows the operating 
system to specify which programs use the 8086 
style address mechanism, and which programs use 
Protected Mode addressing, on a per task basis. 
Through the use of paging, the one megabyte ad- 
dress space of the Virtual Mode task can be mapped 
to anywhere in the 4 gigabyte linear address space 
of the Intel486 Microprocessor. Like Real Mode, Vir- 
tual Mode effective addresses (Le.,segment offsets) 
that exceed 64 Kbyte will cause an exception 13. 
However, these restrictions should not prove to be 
important, because most tasks running in Virtual 
8086 Mode will simply be existing 8086 application 
programs. 


The paging hardware allows the concurrent running 
of multiple Virtual Mode tasks, and provides protec- 
tion and operating system isolation. Although it is 
not strictly necessary to have the paging hardware 
enabled to run Virtual Mode tasks, it is needed in 
order to run multiple Virtual Mode tasks or to relo- 
cate the address space of a Virtual Mode task to 
physical address space greater than one megabyte. 


The paging hardware allows the 20-bit linear ad- 
dress produced by a Virtual Mode program to be 
divided into up to 256 pages. Each one of the pages 
can be located anywhere within the maximum 4 gig- 
abyte physical address space of the Intel486 Micro- 
processor. In addition, since CR3 (the Page Directo- 
ry Base Register) is loaded by a task switch, each 
Virtual Mode task can use a different mapping 
scheme to map pages to different physical locations. 


Finally, the paging hardware allows the sharing of 
the 8086 operating system code between multiple 
8086 applications. Figure 4.24 shows how the In- 
tel486 
Microprocessor 
paging hardware enables 
multiple 8086 programs to run under a virtual memo- 
ry demand paged system. 


4.6.4 
PROTECTION 
AND I/O 
PERMISSION 
BITMAP 


All Virtual 8086 Mode programs execute at privilege 
level 3, the level of least privilege. As such, Virtual 
8086 Mode programs are subject to all of the protec- 
tion checks defined in Protected Mode. (This is dif- 
ferent from Real Mode which implicitly is executing 
at privilege level 0, the level of greatest privilege.) 
Thus, an attempt to execute a privileged instruction 
when in Virtual 8086 Mode will cause an exception 
13 fault. 


The following are privileged instructions, which may 
be executed only at Privilege Level O. Therefore, at- 
tempting to execute these instructions in Virtual 
8086 Mode (or anytime CPL > 0) causes an excep- 
tion 13 fault: 
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LInT; 
LGDT; 
LMSW; 
CLTS; 
HLT; 


MOV DRn,reg; 
MOV TRn, reg; 
MOV CRn,reg: 


MOV reg,DRn; 
MOV reg, 
TRn: 
MOV reg,CRn. 


Several instructions, particularly those applying to 
the multitasking model and protection model, are 
available only in Protected Mode. Therefore, at- 
tempting to execute the following instructions in 
Real Mode or in Virtual 8086 Mode generates an 
exception 6 fault: 


LTR: 
LLDT; 
LAR: 
LSL: 
ARPL. 


STR: 
SLDT: 
VERR: 
VERW: 


The instructions which are IOPL-sensitive in Protect- 
ed Mode are: 


IN: 
OUT: 
INS: 
OUTS; 
REP 
INS: 
REP 
OUTS: 


STI: 
CLI 


• 
TASK 1 
• 
8086 OS 
1I£1I0RY 
1I£1I0RY 


I177J 
TASK 2 
~ 
386 TlI CPU OS 


""" 
1I£1I0RY 
~ 
1I£1I0RY 
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In Virtual 8086 Mode, a slightly different set of in- 
structions are made 10PL-sensitive.The following in- 
structions are 10PL-sensitive in Virtual 8086 Mode: 
INT n; 
STI; 
PUSHF; 
eLl; 
POPF; 
IRET 


The PUSHF, POPF, and IRET instructions are 10PL- 
sensitive in Virtual 8086 Mode only. This provision 
allows the IF flag (interrupt enable flag) to be virtual- 
ized to the Virtual 8086 Mode program. The INT n 
software interrupt instruction is also 10PL-sensitive 
in Virtual 8086 Mode. Note, however, that the INT 3 
(opcode OCCH),INTO, and BOUND instructions are 
not 10PL-sensitive in Virtual 8086 mode (they aren't 
10PL sensitive in Protected Mode either). 


Note that the I/O instructions (IN, OUT, INS, OUTS, 
REP INS, and REP OUTS) are not 10PL-sensitive in 
Virtual 8086 mode. Rather, the I/O instructions be- 
come automatically sensitive to the 1/0 Permission 
Bltmap contained in the Intel486 
Microprocessor 
Task 
State 
Segment. 
The I/O Permission Bitmap, 
automatically used by the Intel486 Microprocessor 
in Virtual 8086 Mode, is illustrated by Figures 4.15a 
and 4.15b. 


The I/O Permission Bitmap can be viewed as a 0- 
64 Kbit bit string, which begins in memory at offset 
BiLMap_Offset 
in the current TSS. BiLMap_ 
Offset must be ,,; DFFFH so the entire bit map and 
the byte FFH which follows the bit map are all at 
offsets ,,; FFFFH from the TSS base. The 16-bit 
pointer BiLMap_Offset 
(15:0) is found in the word 
beginning at offset 66H (102 decimal) from the TSS 
base, as shown in Figure 4.15a. 


Each bit in the I/O Permission Bitmap corresponds 
to a single byte-wide I/O port, as illustrated in Figure 
4.15a. If a bit is 0, I/O to the corresponding byte- 
wide port can occur without generating an excep- 
t!on. Otherwis~ the I/O instruction causes an excep- 
tion 13 fault. Since every byte-wide I/O port must be 
protectable, all bits corresponding to a word-wide or 
dword-wide port must be 0 for the word-wide or 
dword-wide I/O to be permitted. If all the referenced 
bits are 0, the I/O will be allowed. If any referenced 
bits are 1, the attempted I/O will cause an exception 
13 fault. 


Due to the use of a pointer to the base of the I/O 
Permission Bitmap, the bitmap may be located any- 
where within the TSS, or may be ignored completely 
by pointing the BiLMap_Offset 
(15:0) beyond the 
limit of the TSS segment. In the same manner, only 
a small portion of the 64K I/O space need have an 
associated map bit, by adjusting the TSS limit to 
truncate the bitmap. This eliminates the commitment 
of 8K of memory when a complete bitmap is not 
required, while allowing the fully general case if de- 
sired. 


EXAMPLE OF BITMAP FOR I/O PORTS 0-255: 
Setting the TSS limit to {biLMap_Offset 
+ 31 
+ 1""1 ["" see note below] will allow a 32-byte bit- 
map for the I/O ports #0-255, 
plus a terminator 
byte of all 1's ["" see note below]. This allows the 
I/O bitmap to control I/O Permission to I/O port 0- 
255 while causing an exception 13 fault on attempt- 
ed I/O to any I/O port 80256 through 65,565. 


""IMPORTANT 
IMPLEMENTATION 
NOTE: Beyond 
the last byte of I/O mapping information in the I/O 
Permission Bitmap must be a byte containing all 1'so 
The byte of all 1's must be within the limit of the 
Intel486 Microprocessor TSS segment (see Figure 
4.15a). 


In order to fully support the emulation of an 8086 
machine, interrupts in Virtual 8086 Mode are han- 
dled in a unique fashion. When running in Virtual 
Mode all interrupts and exceptions involve a privi- 
lege change back to the host Intel486 Microproces- 
sor operating system. The Intel486 Microprocessor 
operating system determines if the interrupt comes 
from a Protected Mode application or from a Virtual 
Mode program by examining the VM bit in the 
EFLAGS image stored on the stack. 


When a Virtual Mode program is interrupted and ex- 
ecution passes to the interrupt routine at level 0, the 
VM bit is cleared. However, the VM bit is still set in 
the EFLAG image on the stack. 


The Intel466 Microprocessor operating system in 
turn handles the exception or interrupt and then re- 
turns control to the 8086 program. The Intel486 Mi- 
croprocessor operating system may choose to let 
the 8066 operating system handle the interrupt or it 
may emulate the function of the interrupt handler. 
For example, many 6086 operating system calls are 
accessed by PUSHing parameters on the stack, and 
then executing an INT n instruction. If the 10PLis set 
to 0 then all INT n instructions will be intercepted by 
the Intel486 Microprocessor operating system. The 
Intel466 Microprocessor operating system could em- 
ulate the 8086 operating system's call. Figure 4.25 
shows how the Intel486 Microprocessor operating 
system could intercept an 8066 operating system's 
call to "Open a File". 


An Intel466 Microprocessor operating system can 
provide a Virtual 8086 Environment which is totally 
~ransparentto the application software via intercept- 
,ng and then emulating 8086 operating system's 
calls, and intercepting IN and OUT instructions. 


• 


4.6.6 
ENTERING 
AND 
LEAVING 
VIRTUAL 
8086 MODE 


Virtual 
8086 mode is entered 
by executing 
an IRET 
instruction 
(at CPL=O), 
or Task Switch 
(at any CPL) 
to an Intel486 
Microprocessor 
task whose 
Intel486 
Microprocessor 
TSS has a FLAGS image containing 
a 1 in the VM bit position 
while the processor 
is exe- 
cuting in Protected 
Mode. That is, one way to enter 
Virtual 
8086 mode is to switch 
to a task with an In- 
tel486 
Microprocessor 
TSS that 
has a 1 in the VM 
bit in the EFLAGS 
image. 
The other way is to exe- 
cute 
a 32-bit 
IRET 
instruction 
at privilege 
level 
0, 
where the stack has a 1 in the VM bit in the EFLAGS 
image. POPF does not affect 
the VM bit, even if the 
processor 
is in Protected 
Mode 
or level 0, and so 
cannot 
be used to enter Virtual 8086 Mode. PUSHF 
always pushes 
a 0 in the VM bit, even if the proces- 
sor is in Virtual 
8086 Mode, so that a program 
can- 
not tell if it is executing 
in REAL mode, or in Virtual 
8086 mode. 


The VM bit can be set by executing 
an IRET instruc- 
tion only at privilege 
level 0, or by any instruction 
or 
Interrupt 
which 
causes 
a task 
switch 
in Protected 
Mode 
(with VM = 1 in the new FLAGS 
image), 
and 
can be cleared 
only by an interrupt 
or exception 
in 
Virtual 8086 Mode. IRET and POPF instructions 
exe- 
cuted 
in REAL 
mode or Virtual 
8086 
mode will not 
change 
the value in the VM bit. 


The transition 
out of virtual 
8086 
mode to Intel486 
Microprocessor 
protected 
mode 
occurs 
only on re- 
ceipt 
of an interrupt 
or exception 
(such as due to a 
sensitive 
instruction). 
In Virtual 8086 mode, all inter- 
rupts 
and exceptions 
vector 
through 
the protected 
mode lOT, and enter an interrupt 
handler 
in protect- 
ed Intel486 
Microprocessor 
mode. That is, as part of 
interrupt 
processing, 
the VM bit is cleared. 


Because 
the matching 
IRET must occur from level 0, 
if an Interrupt 
or Trap Gate is used to field an inter- 
rupt or exception 
out of Virtual 8086 mode, the Gate 
must perform 
an inter-level 
interrupt 
only to level O. 
Interrupt 
or Trap 
Gates 
through 
conforming 
seg- 
ments, or through 
segments 
with DPL> 0, will raise a 
GP fault with the CS selector 
as the error code. 


4.6.6.1 
Task 
Switches 
To/From 
Virtual 
8086 Mode 


Tasks which can execute 
in virtual 8086 mode must 
be described 
by a TSS with the new Intel486 
Micro- 
processor 
format 
(TYPE 9 or 11 descriptor). 


A task switch 
out of virtual 
8086 mode will operate 
exactly 
the same 
as any other 
task switch 
out of a 
task with an Intel486 
Microprocessor 
TSS. All of the 
programmer 
visible 
state, 
including 
the FLAGS 
reg- 
ister with the VM bit set to 1, is stored 
in the TSS. 
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The segment 
registers 
in the TSS will contain 
8086 
segment 
base values 
rather than selectors. 


A task 
switch 
into a task 
described 
by an Intel486 
Microprocessor 
TSS will have an additional 
check to 
determine 
if the incoming 
task should be resumed 
in 
virtual 8086 mode. Tasks described 
by 80286 format 
TSSs 
cannot 
be resumed 
in virtual 
8086 
mode, 
so 
no check 
is required 
there 
(the 
FLAGS 
image 
in 
80286 format 
TSS has only the low order 16 FLAGS 
bits). 
Before 
loading 
the 
segment 
register 
images 
from 
an Intel486 
Microprocessor 
TSS, the 
FLAGS 
image 
is loaded, 
so that the segment 
registers 
are 
loaded 
from the TSS image as 8086 segment 
base 
values. 
The task 
is now ready to resume 
in virtual 
8086 execution 
mode. 


4.6.6.2 
Transitions 
Through 
Trap 
and Interrupt 
Gates, 
and IRET 


A task switch is one way to enter or exit virtual 8086 
mode. The other method 
is to exit through 
a Trap or 
Interrupt 
gate, as part of handling 
an interrupt, 
and 
to enter 
as part 
of executing 
an IRET 
instruction. 


The transition 
out must use an Intel486 
Microproc- 
essor Trap Gate (Type 14), or Intel486 
Microproces- 
sor Interrupt 
Gate (Type 
15), which 
must point to a 
non-conforming 
level 0 segment 
(DPL = 0) in order 
to permit the trap handler to IRET back to the Virtual 
8086 program. 
The Gate 
must point to a non-con- 
forming 
level 0 segment 
to perform 
a level switch to 
level 0 so that the matching 
IRET can change 
the 
VM bit. Intel486 
Microprocessor 
gates must be used, 
since 80286 
gates 
save only the low 16 bits of the 
FLAGS register, 
so that the VM bit will not be saved 
on transitions 
through 
the 
80286 
gates. 
Also, 
the 
16-bit 
IRET 
(presumably) 
used 
to 
terminate 
the 
80286 
interrupt 
handler 
will pop only the 
lower 
16 
bits from FLAGS, and will not affect the VM bit. The 
action taken for an Intel486 
Microprocessor 
Trap or 
Interrupt 
gate if an interrupt 
occurs 
while the task is 
executing 
in virtual 8086 mode is given by the follow- 
ing sequence. 


(1) Save the FLAGS register in a temp to push later. 
Turn off the VM and TF bits, and if the interrupt 
is serviced 
by an Interrupt 
Gate, turn off IF also. 


(2) 
Interrupt 
and Trap 
gates 
must 
perform 
a level 
switch 
from 
3 (where 
the 
VM86 
program 
exe- 
cutes) to level 0 (so IRET can return). 
This pro- 
cess involves 
a stack 
switch 
to the stack 
given 
in the TSS for privilege 
level O. Save the Virtual 
8086 
Mode SS and ESP registers 
to push in a 
later step. The segment 
register 
load of SS will 
be done 
as a Protected 
Mode 
segment 
load, 
since the VM bit was turned 
off above. 


8086 Application makes "Open File Call" -+ causes 
General Protection Fault (Arrow # 1) 
Virtual 8086 Monitor intercepts call. Calls Inte1486'" CPU OS (Arrow # 2) 
Intel486TMCPU OS opens file returns contralto 8086 OS (Arrow #3) 
8086 OS returns contralto 
application. (Arrow #4) 
Transparent to Application 


(3) Push the 8086 segment register values onto the 
new stack, in the order: GS, FS, DS, ES. These 
are pushed as 32-bit quantities, with undefined 
values in the upper 16 bits. Then load these 4 
registers with null selectors (0). 


(4) Push the old 8086 stack pointer onto the new 
stack by pushing the SS register (as 32-bits, high 
bits undefined), then pushing the 32-bit ESP reg- 
ister saved above. 


(5) Push the 32-bit FLAGS register saved in step 1. 
(6) Push the old 8086 instruction pointer onto the 
new stack by pushing the CS register (as 32-bits, 
high bits undefined), then pushing the 32-bit EIP 
register. 


(7) Load up the new CS:EIP value from the interrupt 
gate, and begin execution of the interrupt routine 
in protected Intel486 Microprocessor mode. 


The transition out of virtual 8086 mode performs a 
level change and stack switch, in addition to chang- 


ing back to protected mode. In addition, all of the 
8086 segment register images are stored on the 
stack (behind the SS:ESP image), and then loaded 
with null (0) selectors before entering the interrupt 
handler. This will permit the handler to safely save 
and restore the DS, ES, FS, and GS registers as 
80286 selectors. This is needed so that interrupt 
handlers which don't care about the mode of the 
interrupted program can use the same prolog and 
epilog code for state saving (Le., push all registers in 
prolog, pop all in epilog) regardless of whether or not 
a "native" mode or Virtual 8086 mode program was 
interrupted. Restoring null selectors to these regis- 
ters before executing the IRET will not cause a trap 
in the interrupt handler. Interrupt routines which ex- 
pect values in the segment registers, or return val- 
ues in segment registers will have to obtain/return 
values from the 8086 register images pushed onto 
the new stack. They will need to know the mode of 
the interrupted program in order to know where to 
find/return segment registers, and also to know how 
to interpret segment register values. 
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The IRET instruction will perform the inverse of the 
above sequence. Only the extended Intel486 Micro- 
processors IRET instruction (operand size=32) can 
be used, and must be executed at level 0 to change 
the VM bit to 1. 
(1) If the NT bit in the FLAGs register is on, an inter- 
task return is performed. The current state is 
stored in the current TSS, and the link field in the 
current TSS is used to locate the TSS for the 
interrupted task which is to be resumed. 
Otherwise, 
continue 
with 
the 
following 
se- 
quence. 
(2) Read the FLAGS image from SS:8[ESP] into the 
FLAGS register. This will set VM to the value 
active in the interrupted routine. 
(3) Pop off the instruction pointer CS:EIP. EIP is 
popped first, then a 32-bit word is popped which 
contains the CS value in the lower 16 bits. If 
VM = 0, this CS load is done as a protected 
mode segment load. If VM = 1, this will be done 
as an 8086 segment load. 


(4) Increment the ESP register by 4 to bypass the 
FLAGS image which was "popped" in step 1. 


(5) If VM = 1, load segment registers ES, OS, FS, 


and GS from memory locations SS:[ESP+8], 
SS:[ESP+ 12], 
SS:[ESP+ 16], 
and 
SS:[ESP+ 20], respectively, where the new val- 
ue of ESP stored in step 4 is used. Since VM = 1, 
these are done as 8086 segment register loads. 
Else if VM =0, check that the selectors in ES, 
OS, FS, and GS are valid in the interrupted rou- 
tine. Null out invalid selectors to trap if an at- 
tempt is made to access through them. 
(6) If (RPL(CS) > 
CPL), pop the stack pointer 
SS:ESP from the stack. The ESP register is 
popped first, followed by 32-bits containing SS in 
the lower 16 bits. If VM= 0, SS is loaded as a 
protected mode segment register load. If VM= 1, 
an 8086 segment register load is used. 


(7) Resume execution of the interrupted routine. The 
VM bit in the FLAGS register (restored from the 
interrupt routine's stack image in step 1) deter- 
mines whether the processor resumes the inter- 
rupted routine in Protected mode of Virtual 8086 
mode. 


To meet its performance goals the Intel486 Micro- 
processor contains an eight Kbyte cache. The cache 
is software transparent to maintain binary compati- 
bility with previous generations of the InteI386™/ln- 
tel486TMArchitecture. 


The on-chip cache has been designed for maximum 
flexibility and performance. The cache has several 
operating modes offering flexibility duriflg program 
execution and debugging. Memory areas can be de- 
fined as non-cacheable by software and external 
hardware. Protocols for cache line invalidations and 
replacement are implemented in hardware, easing 
system design. 


The on-chip cache is a unified code and data cache. 
The cache is used for both instruction and data ac- 
cesses and acts on physical addresses. 


The cache organization is 4-way set associative and 
each line is 16 bytes wide. The eight Kbytes of 
cache memory are logically organized as 128 sets, 
each containing four lines. 


\ 


The cache memory is physically split into four 
2-Kbyte blocks each containing 128 lines (see Fig- 
ure 5.1). Associated with each 2-Kbyte block are 
128 21-bit tags. There is a valid bit for each line in 
the cache. Each line in the cache is either valid or 
not valid. There are no provisions for partially valid 
lines. 


r- 16-Byte 
Line Slze--j 
O 


k 
Bytes 
~ 
Sets 


~ 


r- 3 lRU --t-- ~Valid ----j 
I 
Bits 
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Bits 
I 
I 
I~ 
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The write strategy of on-chip cache is write-through. 
All writes will drive an external write bus cycle in 
addition to writing the information to the internal 
cache if the write was a cache hit. A write to an 
address not contained in the internal cache will only 
be written to external memory. Cache allocations 
are not made on write misses. 


Control of the cache is provided by the CD and NW 
bits in CRO.CD enables and disables the cache. NW 
controls memory write-through and invalidates. 


The CD and NW bits define four operating modes of 
the on-chip cache as given in Table 5.1. These 
modes provide flexibility in how the on-chip cache is 
used. 


CD 
NW 
Operating 
Mode 


1 
1 
Cache fills disabled, write-through and 
invalidates disabled 
1 
0 
Cache fills disabled, write-through and 
invalidates enabled 
0 
1 
INVALID. IF CROis loaded with this 
configuration of bits, a GP fault with 
error code of 0 is raised. 


0 
0 
Cache fills enabled, write-through and 
invalidates enabled 


CD=1, 
NW=1 
The cache is completely disabled by setting 
CD= 1 and NW= 1 and then flushing the 
cache. This mode may be useful for debug- 
ging programs where it is important to see 
all memory cycles at the pins. Writes which 
hit in the cache will not appear on the exter- 
nal bus. 
It is possible to use the on-chip cache as 
fast static RAM by "pre-loading" 
certain 
memory areas into the cache and then set- 
ting CD= 1 and NW= 1. Pre-loading can be 
done by careful choice of memory refer- 
ences with the cache turned on or by use of 
the testability functions (see Section 8.2). 
When the cache is turned off the memory 
mapped by the cache is "frozen" into the 
cache since fills and invalidates are dis- 
abled. 


intelGP 


CD=1, 
NW=O 
Cache fills are disabled but write-throughs 
and invalidates are enabled. This mode is 
the same as if the KEN'" pin was strapped 
HIGH disabling cache fills. Write-throughs 
and invalidates may still occur to keep the 
cache valid. This mode is useful if the soft- 
ware must disable the cache for a short pe- 
riod of time, and then re-enable it without 
flushing the original contents. 


CD=O, NW=1 
INVALID. If CROis loaded with this bit con- 
figuration, a General Protection fault with 
error code of 0 is raised. Note that this 
mode would imply a non-transparent write- 
back cache. A future processor may define 
this combination of bits to implement a 
write-back cache. 
CD=O, NW=O 
This is the normal operating mode. 


Completely disabling the cache is a two step pro- 
cess. First CD and NW must be set to 1 and then the 
cache must be flushed. If the cache is not flushed, 
cache hits on reads will still occur and data will be 
read from the cache. 


Any area of memory can be cached in the Intel486 
Microprocessor. Non-cacheable portions of memory 
can be defined by the external system or by soft- 
ware. The external system can inform the Intel486 
Microprocessor that a memory address is non- 
cacheable by returning the KEN'" pin inactive during 
a memory access (refer to Section 7.2.3). Software 
can prevent certain pages from being cached by set- 
ting the PCD bit in the page table entry. 


A read request can be generated from program op- 
eration or by an instruction pre-fetch. The data will 
be supplied from the on-chip cache if a cache hit 
occurs on the read address. If the address is not in 
the cache, a read request for the data is generated 
on the external bus. 


If the read request is to a cacheable portion of mem- 
ory, the Intel486 Microprocessor initiates a cache 
line fill. During a line fill a 16-byte line is read into the 
Intel486 Microprocessor. 


Cache fills will only be generated for read misses. 
Write misses will never cause a line in the internal 
cache to be allocated. If a cache hit occurs on a 
write, the line will be updated. 


Cache line fills can be performed over 8- and 16-bit 
busses using the dynamic bus sizing feature. Refer 
to Section 7.1.3 for a description of dynamic bus 
sizing. 


Refer to Section 7.2.3 for further information on 
cacheable cycles. 


The Intel486 Microprocessor contains both a hard- 
ware and software mechanism for invalidating lines 
in its internal cache. Cache line invalidations are 
needed 
to 
keep 
the 
Intel486 
Microprocessor's 
cache contents consistent with external memory. 


Refer to Section 7.2.8 for further information on 
cache line invalidations. 


When a line needs to be placed in its internal cache 
the Intel486 Microprocessor first checks to see if 
there is a non-valid line in the set that can be re- 
placed. If all four lines in the set are valid, a pseudo 
least-recently-used mechanism is used to determine 
which line should be replaced. 


A valid bit is associated with each line in the cache. 
When a line needs to be placed in a set, the four 


valid bits are checked to see if there is a non-valid 
line that can be replaced. If a non-valid line is found, 
that line is marked for replacement. 


The four lines in the set are labeled 10,11,12,and 13. 
The order in which the valid bits are checked during 
an invalidation is 10,11,12and 13.All valid bits are 
cleared when the processor is reset or when the 
cache is flushed. 


Replacement in the cache is handled by a pseudo 
least recently used (LRU) mechanism when all four 
lines in a set are valid. Three bits, 80, 81 and 82, 
are defined for each of the 128 sets in the cache. 
These bits are called the LRU bits. The LRU bits are 
updated for every hit or replace in the cache. 


If the most recent access to the set was to 10or 11, 
80 is set to 1. 80 is set to 0 if the most recent ac- 
cess was to 12or 13.If the most recent access to 
10:11was to 10,81 is set to 1, else 81 is set to o. If 
the most recent access to 12:13was to 12,82 is set to 
1, else 82 is set to O. 


The pseudo LRU mechanism works in the following 
manner. When a line must be replaced, the cache 
will first select which of 10:11and 12:13was least re- 
cently used. Then the cache will determine which of 
the two lines was least recently used and mark it for 
replacement. This decision tree is shown in Figure 
5.2. When the processor is reset or when the cache 
is flushed all 128 sets of three LRU bits are set to O. 


BO=07 
Yes: 10or 11leost recently used 
No: 12or 13leost recently used 


Bl =07 
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Replace 
Replace 
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~ 
Replace 
Replace 
12 
13 
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Two bits for cache control, PWT and PCD, are de- 
fined in the page table and page directory entries. 
The state of these bits are driven out on the PWT 
and PCD pins during memory access cycles. 


The PWT bit controls write policy for second level 
caches used with the Intel486 Microprocessor. Set- 
ting PWT= 1 defines a write-through policy for the 
current page while PWT= 0 allows the possibility of 
write-back. The state of PWT is ignored internally by 
the Intel486 Microprocessor since the on-chip cache 
is write through. 


The PCD bit controls cacheability on a page by page 
basis. The PCD bit is internally ANDed with the 
KEN# signal to control cacheability on a cycle by 
cycle basis (see Figure 5.3). PCD= 0 enables cach- 
ing while PCD= 1 forbids it. Note that cache fills are 
enabled when PCD=O AND KEN# =0. This logical 
AND is implemented physically with a NOR gate. 


The state of the PCD bit in the page table entry is 
driven on the PCDpin when a page in external mem- 
ory is accessed. The state of the PCD pin informs 
the external system of the cacheability of the re- 
quested information. The external system then re- 
turns KEN" 
telling the Intel486 Microprocessor if 


the area is cacheable. The Intel486 Microprocessor 
initiates a cache line fill if PCD and KEN" 
indicate 


that the requested information is cacheable. 
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The PCD bit is masked with the CD (cache disable) 
bit in control register 0 to determine the state of the 
PCD pin. If CD= 1 the Intel486 Microprocessor 
forces the PCD pin HIGH. If CD=O the PCD pin is 
driven with the value for the page table entry/direc- 
tory. See Figure 5.3. 


The PWT and PCD bits for a bus cycle are obtained 
from either CR3, the page directory or page table 
entry. These bits are assumed to be zero during real 
mode, whenever paging is disabled, or for cycles 
that bypass paging, (I/O references, interrupt ac- 
knowledge and Halt cycles), the PWT and PCD bits 
are taken from CR3. These bits are initialized to 0 on 
reset, but can be set to any value by level 0 soft- 
ware. 


When paging is enabled, the bits from the page table 
entry are cached in the TLB, and are driven any time 
the page mapped by the TLB entry is referenced. 
For normal memory cycles, PWT and PCD are taken 
from the page table entry. During TLB refresh cycles 
where the page table and directory entries are read, 
the PWT and PCD bits must be obtained elsewhere. 
During page table updates the bits are obtained from 
the page directory. When the page directory is up- 
dated the bits are obtained from CR3. 


5.7 
Cache Flushing 


The on-chip cache can be flushed by external hard- 
ware or by software instructions. Flushing the cache 
clears all valid bits for all lines in the cache. The 
cache is flushed when external hardware asserts the 
FLUSH# pin. 


The flush pin needs to be asserted for one clock if 
driven synchronously or for two clocks if driven 
asynchronously. The flush input is asynchronous but 
setup and hold times must be met. The flush pin 
should be deasserted after the cache flush is com- 
plete. Failure to deassert the pin will cause execu- 
tion to stop as the processor will be repeatedly flush- 
ing the cache. If external hardware activates flush in 
response to an I/O write, flush must be asserted for 
at least two clocks prior to ready being returned for 
the I/O write. This ensures that the flush completes 
before the CPU begins execution of the instruction 
following the OUT instruction. 


Flush is recognized during HOLD just like EADS#. 


The instructions INVD and WBINVD cause the on- 
cache to be flushed. External caches connected to 
the Intel486 microprocessor are signalled to flush 
their contents when these instructions are executed. 


WBINVD will cause an external write-back cache to 
write back dirty lines before flushing its contents. 
The external cache is signalled using the bus cycle 
definition pins and the byte enables (refer to Section 


6.2.5 for the bus cycle definition pins and Section 
7.2.11 for special bus cycles). Refer to the Intel486 
Microprocessor programmers reference manual for 
detailed instruction definitions. 


The results of the INVD and WBINVD instructions 
are identical for the operation of the Intel486 Micro- 
processor's on-chip cache since the cache is write- 
through. Note that the INVD and WBINVD instruc- 
tions are machine dependent. Future members of 
the Intel486 Microprocessor family may change the 
definition of this instruction. 


5.8 
Caching Translation 
Lookaside 
Buffer Entries 


The Intel486 Microprocessor contains an integrated 
paging unit with a translation lookaside buffer (TLB). 
The TLB contains 32 entries. The TLB has been en- 
hanced over the 386 Microprocessor's TLB by up- 
grading the replacement strategy to a pseudo-LRU 
(least recently used) algorithm. The pseudo-LRU re- 
placement algorithm is the same as that used in the 
on-chip cache. 


The paging TLB operation is automatic whenever 
paging is enabled. The TLB contains the most re- 
cently used page table entries. A page table entry 
translates the linear address pointing to a particular 
page to the physical address where the page is 
stored in memory (refer to Section 4.5, Paging). 


The paging unit will look up the linear address in the 
TLB in response to an internal bus request. The cor- 
responding physical address is passed on to the on- 
chip cache or the external bus (in the event of a 
cache miss) when the linear address is present in 
the TLB. 


The paging unit will access the page tables in exter- 
nal memory if the linear address is not in the TLB. 
The required page table entry will be read into the 
TLB and then the cache or bus cycle for the actual 
data will take place. The process of reading a new 
page table entry into the TLB is called a TLB refresh. 


A TLB refresh is a two step process. The paging unit 
must first read the page directory entry which points 
to the appropriate page table. The page table entry 
to be stored in the TLB is then read from the page 
table. Control register 3 (CR3) points to the base of 
the page directory table. 


The Intel486 Microprocessor will allow page directo- 
ry and page table entries (returned during TLB re- 
freshes) to be stored in the on-chip cache. Setting 
the PCD bits in CR3 and the page directory entry to 
1 will prevent the page directory and page table en· 
tries from being stored in the on-chip cache (see 
Section 5.6, Page Cacheabllity). 


• 


The Intel486 Microprocessor bus has been designed 
to be similar to the 386 Microprocessor bus whenev- 
er possible. Several new features have been added 
to the Intel486 Microprocessor bus resulting in in- 
creased performance and functionality. New fea- 
tures include a 1X clock, a burst bus mechanism for 
high-speed internal cache fills, a cache line invalida- 
tion mechanism, enhanced bus arbitration capabili- 
ties, a Bs8# 
bus sizing mechanism and parity sup- 
port. 


The Intel486 Microprocessor is driven by a 1X clock 
as opposed to a 2X clock in the 386 Microprocessor. 
A 25 MHz Intel486 Microprocessor uses a 25 MHz 
clock in contrast to a 25 MHz 386 Microprocessor 
which requires a 50 MHz clock. A 1X clock allows 
simpler system design by cutting in half the clock 
speed required in the external system. 
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Like the 386 Microprocessor, the Intel486 Micro- 
processor has separate parallel busses for data and 
addresses. The bidirectonal data bus is 32 bits in 
width. The address bus consists of two components: 
30 address lines (A2-A31) and 4 byte enable lines 
(BEO#-BE3#). 
The address bus addresses exter- 
nal memory in the same manner as the 386 Micro- 
processor: The address lines form the upper 30 bits 
of the address and the byte enables select individual 
bytes within a 4 byte location. The address lines are 
bidirectional for use in cache line invalidations. 


The Intel486 Microprocessor's burst bus mechanism 
enables high-speed cache fills from external memo- 
ry. Burst cycles can strobe data into the processor at 
a rate of one item every clock. Non-burst cycles 
have a maximum rate of one item every two clocks. 
Burst cycles are not limited to cache fills: all bus 
cycles requiring more than a single data cycle can 
be bursted. 
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The Intel486 Microprocessor has a bus hold feature 
similar to that of the 386 Microprocessor. Ouringbus 
hold, the Intel486 Microprocessor relinquishes con- 
trol of the local bus by floating its address, data and 
control busses. 


The Intel486 Microprocessor has an address hold 
feature in addition to bus hold. Ouring address hold 
only the address bus is floated, the data and control 
busses can remain active. Address hold is used for 
cache line invalidations. 


Ahead is a brief description of the Intel486 Micro- 
processor input and output signals arranged by func- 
tional groups. Before beginning the signal descrip- 
tions a few terms need to be defined. The # symbol 
at the end of a signal name indicates the active, or 
asserted, state occurs when the signal is at a low 
voltage. When a # is not present after the signal 
name, the signal is active at the high voltage level. 
The term "ready" is used to indicate that the cycle is 
terminated with ROY# or BROY#. 


Section 6 and 7 will discuss bus cycles and data 
cycles. A bus cycle is at least two clocks long and 
begins with AOS# active in the first clock and ready 
active in the last clock. Oata is transferred to or from 
the Intel486 Microprocessor during a data cycle. A 
bus cycle contains one or more data cycles. 


ClK provides the fundamental timing and the inter- 
nal operating frequency for the Intel486 Microproc- 
essor. All external timing parameters are specified 
with respect to the rising edge of ClK. 


The Intel486 Microprocessor can operate over a 
wide frequency range but ClK's 
frequency cannot 
change rapidly while RESET is inactive. ClK's fre- 
quency must be stable for proper chip operation 
since a single edge of ClK is used internally to gen- 
erate two phases. ClK 
only needs TIl 
levels for 
proper operation. Figure 6.2 illustrates the ClK 
waveform. 


6.2.2 
ADDRESS BUS (A31-A2, BEO#-BE3#) 


A31-A2 
and BEO#-BE3# 
form the address bus 
and provide physical memory and I/O port address- 
es. The Intel486 Microprocessor is capable of ad- 
dressing 4 gigabytes of physical memory space 
(OOOOOOOOH 
through FFFFFFFFH), and 64 Kbytes 
of 
I/O 
address 
space 
(OOOOOOOOHthrough 2 


OOOOFFFFH).A31-A2 identify addresses to a 4-byte 
location. BEO#-BE3# 
identify which bytes within 
the 4-byte location are involved in the current trans- 
fer. 


Addresses are driven back into the Intel486 Micro- 
processor over A31-A4 
during cache liRe invalida- 
tions. The address lines are active HIGH. When 
used as inputs into the processor, A31-A4 
must 
meet the setup and hold times, t22 and t23. A31-A2 
are not driven during bus or address hold. 


The byte enable outputs, BEO#-BE3#, 
determine 
which bytes must be driven valid for read and write 
cycles to external memory. 


BE3# applies to 024-031 
BE2# applies to 016-023 
BE1# applies to 08-015 
BEO# applies to 00-07 


Ix = input setup times 
ty = input hold times, 
output 
float, valid and hold times 


BEO#-BE3# 
can be decoded to generate AO, A1 
and BHE# signals used in 8- and 16-bit systems 
(see Table 7.5). BEO#-BE3# 
are active LOW and 
are not driven during bus hold. 


The bidirectional lines, 031-00, 
form the data bus 
for the Intel486 Microprocessor. 00-07 
define the 
least significant byte and 024-031 
the most signifi- 
cant byte. Data transfers to 8- or 16-bit devices is 
possible using the data bus sizing feature controlled 
by the Bs8# or Bs16# input pins. 


031-00 
are active HIGH. For reads, 031-00 
must 
meet the setup and hold times, t22 and t23. 031-00 
are not driven during read cycles and bus hold. 


6.2.4 
PARITY 


Data Parity Input/Outputs (DPO-DP3) 


OPO-OP3 are the data parity pins for the processor. 
There is one pin for each byte of the data bus. Even 
parity is generated or checked by the parity genera- 
torsi checkers. Even parity means that there are an 
even number of HIGH inputs on the eight corre- 
sponding data bus pins and parity pin. 


Data parity is generated on all write data cycles with 
the same timing as the data driven by the Intel486 
Microprocessor. Even parity information must be 
driven back to the Intel486 Microprocessor on these 
pins with the same timing as read information to in- 
sure that the correct parity check status is indicated 
by the Intel486 Microprocessor. 


The values read on these pins do not affect program 
execution. It is the responsibility of the system to 
take appropriate actions if a parity error occurs. 


Input signals on DPO-OP3 must meet setup and 
hold times t22 and t23 for proper operation. 


Parity status is driven on the PCHK# pin, and a pari- 
ty error is indicated by this pin being LOW. PCHK# 
is driven the clock after ready for read operations to 
indicate the parity status for the data sampled at the 
end of the previous clock. Parity is checked during 
code reads, memory reads and 1/0 reads. Parity is 
not checked during interrupt acknowledge cycles. 
PCHK# only checks the parity status for enabled 
bytes as indicated by the byte enable and bus size 
signals. It is valid only in the clock immediately after 
read data is returned to the Intel486 microprocessor. 
At all other times it is inactive (HIGH). PCHK# is 
never floated. 


Driving PCHK# is the only effect that bad input pari- 
ty has on the Intel486 Microprocessor. The Intel486 
Microprocessor will not vector to a bus error inter- 
rupt when bad data parity is returned. In systems 
that will not employ parity, PCHK# can be ignored. 
In systems not using parity, OPO-OP3 should be 
connected to Vcc through a pullup resistor. 


6.2.5 BUS CYCLE DEFINITION 


M/IO#, D/C#, 
W/R# Outputs 


M/IO#, 
O/C# 
and W/R# are the primary bus cycle 
definition signals. They are driven valid as the ADS# 
signal is asserted. M/IO# 
distinguishes between 
memory and 1/0 
cycles, O/C# 
distinguishes be- 


tween data and control cycles and W/R# 
distin- 


guishes between write and read cycles. 


Bus cycle definitions as a function of M/IO#, 
O/C# 


and W/R# 
are given in Table 6.1. Note there is a 
difference between the Intel486 Microprocessor and 
386 Microprocessor bus cycle definitions. The halt 
bus cycle type has been moved to location 001 in 
the Intel486 Microprocessor from location 101 in the 
386 Microprocessor. Location 101 is now reserved 
and will never be generated by the Intel486 Micro- 
processor. 


MilO # 
D/C# 
W/R# 
Bus Cycle Initiated 


0 
0 
0 
InterruptAcknowledge 
0 
0 
1 
Halt/SpecialCycle 
0 
1 
0 
1/0 Read 
0 
1 
1 
1/0 Write 
1 
0 
0 
CodeRead 
1 
0 
1 
Reserved 
1 
1 
0 
MemoryRead 
1 
1 
1 
MemoryWrite 


LOCK# indicates that the Intel486 Microprocessor 
is running a read-modify-write cycle where the eX1er- 
nal bus must not be relinquished between the read 
and write cycles. Read-modify-write cycles are used 
to implement memory-based semaphores. Multiple 
reads or writes can be locked. 


When LOCK# is asserted, the current bus cycle is 
locked and the Intel486 Microprocessor should be 
allowed 
exclusive 
access 
to 
the 
system 
bus. 


LOCK# goes active in the first clock of the first 
locked bus cycle and goes inactive after ready is 
returned indicating the last locked bus cycle. 


The Intel486 Microprocessor will not acknowledge 
bus hold when LOCK'* is asserted (though it will 
allow an address hold). LOCK,* is active LOW and 
is floated during bus hold. Locked read cycles will 
not be transformed into cache fill cycles if KEN'* is 
returned active. Refer to Section 7.2.6 for a detailed 
discussion of Locked bus cycles. 


The pseudo-lock feature allows atomic reads and 
writes of memory operands greater than 32 bits. 
These operands require more than one cycle to 
transfer. 
The 
Intel486 
Microprocessor 
asserts 
PLOCK'* during floating point long reads and writes 
(64 bits), segment table descriptor reads (64 bits) 
and cache line fills (128 bits). 


When PLOCK,* is asserted no other maste~will be 
given control of the bus between cycles. A bus hold 
request (HOLD) is not acknowledged during pseudo- 
locked reads and writes, with one exception. During 
non-cacheable non-bursted code prefetches, HOLD 
is recognized on memory cycle boundaries even 
though PLOCK,* is asserted. The Intel486 Micro- 
processor will drive PLOCK,* active until the ad- 
dresses for the last bus cycle of the transaction 
have been driven regardless of whether BRDY,* or 
RDY,* are returned. 


A pseudo-locked transfer is meaningful only if the 
memory operand is aligned and if its completely con- 
tained within a single cache line. A 64-bit floating 
point number must be aligned to an 8-byte boundary 
to guarantee an atomic access. 


Normally PLOCK,* and BLAST,* 
are inverse of 
each other. However during the first cycle of a 64-bit 
floating point write, both PLOCK'* and BLAST'* will 
be asserted. 


Since PLOCK,* is a function of the bus size and 
KEN,* inputs, PLOCK,* should be sampled only in 
the clock ready is returned. This pin is active LOW 
and is not driven during bus hold. Refer to Section 
7.2.7 for a detailed discussion of pseudo-locked bus 
cycles. 


The bus control signals allow the processor to indi- 
cate when a bus cycle has begun, and allow other 
system hardware to control burst cycles, data bus 
width and bus cycle termination. 


The ADS,* output indicates that the address and 
bus cycle definition signals are valid. This signal will 


go active in the first clock of a bus cycle and go 
inactive in the second and subsequent clocks of the 
cycle. ADS,* is also inactive when the bus is.idle. 


ADS,* is used by external bus circuitry as the indica- 
tion that the processor has started a bus cycle. The 
external circuit must sample the bus cycle definition 
pins on the next rising edge of the clock after ADS'* 
is driven active. 


ADS,* is active LOW and is not driven during bus 
hold. 


RDY,* indicates that the current bus cycle is com- 
plete. In response to a read, RDY,* indicates that • 
the external system has presented valid data on the 
data pins. In response to a write request, RDY'* indi- 
cates that the external system has accepted the In- 
tel486 microprocessor data. RDY,* is ignored when 
the bus is idle and at the end of the first clock of the 
bus cycle. Since RDY,* is sampled during address 
hold, data can be returned to the processor when 
AHOLD is active. 


RDY,* is active LOW, and is not provided with an 
internal pullup resistor. This input must satisfy setup 
and hold times t16 and t17for proper chip operation. 


6.2.7 
BURST 
CONTROL 


Burst Ready 
Input (BRDY,*) 


BRDY,* performs the same function during a burst 
cycle that RDY,* performs during a non-burst cycle. 
BRDY,* indicates that the external system has pre- 
sented valid data on the data pins in response to a 
read or that the external system has accepted the 
Intel486 Microprocessor data in response to a write. 
BRDY,* is ignored when the bus is idle and at the 
end of the first clock in a bus cycle. 


During a burst cycle, BRDY,* will be sampled each 
clock, and if active, the data presented on the data 
bus pins will be strobed into the Intel486 Microproc- 
essor. ADS,* is negated during the second through 
last data cycles in the burst, but address lines A2- 
A3 and byte enables will change to reflect the next 
data item expected by the Intel486 Microprocessor. 


If RDY,* is returned simultaneously with BRDY,*, 
BRDY,* is ignored and the burst cycle is premature- 
ly aborted. An additional complete bus cycle will be 
initiated after an aborted burst cycle if the cache line 
fill was not complete. BRDY'* is treated as a normal 
ready for the last data cycle in a burst transfer or for 
non-burstable cycles. Refer to Section 7.2.2 for 
burst cycle timing. 


BRDY# is active LOW and is provided with,a small 
internal pullup resistor. BRDY# must satisfy the set- 
up and hold times t16 and t17. 


Burst Last Output 
(BLAST #) 


BLAST# indicates that the next time BRDY# is re- 
turned it will be treated as a normal RDY#, terminat- 
ing the line fill or other multiple-data-cycle transfer. 
BLAST# is active for all bus cycles regardless of 
whether they are cacheable or not. This pin is active 
LOW and is not driven during bus hold. 


6.2.8 
INTERRUPT 
SIGNALS 
(RESET, 
INTR, 
NMI) 


The interrupt signals can interrupt or suspend exe- 
cution of the processor's current instruction stream. 


Reset 
Input (RESET) 


RESET forces the Intel486 Microprocessor to begin 
execution at a known state. For a power-up (cold 
start) reset, Vcc and CLK must reach their proper 
DC and AC specifications for at least 1 ms before 
the Intel486 Microprocessor begins instruction exe- 
cution. The RESET pin should remain active during 
this time to ensure proper Intel486 Microprocessor 
operation. 
However, for 
a warm boot-up case, 
RESET is required to remain active for a minimum of 
15 clocks. The testability operating modes are pro- 
grammed by the falling (inactive going) edge of 
RESET. (Refer to Section 8.0 for a description of the 
test modes during reset.) 


Maskable 
Interrupt 
Request 
Input (INTR) 


INTR indicates that an external interrupt has been 
generated. Interrupt processing is initiated if the IF 
flag is active in the EFLAGS register. 


The 
Intel486 
Microprocessor 
will 
generate 
two 
locked 
interrupt acknowledge bus cycles in re- 
sponse to asserting the INTR pin. An 8-bit interrupt 
number will be latched from an external interrupt 
controller at the end of the second interrupt ac- 
knowledge cycle. INTR must remain active until the 
interrupt acknowledges have been performed to as- 
sure program interruption. Refer to Section 7.2.10 
for a detailed discussion of interrupt acknowledge 
cycles. 


The INTR pin is active HIGH and is not provided with 
an internal pulldown resistor. INTR is asynchronous, 
but the INTR setup and hold times, t20 and t21, must 
be met to assure recognition on any specific clock. 


Non-maskable 
Interrupt 
Request 
Input (NMI) 


NMI is the non-maskable interrupt request signal. 
Asserting NMI causes an interrupt with an internally 
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supplied vector value of 2. External interrupt ac- 
knowledge cycles are not generated since the NMI 
interrupt vector is internally generated. When NMI 
processing begins, the NMI signal will be masked 
internally until the IRET instruction is executed. 


NMI is rising edge sensitive after internal synchroni- 
zation. NMI must be held LOW for at least four CLK 
periods before this rising edge for proper operation. 
NMI is not provided with an internal pulldown resis- 
tor. NMI is asynchronous but setup and hold times, 
t20and t21 must be met to assure recognition on any 
specific clock. 


This section describes the mechanism by which the 
processor relinquishes control of its local bus when 
requested by another bus master. 


The Intel486 Microprocessor asserts BREa when- 
ever a bus cycle is pending internally. Thus, BREa is 
always asserted in the first clock of a bus cycle, 
along with ADS#. Furthermore, if the Intel486 Mi- 
croprocessor is currently not driving the bus (due to 
HOLD, AHOLD, or BOFF#), BREa is asserted in 
the same clock that ADS# would have been assert- 
ed if the processor were driving the bus. After the 
first clock of the bus cycle, BREa may change state. 
It will be asserted if additional cycles are necessary 
to complete a transfer (via BS8#, BS16#, KEN#), 
or if more cycles are pending internally. However, if 
no additional cycles are necessary to complete the 
current transfer, BREa can be negated before ready 
comes back for the current cycle. External logic can 
use the BREa signal to arbitrate among multiple 
processors. This pin is driven regardless of the state 
of bus hold or address hold. BREa is active HIGH 
and is never floated. During a hold state, internal 
events may cause BREa to be deasserted prior to 
any bus cycles. 


HOLD allows another bus master complete control 
of the Intel486 Microprocessor bus. The Intel486 Mi- 
croprocessor will respond to an active HOLD signal 
by asserting HLDA and placing most of its output 
and input! output pins in a high impedance state 
(floated) after completing its current bus cycle, burst 
cycle, or sequence of locked cycles. In addition, if 
the Intel486 CPU receives a HOLD request while 
performing a code fetch, and that cycle is backed off 
(BOFF#), the Intel486 CPU will recognize HOLD be- 
fore restarting the cycle. The BREa, HLDA, PCHK# 
and FERR# pins are not floated during bus hold. 
The Intel486 Microprocessor will maintain its bus in 
this state until the HOLD is deasserted. Refer to 


Section 7.2.9 for timing diagrams for bus hold cycles 
and HOLD request acknowledge during BOFF# . 


Unlike the 386 Microprocessor, the Intel486 Micro- 
processor will recognize HOLD during reset. Pullup 
resistors are not provided for the outputs that are 
floated in response to HOLD. HOLD is active HIGH 
and is not provided with an internal pulldown resis- 
tor. HOLD must satisfy setup and hold times t18 and 
t19 for proper chip operation. 


HLDA indicates that the Intel486 Microprocessor 
has given the bus to another local bus master. HLDA 
goes active in response to a hold request presented 
on the HOLD pin. HLDA is driven active in the same 
clock that the Intel486 Microprocessor floats its bus. 


HLDA will be driven inactive when leaving bus hold 
and the Intel486 Microprocessor will resume driving 
the bus. The Intel486 Microprocessor will not cease 
internal activity during bus hold since the internal 
cache will satisfy the majority of bus requests. HLDA 
is active HIGH and remains driven during bus hold. 


Asserting the BOFF# input forces the Intel486 Mi- 
croprocessor to release control of its bus in the next 
clock. The pins floated are exactly the same as in 
response to HOLD. The response to BOFFII differs 
from the response to HOLD in two ways: First, the 
bus is floated immediately in response to BOFFII 
while the Intel486 Microprocessor completes the 
current bus cycle before floating its bus in response 
to HOLD. Second the Intel486 does not assert 
HLDA in response to BOFFII. 


The processor remains in bus hold until BOFFII is 
negated. Upon negation, the Intel486 Microproces- 
sor restarts the bus cycle aborted when BOFFII was 
asserted. To the internal execution engine the effect 
of BOFFII is the same as inserting a few wait states 
to the original cycle. Refer to Section 7.2.12 for a 
description of bus cycle restart. 


Any data returned to the processor while BOFFII is 
asserted is ignored. BOFF# has higher priority than 
ROYII or BRDYII. If both BOFFII and ready are 
returned in the same clock, BOFFII takes effect. If 
BOFFII is asserted while the bus is idle, the Intel486 
Microprocessor will float its bus in the next clock. 
BOFFII is active LOW and must meet setup and 
hold times t18 and t19 for proper chip operation. 


6.2.10 CACHE INVALIDATION 


The AHOLD and EADSII 
inputs are used during 
cache invalidation cycles. AHOLD conditions the In- 


tel486 Microprocessors address lines, A4-A31, 
to 
accept an address input. EADSII indicates that an 
external address is actually valid on the address in- 
puts. Activating EADSII will cause the Intel486 Mi- 
croprocessor to read the external address bus and 
perform an internal cache invalidation cycle to the 
address indicated. Refer to Section 7.2.8 for cache 
invalidation cycle timing. 


AHOLD is the address hold request. It allows anoth- 
er bus master access to the Intel486 Microproces- 
sor address bus for performing an internal cache in- 
validation cycle. Asserting AHOLD will force the 
Intel486 Microprocessor to stop driving its address 
bus in the next clock. While AHOLD is active only 
the address bus will be floated, the remainder of the 
bus can remain active. For example, data can be 
returned for a previously specified bus cycle when 
AHOLD is active. The Intel486 Microprocessor will 
not initiate another bus cycle during address hold. 
Since the Intel486 Microprocessor floats its bus im- 
mediately in response to AHOLD, an address hold 
acknowledge is not required. If AHOLD is asserted 
while a bus cycle is in progress, and no readies are 
returned during the time AHOLD is asserted, the In- 
tel486 will redrive the same address (that it originally 
sent out) once AHOLD is negated. 


AHOLD is recognized during reset. Since the entire 
cache is invalidated by reset, any invalidation cycles 
run during reset will be unnecessary. AHOLD is ac- 
tive HIGH and is provided with a small internal pull- 
down resistor. It must satisfy the setup and hold 
times t18 and t19 for proper chip operation. This pin 
determines whether or not the built in self test fea- 
tures of the Intel486 Microprocessor will be exer- 
cised on assertion of RESET. 


EADSII indicates that a valid external address has 
been driven onto the Intel486 address pins. This ad- 
dress will be used to perform an internal cache inval- 
idation cycle. The external address will be checked 
with the current cache contents. If the address spec- 
ified matches any areas in the cache, that area will 
immediately be invalidated. 


An invalidation cycle may be run by asserting 
EADSII regardless of the state of AHOLD, HOLD 
and BOFFII. EADSII is active LOW and is provided 
with an internal pullup resistor. EADSII must satisfy 
the setup and hold times t12 and t13 for proper chip 
operation. 
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6.2.11 
CACHE 
CONTROL 


Cache 
Enable 
Input 
(KEN # ) 


KEN # 
is the cache 
enable 
pin. KEN # 
is used 
to 
determine 
whether 
the data 
being 
returned 
by the 
current 
cycle 
is cacheable. 
When 
KEN # 
is active 
and the Intel486 
Microprocessor 
generates 
a cycle 
that can be cached 
(most any memory 
read cycle), 
the 
cycle 
will be transformed 
into a cache 
line fill 
cycle. 


A cache line is 16 bytes long. During the first cycle of 
a cache 
line fill the byte-enable 
pins should 
be ig- 
nored and data should be returned 
as if all four byte 
enables 
were 
asserted. 
The 
Intel486 
Microproces- 
sor will run between 
4 and 16 contiguous 
bus cycles 
to fill the line depending 
on the bus data width 
se- 
lected by BS8# 
and BS16#. 
Refer to Section 
7.2.3 
for a description 
of cache 
line fill cycles. 


The KEN # input is active 
LOW and is provided 
with 
a small 
internal 
pullup 
resistor. 
It must 
satisfy 
the 
setup and hold times t14 and t15 for proper 
chip op- 
eration. 


The FLUSH # input forces the Intel486 
Microproces- 
sor to flush its entire internal 
cache. 
FLUSH # is ac- 
tive LOW and need only be asserted 
for one clock. 


FLUSH # is asynchronous 
but setup and hold times 
t20 and t21 must be met for recognition 
on any spe- 
cific clock. 


FLUSH # also determines 
whether 
or not the tristate 
test mode of the Intel486 
Microprocessor 
will be in- 
voked 
on assertion 
of RESET. 


6.2.12 
PAGE CACHEABILITY 
(PWT, 
PCD) 


The PWT and PCD output signals correspond 
to two 
user attribute 
bits in the page table entry. When pag- 
ing is enabled, 
PWT and PCD correspond 
to bits 3 
and 4 of the page table 
entry 
respectively. 
For cy- 
cles that are not paged when paging is enabled 
(for 
example 
I/O 
cycles) 
PWT and PCD correspond 
to 
bits 3 and 4 in control 
register 
3. When 
paging 
is 
disabled, 
the 
Intel486 
CPU 
ignores 
the 
PCD 
and 
PWT bits and assumes they are zero for the purpose 
of caching 
'and driving 
PCD and PWT. 


PCD is masked 
by the CD (cache disable) 
bit in con- 
trol register 
0 (CRO). When 
CD= 
1 (cache 
line fills 
disabled) 
the 
Intel486 
Microprocessor 
forces 
PCD 
HIGH. When CD = 0, PCD is driven with the value of 
the page table entry/directory. 


The purpose 
of PCD is to provide 
a cacheable/non- 
cacheable 
indication 
on a page by page basis. The 
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Intel486 
will not perform 
a cache 
fill to any page in 
which bit 4 of the page table entry is set. PWT corre- 
sponds 
to the write-back 
bit and can be used by an 


external 
cache to provide this functionality. 
PCD and 
PWT bits are assigned 
to be zero during 
real mode 
or whenever 
paging 
is disabled. 
Refer 
to Sections 
4.5.4 
and 
5.6 
for 
a discussion 
of 
non-cacheable 
pages. 


PCD and PWT have the same 
timing 
as the cycle 
definition 
pins 
(M/IO#, 
D/C#, 
W/R#). 
PCD 
and 
PWT are active HIGH and are not driven during bus 
hold. 


6.2.13 
NUMERIC 
ERROR 
REPORTING 
(FERR #, IGNNE #) 


To allow 
PC-type 
floating 
point 
error 
reporting, 
the 
Intel486 
Microprocessor 
provides 
two pins, FERR # 
and IGNNE#. 


The Intel486 
Microprocessor 
asserts 
FERR # when- 


ever 
an unmasked 
floating 
point 
error 
is encoun- 
tered. 
FERR# 
is similar to the ERROR# 
pin on the 
387 Math Coprocessor. 
FERR # can be used by ex- 


ternal logic for PC-type floating 
point error reporting 
in Intel486 
Microprocessor 
systems. 
FERR# 
is ac- 


tive LOW, and is not floated 
during bus hold. 


In some 
cases, 
FERR # is asserted 
when 
the next 
floating 
point instruction 
is encountered 
and in other 
cases 
it is asserted 
before 
the 
next 
floating 
point 
instruction 
is encountered 
depending 
upon the exe- 
cution state of the instruction 
causing 
the exception. 


The following 
class of floating 
point exceptions 
drive 
FERR # at the time the exception 
occurs (Le., before 
encountering 
the next floating 
point instruction). 


1. The stack 
fault, 
invalid 
operation, 
and denormal 
exceptions 
on all transcendental 
instructions, 
in- 
teger 
arithmetic 
instructions, 
FSQRT, 
FSCALE, 


FPREM(1), 
FXTRACT, 
FBLD, and FBSTP. 


2. Any 
exceptions 
on store 
instructions 
(including 
integer 
store instructions). 


The following 
class of floating 
point exceptions 
drive 
FERR # 
only 
after 
encountering 
the 
next 
floating 
point instruction. 


1. Exceptions 
other 
than 
on all transcendental 
in- 
structions, 
integer 
arithmetic 
instructions, 


FSQRT, 
FSCALE, 
FPREM(1), 
FXTRACT, 
FBLD, 


and FBSTP. 


2. Any exception 
on all basic arithmetic, 
load, com- 
pare, 
and 
control 
instructions 
(Le., all other 
in- 
structions). 


The Intel486 Microprocessor will ignore a numeric 
error and continue executing non-control floating 
point instructions when IGNNE# 
is asserted, but 
FERR# will still be activated. When deasserted, the 
Intel486 Microprocessor will freeze on a non-control 
floating point instruction if a previous instruction 
caused an error. IGNNE# has no effect when the 
NE bit in control register 0 is set. 


The IGNNE# input is active LOW and is provided 
with a small internal pullup resistor. This input is 
asynchronous, but must meet setup and hold times 
t20 and t21 to insure recognition on any specific 
clock. 


6.2.14 BUS SIZE CONTROL (BS16#, BS8#) 


The 8S16# and 8S8# inputs allow external 16- and 
8-bit busses to be supported with a small number of 
external components. The Intel486 CPU samples 
these pins every clock. The value sampled in the 
clock before ready determines the bus size. When 
asserting 8S16# or 8S8# only 16 or 8 bits of the 
data bus need be valid. If both 8S16# and 8S8# 
are asserted, an 8-bit bus width is selected. 


When 8S16# 
or 8S8# 
are asserted the Intel486 
Microprocessor will convert a larger data request to 
the appropriate number of smaller transfers. The 
byte enables will also be modified appropriately for 
the bus size selected. 


8S16# and 8S8# are active LOW and are provided 
with small internal pullup resistors. 8S16# 
and 
8S8# must satisfy the setup and hold times t14 and 
t15 for proper chip operation. 


6.2.15 ADDRESS BIT 20 MASK (A20M#) 


Asserting the A20M# input causes the Intel486 Mi- 
croprocessor to mask physical address bit 20 before 
performing a lookup in the internal cache and before 
driving a memory cycle to the outside world. When 
A20M# 
is asserted, the Intel486 Microprocessor 
emulates the 1 Mbyte address wraparound that oc- 
curs on the 8086. A20M# is active LOW and must 
be asserted only when the processor is in real 
mode. The A20M # 
is not defined in Protected 
Mode. A20M # is asynchronous but should meet 
setup and hold times t20 and t21 for recognition in 
any specific clock. For correct operation of the chip, 
A20M# should be sampled high 2 clocks before and 
2 clocks after RESET goes low. When A20M# is 
asserted synchronously, A20M# 
should be high 
(non-active) at the clock prior to the falling edge of 
RESET. A20M# exhibits a minimum 4 clock latency, 
from time of assertion to masking of the A20 bit. 
A20M # is ignored during cache invalidation cycles. 
I/O writes require A20M # to be asserted a minimum 


of 2 clocks prior to ROY being returned for the I/O 
write. This insures recognition of the address mask 
before the i486 SX Microprocessor/Intel OverDrive 
Processor begins execution of the instruction follow- 
ing OUT. If A20M# is asserted after the AOS# of a 
data cycle, the A20 address signal is not masked 
during this cycle but is masked in the next cycle. 
During a prefetch (cacheable or not), if A20M # is 
asserted after the first ADS#, A20 is not masked for 
the duration of the prefetch; even if 8S16# or 8S8# 
is asserted. 


The following boundary scan test signals are only 
available on the 50 MHz version of the Intel486 
CPU. 


Test Clock (TCK) 


TCK is an input to the Intel486 CPU and provides 
the clocking function required by the JTAG boundary 
scan feature. TCK is used to clock state information 
and data into and out of the component. State select 
information and data are clocked into the compo- 
nent on the rising edge of TCK on TMS and TOI, 
respectively. Data is clocked out of the part on the 
falling edge of TCK on TOO. 


In addition to using TCK as a free running clock, it 
may be stopped in a low, 0, state, indefinitely as 
described in IEEE 1149.1. While TCK is stopped in 
the low state, the boundary scan latches retain their 
state. 


When boundary scan is not used, TCK should be 
tied high or left as a NC (This is important during 
power up to avoid the possibility of glitches on the 
TCK which could prematurely initiate boundary scan 
operations). TCK is supplied with an internal pullup 
resistor. 


TCK is a clock signal and is used as a reference for 
sampling other JTAG signals. On the rising edge of 
TCK, TMS and TOI are sampled. On the falling edge 
of TCK, TOO is driven. 


Test Mode Select (TMS) 


TMS is decoded by the JTAG TAP (Tap Access 
Port) to select the operation of the test logic, as de- 
scribed in Section 8.5.4. 


To guarantee deterministic behavior of the TAP con- 
troller, TMS is provided with an internal pull-up resis- 
tor. If boundary scan is not used, TMS may be tied 
high or left unconnected. TMS is sampled on the 
rising edge of TCK. TMS is used to select the inter- 
nal TAP states required to load boundary scan in- 
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structions 
to data on TOI. For proper 
initialization 
of 
the JTAG logic, TMS should be driven high, "1", 
for 
at least four TCK cycles following 
the rising edge of 
RESET. 


Test Data Input (TOI) 


TOI is the serial input used to shift JT AG instructions 
and data into the component. 
The shifting of instruc- 
tions 
and 
data 
occurs 
during 
the 
SHIFT-IR 
and 
SHIFT-DR 
controller 
states, 
respectively. 
These 
states 
are 
selected 
using 
the 
TMS 
signal 
as de- 
scribed 
in Section 
8.5.4. 


An internal 
pull-up resistor 
is provided 
on TOI to en- 
sure a known 
logic state if an open circuit occurs on 
the 
TOI path. 
Note 
that 
when 
"1" 
is continuously 
shifted 
into 
the 
instruction 
register, 
the 
BYPASS 
instruction 
is 
selected. 
TOI 
is 
sampled 
on 
the 
rising 
edge 
of TCK, 
during 
the 
SHIFT-IR 
and 
the 
SHIFT-DR 
states. 
During 
all 
other 
TAP 
controller 
states, 
TOI is a "don't 
care". 


Test Data Output 
(TOO) 


TOO is the serial output 
used to shift JTAG 
instruc- 
tions and data out of the component. 
The shifting 
of 
instructions 
and 
data 
occurs 
during 
the 
SHIFT-IR 
and SHIFT-DR 
TAP controller 
states, 
respectively. 


These 
states 
are selected 
using the TMS signal as 
described 
in Section 
8.5.4. When not in SHIFT-IR 
or 
SHIFT-DR 
state, TOO is driven to a high impedance 
state to allow connecting 
TOO of different 
devices 
in 
parallel. 


TOO is driven on the falling edge of TCK during the 
SHIFT-IR 
and SHIFT-DR 
TAP controller 
states. 
At 
all other times TOO is driven to the high impedance 
state. 


The 
Intel486 
Microprocessor 
contains 
four 
write 
buffers 
to enhance 
the performance 
of consecutive 
writes to memory. 
The buffers can be filled at a rate 
of one write per clock until all four buffers 
are filled. 


When all four buffers are empty and the bus is idle, a 
write 
request 
will propagate 
directly 
to the external 
bus bypassing 
the 
write 
buffers. 
If the 
bus is not 
available 
at the time the write is generated 
internally, 
the write will be placed in the write buffers and prop- 
agate to the bus as soon as the bus becomes 
avail- 
able. The write is stored 
in the on-chip 
cache imme- 
diately 
if the write is a cache 
hit. 


Writes 
will be driven 
onto 
the 
external 
bus in the 
same order 
in which 
they are received 
by the write 
buffers. 
Under certain 
conditions 
a memory 
read will 


go onto the external 
bus before 
the memory 
writes 
pending 
in the 
buffer 
even 
though 
the 
writes 
oc- 
curred 
earlier in the program 
execution. 


A memory 
read will only be reordered 
in front 
of all 
writes in the buffers under the following 
conditions: 
If 
all writes 
pending 
in the buffers 
are cache 
hits and 
the read is a cache miss. Under these conditions 
the 
Intel486 
Microprocessor 
will not read from an exter- 
nal memory 
location 
that 
needs 
to be updated 
by 
one of the pending 
writes. 


Reordering 
of a read with the writes 
pending 
in the 
buffers 
can only occur 
once 
before 
all the 
buffers 
are emptied. 
Reordering 
read once 
only 
maintains 
cache consistency. 
Consider 
the following 
example: 


The CPU writes 
to location 
X. Location 
X is in the 
internal 
cache, 
so it is updated 
there 
immediately. 


However, 
the bus is busy so the write 
out to main 
memory 
is buffered 
(see Figure 6.3(a». At this point, 


any reads 
to location 
X would 
be cache 
hits and 
most up-to-date 
data would 
be read. 
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The 
next 
instruction 
causes 
a read 
to location 
Y. 


Location 
Y is not in the cache (a cache 
miss). Since 
the write in the write buffer is a cache hit, the read is 
reordered. 
When location 
Y is read, it is put into the 
cache. 
The possibility 
exists 
that 
location 
Y will re- 


place location 
X in the cache. 
If this is true, location 
X would 
no longer be cached 
(see Figure 6.3(b». 


Cache 
consistency 
has been maintained 
up to this 
point. 
If a subsequent 
read is to location 
X (now a 
cache miss) and it was reordered 
in front of the buff- 
ered write 
to location 
X, stale 
data would 
be read. 


This is why only 1 read is allowed 
to be reordered. 


Once a read is reordered, 
all the writes 
in the write 
buffer are flagged 
as cache misses to ensure that no 
more reads are reordered. 
Since 
one of the condi- 
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tions to reorder 
.a read is that all writes 
in the write 
buffer 
must be cache 
hits, no more reordering 
is al- 
lowed 
until all of those 
flagged 
writes 
propogate 
to 
the bus. Similarly, 
if an invalidation 
cycle 
is run all 
entries 
in the write buffer are flagged 
as cache miss- 
es. 


For multiple 
processor 
systems 
and/or 
systems 
us- 
ing DMA techniques, 
such as bus snooping, 
locked 
semaphores 
should 
be used to maintain 
cache con- 
sistency. 


6.3.1 
WRITE 
BUFFERS 
AND I/O CYCLES 


Input/Output 
(I/O) 
cycles 
must be handled 
in a dif- 
ferent 
manner 
by the write buffers. 


I/O 
reads 
are never 
reordered 
in front 
of buffered 
memory 
writes. This insures that the Intel486 
Micro- 
processor 
will 
update 
all memory 
locations 
before 
reading 
status from an I/O device. 


The 
Intel486 
Microprocessor 
never 
buffers 
single 
I/O writes. When processing 
an OUT instruction, 
in- 
ternal 
execution 
stops 
until 
the 
I/O 
write 
actually 
completes 
on the external 
bus. This allows 
time for 
the external 
system 
to drive 
an invalidate 
into the 
Intel486 
Microprocessor 
or to mask interrupts 
before 
the processor 
progresses 
to the instruction 
following 
OUT. REP OUTS instructions 
will be buffered. 


I/O 
device 
recovery 
time 
must 
be handlec1 slightly 
differently 
by the Intel486 
Microprocessor 
than with 
the 
386 
Microprocessor. 
I/O 
device 
back-to-back 
write recovery 
times could be guaranteed 
by the 386 
Microprocessor 
by inserting 
a jump 
to the next in- 
struction 
in the code that writes 
to the device. 
The 
jump 
forces 
the 386 Microprocessor 
to generate 
a 
prefetch 
bus cycle 
which 
can't 
begin 
until the 
I/O 
write completes. 


Inserting 
a jump to the next write will not work with 
the 
Intel486 
Microprocessor 
because 
the 
prefetch 
could be satisfied 
by the on-chip 
cache. A read cycle 
must be explicitly 
generated 
to a non-cacheable 
lo- 
cation in memory 
to guarantee 
that a read bus cycle 
is performed. 
This read will not be allowed 
to pro- 
ceed to the bus until after the I/O write has complet- 
ed because 
I/O writes are not buffered. 
The I/O de- 
vice will have time to recover to accept another write 
during the read cycle. 


6.3.2 
WRITE 
BUFFERS 
IMPLICATIONS 
ON 
LOCKED 
BUS CYCLES 


Locked 
bus cycles 
are used 
for 
read-modify-write 
accesses 
to memory. 
During a read-modify-write 
ac- 
cess, a memory 
base variable 
is read, modified 
and 
then written 
back to the same memory 
location. 
It is 
important 
that 
r,o other 
bus cycles, 
generated 
by 


other bus masters or by the Intel486 
Microprocessor 
itself, 
be allowed 
on the external 
bus between 
the 
read and write portion 
of the locked 
sequence. 


During a locked 
read cycle 
the Intel486 
Microproc- 
essor 
will 
always 
access 
external 
memory, 
it will 
never look for the location 
in the on-chip 
cache, 
but 
for write cycles, 
data is written 
in the internal 
cache 
(if cache 
hit) and in the external 
memory. 
All data 
pending 
in the Intel486 
Microprocessor's 
write buff- 
ers will be written to memory before a locked cycle is 
allowed 
to proceed 
to the external 
bus. 


The Intel486 
Microprocessor 
will assert the LOCK IF 
pin 
after 
the 
write 
buffers 
are 
emptied 
during 
a 
locked bus cycle. With the LOCK IF pin asserted, 
the 
microprocessor 
will read 
the 
data, 
operate 
on the 
data and place the results in a write buffer. The con- 
• 
tents of the write buffer will then be written 
to exter- 
nal memory. 
LOCKIF 
will become 
inactive 
after the 
write part of the locked 
cycle. 


6.4 
Interrupt and Non-Maskable 
Interrupt 
Interface 


The 
Intel486 
Microprocessor 
provides 
two 
asyn- 
chronous 
interrupt 
inputs, 
INTR 
(interrupt 
request) 
and NMI (non-maskable 
interrupt 
input). This section 
describes 
the 
hardware 
interface 
between 
the 
in- 


struction 
execution 
unit and the pins. For a descrip- 
tion of the algorithmic 
response 
to interrupts 
refer to 
Section 
2.7. 
For interrupt 
timings 
refer 
to Section 
7.2.10. 


The 
Intel486 
Microprocessor 
contains 
a two-clock 
synchronizer 
on the 
interrupt 
line. An interrupt 
re- 
quest 
will 
reach 
the 
internal 
instruction 
execution 
unit two 
clocks 
after 
the 
INTR 
pin is asserted, 
if 
proper setup is provided 
to the first stage of the syn- 


chronizer. 


There 
is no special 
logic in the interrupt 
path other 
than the synchronizer. 
The INTR signal is level sen- 
sitive and must remain active for the instruction 
exe- 
cution 
unit to recognize 
it. The interrupt 
will not be 
serviced 
by the Intel486 
Microprocessor 
if the INTR 
signal does not remain active. 


The instruction 
execution 
unit will look at the state of 
the synchronized 
interrupt 
signal 
at specific 
clocks 
during the execution 
of instructions 
(if interrupts 
are 
enabled). 
These 
specific 
clocks 
are 
at instruction 
boundaries, 
or iteration 
boundaries 
in the 
case 
of 
string 
move 
instructions. 
Interrupts 
will only be ac- 
cepted 
at these 
boundaries. 


An interrupt 
must be presented 
to the Intel486 
Mi- 
croprocessor 
INTR pin three 
clocks 
before 
the end 
of an 
instruction 
for 
the 
interrupt 
to 
be 
acknowl- 


edged. Presenting the interrupt 3 clocks before the 
end of an instruction allows the interrupt to pass 
through the two clock synchronizer leaving one 
clock to prevent the initiation of the next sequential 
instruction and to begin interrupt service. If the inter- 
rupt is not received in time to prevent the next in- 
struction, it will be accepted at the end of next in- 
struction, assuming INTR is still held active. The in- 
terrupt service microcode will start after two dead 
clocks. 


The longest latency between when an interrupt re- 
quest is presented on the INTR pin and when the 
interrupt service begins is: longest instruction used 
+ the two clocks for synchronization + one clock 
required to vector into the interrupt service micro- 
code. 


The NMI pin has a synchronizer like that used on the 
INTR line. Other than the synchronizer, the NMI log- 
ic is different from that of the maskable interrupt. 


NMI is edge triggered as opposed to the level trig- 
gered INTR signal. The rising edge of the NMI signal 
is used to generate the interrupt request. The NMI 
input need not remain active until the interrupt is ac- 
tually serviced. The NMI pin only needs to remain 
active for a single clock if the required setup and 
hold times are met. NMI will operate properly if it is 
held active for an arbitrary number of clocks. 


The NMI input must be held inactive for at least four 
clocks after it is asserted to reset the edge triggered 
logic. A subsequent NMI may not be generated if the 
NMI is not held inactive for at least two clocks after 
being asserted. 


The NMI input is internally masked whenever the 
NMI routine is entered. The NMI input will remain 
masked until an IRET (return from interrupt) instruc- 
tion is executed. Masking the NMI signal prevents 
recursive NMI calls. If another NMI occurs while the 
NMI is masked off, the pending NMI will be executed 
after the current NMI is done. Only one NMI can be 
pending while NMI is masked. 


The Intel486 OX Microprocessor has a built in self 
test (BIST) that can be run during reset. The BIST is 
invoked if the AHOLO pin is asserted for 2 clocks 
before and 2 clocks after RESET is deasserted. 
RESET must be active for 15 clocks with or without 
BIST enabled. To ensure proper results. FLUSH*" 
must not be asserted while BIST is executing. Refer 
to Section 8.0 for information on Intel486 OX Micro- 
processor testability. 


The Intel486 Microprocessor registers have the val- 
ues shown in Table 6.2 after RESET is performed. 
The EAX register contains information on the suc- 
cess or failure of the BIST if the self test is executed. 
The OX register always contains a component iden- 
tifier at the conclusion of RESET. The upper byte of 
OX (OH) will contain 04 and the lower byte (OL) will 
contain a stepping identifier (see Table 6-3). The 
floating point registers are initialized as if the FINIT/ 
FNINIT (initialize processor) instruction was execut- 
ed if the BIST was performed. If the BIST is not exe- 
cuted, the floating point registers are unchanged. 


Register 
Initial 
Value 
Initial 
Value 


(BISn 
(No Bist) 


EAX 
Zero (Pass) 
Undefined 


ECX 
Undefined 
Undefined 


EoX 
0400 + Revision 
10 
0400 + Revision 
10 


EBX 
Undefined 
Undefined 


ESP 
Undefined 
Undefined 


EBP 
Undefined 
Undefined 


ESI 
Undefined 
Undefined 


Eol 
Undefined 
Undefined 


EFLAGS 
00OOOOO2h 
00000002h 


EIP 
OFFFOh 
OFFFOh 


ES 
OOOOh 
OOOOh 


CS 
FOOOh' 
FOOOh' 


SS 
OOOOh 
OOOOh 


OS 
OOOOh 
OOOOh 


FS 
OOOOh 
OOOOh 


GS 
OOOOh 
OOOOh 


10TR 
Base = O. Limit = 3FFh 
Base=O. 
Limit=3FFh 


CRO 
60000010h 
60000010h 


oR7 
OOOOOOOOh 
OOOOOOOOh 


CW 
037Fh 
Unchanged 


SW 
OOOOh 
Unchanged 


TW 
f'FFFh 
Unchanged 


FIP 
OOOOOOOOh 
Unchanged 


FEA 
OOOOOOOOh 
Unchanged 


FCS 
OOOOh 
Unchanged 


FoS 
OOOOh 
Unchanged 


FOP 
OOOh 
Unchanged 


FSTACK 
Undefined 
Unchanged 


inteJ~ 


Intel486TM CPU 
Component 
Revision 
Stepping Name 
10 
10 


B3 
04 
01 


B4 
04 
01 


B5 
04 
01 


B6 
04 
01 


CO 
04 
02 


C1 
04 
03 


DO 
04 
04 


cA2 
04 
10 


cA3 
04 
10 


cBO 
04 
11 


cB1 
04 
11 


IntelOverOrlve™ 
Processor 
Stepping Name 


A2 
04 
32 


B1 
04 
33 


The Intel486 Microprocessor will start executing in- 
structions at location FFFFFFFOH after RESET. 
When the first InterSegment Jump or Call is execut- 
ed, address lines A20-A31 will drop LOW for CS-rel- 
ative memory cycles, and the Intel486 Microproces- 
sor will only execute instructions in the lower one 
Mbyte of physical memory. This allows the system 
designer to use a ROM at the top of physical memo- 
ry to initialize the system and take care of RESETs. 


RESET forces the Intel486 Microprocessor to termi- 
nate all execution and local bus activity. No instruc- 
tion or bus activity will occur as long as RESET is 
active. 


The Intel486 Microprocessor recognizes and can re- 
spond to HOLD, AHOLD, and BOFFII requests re- 
gardless of the state of RESET. Thus, even though 
the processor is in reset, it can still float its bus in 
response to any of these requests. 


While in reset, the Intel486 Microprocessor bus is in 
the state shown in Figure 6.4 if the HOLD, AHOLD 
and BOFFII requests are inactive. Note that the ad- 
dress (A31-A2, BE3II -BEO II) and cycle definition 
(M/IOI1, 
D/CI1, 
W/RI1) 
pins are undefined from the 
time reset is asserted up to the start of the first bus 
cycle. All undefined pins (except FERRI1) assume 
known values at the beginning of the first bus cycle. 
The first bus cycle is always a code fetch to address 
FFFFFFFOH. 


FERRII reflects the state of the ES (Error Summary 
status) bit in the floating point unit status word. The 
ES bit is initialized whenever the floating point unit 
state is initialized. The floating point unit's status 
word register can be initialized by BIST or by execut- 
ing FINITIFNINIT instruction. Thus, after reset and 
before executing the first FINIT or FNINIT instructon, 
the values of the FERRII and the numeric status 
word register bits 0-7 
depends on whether or not 
BIST is performed. Table 6-4 shows the state ,of 
FERRII signal after reset and before the execution 
of the FINITIFNI NIT instruction. 


BIST 
FERRI1 
FPU Status 


Performed 
Pin 
Word Register 
Bits 0-7 


YES 
Inactive 
Inactive 
(High) 
(Low) 


NO 
Undefined 
Undefined 
(Low or High) 
(Low or High) 


After the first FINIT or FNINIT instruction, FERRII 
pin and the FPU status word register bits (0-7) will 
be inactive irrrespective of the Built-In Self-Test 
(BIST). 
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NOTES: 
1. RESET 
is an asynchronous 
input. t20 must be met only to guarantee 
recognition 
on a specific 
clock 
edge. 


2a. When A20M# 
is driven synchronously, 
it must be driven 
high (inactive) 
for the CLK edge prior to the falling 
edge of RESET to ensure 
proper 
operation. 
A20M# 


setup and hold times 
must be met. 


2b. When 
A20M# 
is driven 
asynchronously, 
it must be driven 
high (inactive) 
for two CLKs prior to and two CLKs after the falling 
edge of RESET 
to ensure 
proper 
operation. 
3a. When 
FLUSH# 
is driven 
synchronously, 
it should 
be driven 
low (active) 
for the CLK edge prior to the falling 
edge of RESET to invoke 
the Tri-State 
Output 
Test 
Mode. All outputs 
are guaranteed 
tri-stated 
within 
10 CLKs of RESET being deasserted. 
FLUSH# 
setup and hold times 
must be met. 


3b. When 
FLUSH# 
is driven 
asynchronously, 
it must be driven 
low (active) 
for two CLKs prior to and two CLKs after the falling 
edge 
of RESET 
to invoke 
the Tri- 
State Output 
Test Mode. 
All outputs 
are guaranteed 
tri-stated 
within 
10 CLKs of RESET being deasserted. 


3c. FLUSH# 
must be driven 
high (inactive) 
during Build-in-Self-Test 
(BIST). 


4. AHOLD 
should 
be driven 
high (active) 
for the CLK edge 
prior to the falling 
edge of RESET to invoke 
the Built-In-Self-Test 
(BIST). AHOLD 
setup 
and hold times 
must be met. 
5. Hold is recognized 
normally 
during 
RESET. 


6. 15 CLKs RESET 
pulse width for warm resets. 
Power-up 
resets 
require 
RESET to be asserted 
for at least 
1 ms after Vcc 
and CLK are stable. 


All data transfers occur as a result of one or more 
bus cycles. logical data operands of byte, word and 
dword lengths may be transferred without restric- 
tions on physical address alignment. Oata may be 
accessed at any byte boundary but two or three cy- 
cles may be required for unaligned data transfers. 
See Section 7.1.3 Dynamic Bus Sizing and 7.1.6 Op- 
erand Alignment. 


The Intel486 Microprocessor address signals are 
split into two components. High-order address bits 
are provided by the address lines, A2-A31. The byte 
enables, BEO#-BE3#, 
form the low-order address 


and provide linear selects for the four bytes of the 
32-bit address bus. 


The byte enable outputs are asserted when their as- 
sociated data bus bytes are involved with the pres- 
ent bus cycle, as listed in Table 7.1. Byte enable 
patterns which have a negated byte enable separat- 
ing two or three asserted byte enables will never 
occur (see Table 7.5). All other byte enable patterns 
are possible. 


Table 7.1. Byte Enables and Associated 
Data and Operand Bytes 


Byte 
Enable 
Associated Data Bus Signals 
Signal 


BEO# 
00-07 
(byte o-Ieast 
significant) 


BE1# 
08-015 
(byte 1) 


BE2# 
016-023 
(byte 2) 


BE3# 
024-031 
(byte 3-most 
significant) 


Address bits AO and A1 of the physical operand's 
base address can be created when necessary. Use 
of the byte enables to create AOand A1 is shown in 
Table 7.2. The byte enables can also be decoded to 
generate BlE # (byte low enable) and BHE# (byte 
high enable). These signals are needed to address 
16-bit memory systems (see Section 7.1.4 Inter- 
facing with 8- and 16-bit memories). 


Table 7.2. Generating AO-A31 from 
BEO#-BE3# 
andA2-A31 


Inte1486T11 
CPU Address 
Signals 


A31 
......... 
A2 
BE3# 
BE2# 
BE 1# 
BEO# 


Physical 
Base 


Address 


A31 
......... A2 
A1 
AO 


A31 
......... 
A2 
0 
0 
X 
X 
X 
Low 


A31 
......... 
A2 
0 
1 
X 
X 
Low 
High 


A31 
......... 
A2 
1 
0 
X 
Low 
High 
High 


A31 
......... A2 
1 
1 
Low 
High 
High 
High 


Bus cycles may access physical memory space or 
I/O space. Peripheral devices in the system may ei- 
ther be memory-mapped, or I/O-mapped, or both. 
Physical memory addresses range from OOOOOOOOH 
to FFFFFFFFH (4 gigabytes). I/O addresses range 
from OOOOOOOOH 
to OOOOFFFFH(64 Kbytes) for pro- 


grammed I/O. See Figure 7.1. 
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7.1.2 
MEMORY 
AND 
I/O 
SPACE 
ORGANIZATION 


The 
Intel486 
Microprocessor 
datapath 
to 
memory 
and 
input/output 
(I/O) 
spaces 
can 
be 32-, 
16- or 
8-bits wide. The byte enable 
signals, 
8EO#-8E3#, 
allow byte granularity 
when addressing 
any memory 
or I/O structure 
whether 
8, 16 or 32 bits wide. 


The 
Intel486 
Microprocessor 
includes 
bus 
control 
pins, 8516# 
and 858#, 
which allow direct connec- 


tion to 16· and 8·bit memories 
and I/O devices. 
Cy- 
cles 
to 
32-, 
16- and 
8-bit 
may 
occur 
in any 
se- 
quence, 
since 
the 
858# 
and 
8516# 
signals 
are 
sampled 
during each bus cycle. 


32-bit 
wide 
memory 
and I/O 
spaces 
are organized 
as arrays of physical 
4-byte words. Each memory 
or 


I/O 
4-byte 
word 
has four 
individually 
addressable 
bytes 
at 
consecutive 
byte 
addresses 
(see 
Figure 
7.2). The lowest 
addressed 
byte is associated 
with 
data 
signals 
00-07; 
the 
highest-addressed 
byte 
with 024-031. 
Physical 
4·byte 
words 
begin 
at ad- 
dresses 
divisible 
by four. 


32-Blt 
Wide Organization 
''''''''''ITID"""",. 


00000003H 
'-' 
• 
'.-J 
oOOOOOOOH 


BE3# 
BE2# 
BE1# 
BEO# 


16·Blt Wide Organization 
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Figure 
7.2. Physical 
Memory 
and I/O Space Organization 


16-bit memories are organized as arrays of physical 
2-byte words. Physical 2-byte words begin at ad- 
dresses divisible by two. The byte enables 8EO#- 
8E3 # , must be decoded to A1, 8LE # and 8HE # to 
address 16-bit memories (see Section 7.1.4). 


To address 8-bit memories, the two low order ad- 
dress bits AOand A1, must be decoded from 8EO# - 
8E3 #. The same logic can be used for 8- and 16-bit 
memories since the decoding logic for 8LE # and AO 
are the same (see Section 7.1.4). 


Dynamic data bus sizing is a feature allowing proc- 
essor connection to 32-, 16- or 8-bit buses for mem- 
ory or I/O. A processor may connect to all three bus 
sizes. Transfers to or from 32-, 16- or 8-bit devices 
are supported by dynamically determining the bus 
width during each bus cycle. Address decoding cir- 
cuitry may assert 8S16# 
for 16-bit devices, or 
8S8# for 8-bit devices during each bus cycle. 8S8# 
and 8S16# must be negated when addressing 32- 
bit devices. An 8-bit bus width is selected if both 
8S 16# and 8S8 # are asserted. 


8S16# and 8S8# force the Intel486 Microproces- 
sor to run additional bus cycles to complete re- 
quests larger than 16- or 8 bits. A 32-bit transfer will 
be converted into two 16-bit transfers (or 3 transfers 
if the data is misaligned) when 8S16# is asserted. 
Asserting 8S8# 
will convert a 32-bit transfer into 
four 8-bit transfers. 


Extra cycles forced by 8S16# or 8S8# should be 
viewed as independent bus cycles. 8S16# or 8S8# 
must be driven active during each of the-extra cycles 
unless the addressed device has the ability to 
change the number of bytes it can return between 
cycles. 


The Intel486 Microprocessor will drive the byte en- 
ables appropriately during extra cycles forced by 
8S8# 
and 8S16#. A2-A31 
will not change if ac- 
cesses are to a 32-bit aligned area. Table 7.3 shows 
the set of byte enables that will be generated on the 
next cycle for each of the valid possibilities of the 
byte enables on the current cycle. 


The dynamic bus sizing feature of the Intel486 Mi- 
croprocessor is significantly different than that of the 
386 Microprocessor. Unlike the 386 Microprocessor, 
the Intel486 Microprocessor requires that data bytes 
be driven on the addressed data pins. The simplest 
example of this function is a 32-bit aligned, 8S16# 
read. When the Intel486 Microprocessor reads the 
two high order bytes, they must be driven on the 
data bus pins 016-031. 
The Intel486 Microproces- 
sor expects the two low order bytes on 00-015. 
The 386 Microprocessor expects both the high and 
low order bytes on 00-015. 
The 386 Microproces- 


sor always reads or writes data on the lower 16 bits 
of the data bus when 8S16# is asserted. 


The external system must contain buffers to enable 
the Intel486 Microprocessor to read and write data 
on the appropriate data bus pins. Table 7.4 shows 
the data bus lines where the Intel486 Microproces- 
sor expects data to be returned for each valid com- 
bination of byte enables and bus,sizing options. 


Valid data will only be driven onto data bus pins cor- 
responding to active byte enables during write cy- 
cles. Other pins in the data bus will be driven but 
they will not contain valid data. Unlike the 386 Micro- 
processor, the Intel486 Microprocessor will not du- 
plicate write data onto parts of the data bus for 
which the corresponding byte enable is negated. 


Current 
Next with BSS # 
Next with BS16# 
BE3# 
BE2# 
BE1# 
BEO# 
BE3# 
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BE1# 
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BE1# 
BEO# 


1 
1 
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n 
n 
n 
n 
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BE3# 
BE2# 
BE1# 
BEG# 
wlo BS8# IBS16# 
wBS8# 
WBS16# 


1 
1 
1 
0 
07-00 
07-00 
07-00 
1 
1 
0 
0 
015-00 
07-00 
015-00 
1 
0 
0 
0 
023-00 
07-00 
015-00 
0 
0 
0 
0 
031-00 
07-00 
015-00 
1 
1 
0 
1 
015-08 
015-08 
015-08 
1 
0 
0 
1 
023-08 
015-08 
015-08 
0 
0 
0 
1 
031-08 
015-08 
015-08 
1 
0 
1 
1 
023-016 
023-016 
023-016 
0 
0 
1 
1 
031-016 
023-016 
031-016 
0 
1 
1 
1 
031-024 
031-024 
031-024 


7.1.4 
INTERFACING 
WITH 8-, 16· AND 32-BIT 
MEMORIES 


In 32-bit physical memories such as Figure 7.3, each 
4-by1eword begins at a by1eaddress that is a multi- 
ple of four. A2-A31 
are used as a 4-by1eword se- 


lect. BEO# -BE3 # select individual by1eswithin the 
4-by1eword. B88# and B816# are negated for all 
bus cycles involving the 32-bit array. 


32, 
DATABUS (00-031) 


Intol486N 
32-BIT 
CPU 
ADDRESSBUS (BEO"-BE3",A2-A31) 
IolE••ORY 


1858" 
l8S 16" 


"HIGH" "HIGH" 
240440-36 


Figure 7.3.lnteI486™ 
Microprocessor 
with 32-Blt Memory 


16- and 8-bit memories require external by1eswap- 
ping logic for routing data to the appropriate data 
lines and logic for generating BHE#, BLE# and A1. 
In systems where mixed memory widths are used, 
extra address decoding logic is necessary to assert 
B816# or B88#. 


Figure 7.4 shows the Intel486 microprocessor ad- 
dress bus interface to 32-, 16- and 8-bit memories. 
To address 16-bit memories the by1eenables must 
be decoded to produce A1, BHE# and BLE# (AO). 
For 8-bit wide memories the by1eenables must be 
decoded to produce AOand A1. The same by1ese- 
lect logic can be used in 16- and 8-bit systems since 
BLE# is exactly the same as AO(see Table 7.5). 


BEO#-BE3# 
can be decoded as shown in Table 
7.5 to generate A1, BHE# and BLE#. The by1ese- 
lect logic necessary to generate BHE# and BLE# is 
shown in Figure 7.5. 


Inle1486TIol 
Address 
Bus (A31-A2 
8EO"-8E3") 
32-8;1 


~icroprocessor 
"emory 


58,,1 
1 
BS16" 


A31-A2 


Address 
+-- 
16-Bit 
Decode 
BHE", 8LE", 
A1 
•.•• mory 


8EO"-8E3" 
Byle 
Selecl 
Logic 


AO(BLE"), A1 
8-Bil 


A31-A2 
••••mory 


Intel486™ 
CPU Signals 
8, 16-Bit Bus Signals 
Comments 
BE3# 
BE2# 
BE1# 
BEO# 
A1 
BHE# 
BLE# 
(AO) 


H* 
H* 
H* 
H* 
x 
x 
x 
x-no 
active bytes 


H 
H 
H 
L 
L 
H 
L 
H 
H 
L 
H 
L 
L 
H 
H 
H 
L 
L 
L 
L 
L 
H 
L 
H 
H 
H 
H 
L 
H* 
L* 
H* 
L* 
x 
x 
x 
x-not 
contiguous 
bytes 
H 
L 
L 
H 
L 
L 
H 
H 
L 
L 
L 
L 
L 
L 


L 
H 
H 
H 
H 
L 
H 
L* 
H* 
H* 
L* 
x 
x 
x 
x-not 
contiguous 
bytes 


L* 
H* 
L* 
H* 
x 
x 
x 
x-not 
contiguous 
bytes 


L* 
H* 
L* 
L* 
x 
x 
x 
x-not 
contiguous 
bytes 
L 
L 
H 
H 
H 
L 
L 
L* 
L* 
H* 
L* 
x 
x 
x 
x-not 
contiguous 
bytes 
L 
L 
L 
H 
L 
L 
H 
L 
L 
L 
L 
L 
L 
L 


BLE# 
asserted 
when 00-07 
of 16-bit bus is active. 


BHE # asserted 
when 08-015 
of 16-bit bus is active. 


A 1 low for all even words; A 1 high for all odd words. 


Key: 


x 
= don't care 
H = high voltage 
level 
L = low voltage 
level 


* = a non-occurring 
pattern of Byte Enables; either none are asserted, 
or the pattern has Byte Enables asserted 
for non-contiguous 
bytes 


~ 


BEO# 
A1 
BE1# 


240440-38 
~ 


BE1# 
BHE 
BE3# 


240440-39 


BEO# 


BE2# 


Combinations 
of BEO#-BE3# 
which 
never 
occur 
are those 
in which 
two or three 
asserted 
byte en- 
ables 
are separated 
by one or more 
negated 
byte 
enables. 
These 
combinations 
are "don't 
care" 
con- 
ditions 
in the decoder. 
A decoder 
can use the non- 
occurring 
BED # - BE3 # combinations 
to its best ad- 


vantage. 


Figure 
7.6 shows 
an Intel486 
Microprocessor 
data 
bus interface 
to 16- and 8-bit wide memories. 
Exter- 
nal byte swapping 
logic is needed 
on the data lines 
so that 
data 
is supplied 
to, and received 
from 
the 
Intel486 
Microprocessor 
on 
the 
correct 
data 
pins 
(see Table 7.4). 
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7.1.5 
DYNAMIC 
BUS SIZING 
DURING 
CACHE 
LINE 
FILLS 


888# 
and 8816# 
can be driven 
during cache 
line 
fills. 
The 
Intel486 
Microprocessor 
will 
generate 
enough 
8- or 16-bit cycles to fill the cache 
line. This 
can be up to 16 8-bit cycles. 


The external 
system should assume that all byte en- 
ables are active for the first cycle of a cache line fill. 
The 
Intel486 
Microprocessor 
will 
generate 
proper 
byte enables 
for subsequent 
cycles 
in the 
line fill. 


Table 7.6 shows the appropriate 
AO (8LE#), 
A1 and 
SHE # for the various 
combinations 
of the Intel486 
Microprocessor 
byte enables 
on both the first and 
subsequent 
cycles 
of the 
cache 
line fill. The 
••••• 
marks 
all combinations 
of byte enables 
that will be 
generated 
by the Intel486 
Microprocessor 
during 
a 
cache 
line fill. 


Physical 
4-byte 
words 
begin 
at addresses 
that 
are 
multiples 
of four. 
It is possible 
to transfer 
a logical 
operand 
that spans 
more than one physical 
4-byte 
word of memory 
or I/O at the expense 
of extra cy- 
cles. Examples 
are 4-byte operands 
beginning 
at ad- 
dresses 
that are not evenly 
divisible 
by 4, or 2-byte 
words 
split 
between 
two 
physical 
4-byte 
words. 
These 
are referred 
to as unaligned 
transfers. 


Operand 
alignment 
and data bus size dictate 
when 
multiple 
bus cycles are required. Table 7.7 describes 
the transfer 
cycles generated 
for all combinations 
of 
logical operand 
lengths, alignment, 
and data bus siz- 
ing. When 
multiple 
cycles 
are required 
to transfer 
a 
multi-byte 
logical 
operand, 
the 
highest-order 
bytes 
are transferred 
first. For example, 
when the proces- 
sor does a 4-byte unaligned 
read beginning 
at loca- 
tion x11 in the 4-byte 
aligned 
space, 
the three 
high 
order bytes are read in the first bus cycle. 
The low 
byte is read in a subsequent 
bus cycle. 


BE3# 
BE2# 
BEH 
BEO# 
First Cache 
Fill Cycle 
Any Other 
Cycle 
AO 
A1 
BHE# 
AO 
A1 
BHE# 


1 
1 
1 
0 
0 
0 
0 
0 
0 
1 
1 
1 
0 
0 
0 
0 
0 
0 
0 
0 
1 
0 
0 
0 
0 
0 
0 
0 
0 
0 
·0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
1 
1 
0 
1 
0 
0 
0 
1 
0 
0 
1 
0 
0 
1 
.0 
0 
0 
1 
0 
0 
·0 
0 
0 
1 
0 
0 
0 
1 
0 
0 
1 
0 
1 
1 
0 
0 
0 
0 
1 
1 
·0 
0 
1 
1 
0 
0 
0 
0 
1 
0 
·0 
1 
1 
1 
0 
0 
0 
1 
1 
0 


Byte-Length of Logical Operand 


1 
2 
4 


Physical Byte Address in 
xx 
00 
01 
10 
11 
00 
01 
10 
11 
Memory (Low Order Bits) 


Transfer Cycles 
b 
hb 
d 
hb 
hw 
h3 
over 32-Bit Bus 
w 
w 
w 
Ib 
13 
Iw 
Ib 


Transfer Cycles over 
Ib 
hb 
Iw 
hb 
hw 
mw 
16-Bit Data Bus 
b 
w 
hb 
w 
Ib 
hw 
Ib 
Iw 
hb 
= BS16# Asserted 
mw 
Ib 


Transfer Cycles over 
Ib 
hb 
mhb 
mlb 
8-Bit Data Bus 
b 
Ib 
Ib 
Ib 
hb 
mlb 
Ib 
hb 
mhb 
= BS8# Asserted 
hb 
hb 
hb 
Ib 
mhb 
mlb 
Ib 
hb 


hb 
mhb 
mlb 
Ib 


KEY: 
b = byte transfer 
w = 2-byte transfer 
3 = 3-byte transfer 
d = 4-byte transfer 


h = high-order portion 
I = low-order portion 
m = mid-order portion 


The function of unaligned transfers with dynamic 
bus sizing is not obvious. When the external systems 
asserts BS16# or BS8# forcing extra cycles, low- 
order bytes or words are transferred first (opposite 
to the example above). When the Intel486 Micro- 
processor requests a 4-byte read and the external 
system asserts BS16#, the lower 2 bytes are read 
first followed by the upper 2 bytes. 


In the unaligned transfer described above, the proc- 
essor requested three bytes on the first cycle. If the 
external system asserted BS16# during this 3-byte 
transfer, the lower word is transferred first followed 
by the upper byte. In the final cycle the lower byte of 
the 4·byte operand is transferred as in the 32-bit ex- 
ample above. 


The Intel486 Microprocessor supports a wide variety 
of bus transfers to meet the needs of high perform- 
ance systems. Bus transfers can be single cycle or 
multiple cycle, burst or non-burst, cacheable or non- 
cacheable, 8-, 16- or 32-bit, and pseudo-locked. To 
support multiprocessing systems there are cache in- 
validation cycles and locked cycles. 


4-ByteOperand 
~ 
t 
t 
byte with 
byte with 
lowest 
highest 
address 
address 


This section begins with basic non-cacheable non- 
burst single cycle transfers. It moves on to multiple 
cycle transfers and introduces the burst mode. 
Cacheability is introduced in Section 7.2.3. The re- 
maining sections describe locked, pseudo-locked, 
invalidate, bus hold and interrupt cycles. 


Bus cycles and data cycles are discussed in this 
section. A bus cycle is at least two clocks long and 
begins with ADS# active in the first clock and ready 
active in the last clock. Data is transferred to or from 
the Intel486 Microprocessor during a data cycle. A 
bus cycle contains one or more data cycles. 


Refer to Section 7.2.13 for a description of the bus 
states shown in the timing diagrams. 


7.2.1 NON·CACHEABLE NON·BURST SINGLE 
CYCLE 


7.2.1.1 No Wait States 


The fastest non-burst bus cycle that the Intel486 Mi- 
croprocessor supports is two clocks long. These cy- 
cles are called 2-2 cycles because reads and writes 
take two cycles each. The first 2 refers to reads and 


• 


the second to writes. For example, if a wait state 
needs to be added to a write, the cycle would be 
called 2-3. 


Basic two clock read and write cycles are shown in 
Figure 7.7. The Intel486 Microprocessor initiates a 
cycle by asserting the address status signal (ADS# ) 
at the rising edge of the first clock. The ADS# out- 
put indicates that a valid bus cycle definition and 
address is available on the cycle definition lines and 
address bus. 


The non-burst ready input (ROY#) is returned by the 
external system in the second clock. ROY# indi- 
cates that the external system has presented valid 
data on the data pins in response to a read or the 
external system has accepted data in response to a 
write. 


The Intel486 Microprocessor samples ROY# at the 
end of the second clock. The cycle is complete if 
ROY# is active (LOW) when sampled. Note that 
ROY# is ignored at the end of the first clock of the 
bus cycle. 


The burst last signal (BLAST#) is asserted (LOW) 
by the Intel486 Microprocessor during the second 
clock of the first cycle in all bus transfers illustrated 
in Figure 7.7. This indicates that each transfer is 
complete after a single cycle. The Intel486 Micro- 
processor asserts BLAST# in the last cycle of a bus 
transfer. 


The timing of the parity check output (PCHK#) is 
shown in Figure 7.7. The Intel486 Microprocessor 
drives the PCHK# output one clock after ready ter- 
minates a read cycle. PCHK# indicates the parity 
status for the data sampled at the end of the previ- 
ous clock. The PCHK# signal can be used by the 
external system. The Intel486 Microprocessor does 
nothing in response to the PCHK# output. 


7.2.1.2 
Inserting 
Wait States 


The external system can insert wait states into the 
basic 2-2 cycle by driving ROY# inactive at the end 
of the second clock. ROY# must be driven inactive 
to insert a wait state. Figure 7.8 illustrates a simple 
non-burst, non-cacheable signal with one wait state 
added. Any number of wait states can be added to 
an Intel486 Microprocessor bus cycle by maintaining 
ROY# inactive. 


The burst ready input (BRDY#) must be driven inac- 
tive on all clock edges where ROY# is driven inac- 
tive for proper operation of these simple non-burst 
cycles. 
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7.2.2 
MULTIPLE 
AND BURST 
CYCLE 
BUS 
TRANSFERS 


Multiple cycle bus transfers can be caused by inter- 
nal requests from the Intel486 Microprocessor or by 
the external memory system. An internal request for 
a 64-bit floating point load or a 128-bit pre-fetch 
must take more than one cycle. Internal requests for 
unaligned data may also require multiple bus cycles. 
A cache line fill requires multiple cycles to complete. 
The external system can cause a multiple cycle 
transfer when it can only supply 8 or 16 bits per 
cycle. 


Only multiple cycle transfers caused by internal re- 
quests are considered in this section. Cacheable cy- 
cles and 8- and 16-bit transfers are covered in Sec- 
tions 7.2.3 and 7.2.5. 


7.2.2.1 
Burst Cycles 


The Intel486 Microprocessor can accept burst cy- 
cles for any bus requests that require more than a 
single data cycle. During burst cycles, a new data 
item is strobed into the Intel486 Microprocessor ev- 
ery clock rather than every other clock as in non- 
burst cycles. The fastest burst cycle requires 2 
clocks for the first data item with subsequent data 
items returned every clock. 


The Intel486 Microprocessor is capable of bursting a 
maximum of 32 bits during a write. Burst writes can 
only occur if BS8# or BS16# is asserted. For exam- 
ple, the Intel486 Microprocessor can burst write four 
8-bit operands or two 16-bit operands in a single 
burst cycle. But the Intel486 Microprocessor cannot 
burst multiple 32-bit writes in a single burst cycle. 


Burst cycles begin with the Intel486 Microprocessor 
driving out an address and asserting ADS# in the 
same manner as non-burst cycles. The Intel486 mi- 
croprocessor indicates that it is willing to perform a 
burst 
cycle 
by 
holding 
the 
burst 
last 
signal 
(BLAST#) inactive in the second clock of the cycle. 
The external system indicates its willingness to do a 
burst cycle by returning the burst ready signal 
(BRDY#) active. 


The addresses of the data items in a burst cycle will 
all fall within the same 16-byte aligned area (corre- 
sponding to an internal Intel486 Microprocessor 
cache line). A 16-byte aligned area begins at loca- 
tion XXXXXXXOand ends at location XXXXXXXF. 
During a burst cycle, only BEO-3 #, A2' and A3 may 
change. A4-A31' M/IO#, 
D/C#, and W/R# 
will re- 


main stable throughout a burst. Given the first ad- 
dress in a burst, external hardware can easily calcu- 
late the address of subsequent transfers in advance. 
An external memory system can be designed to 
quickly fill 
the 
Intel486 
microprocessor 
internal 


cache lines. 
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Figure 7.8. Basic 3·3 Bus Cycle 
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Burst cycles 
are not limited 
to cache 
line fills. Any 
multiple 
cycle 
read 
request 
by the 
Intel486 
Micro- 
processor 
can be converted 
into a burst cycle. The 
Intel486 
Microprocessor 
will only burst the number 
of bytes 
needed 
to complete 
a transfer. 
For exam- 
ple, eight bytes will be bursted 
in for a 64-bit floating 
point non-cacheable 
read. 


The 
external 
system 
converts 
a multiple 
cycle 
re- 
quest into a burst cycle by returning 
BROY # active 
rather than ROY # (non-burst 
ready) in the first cycle 
of a transfer. 
For cycles that cannot 
be bursted such 
as interrupt 
acknowledge 
and halt, BROY # has the 
same 
effect 
as ROY #. 
BROY # 
is ignored 
if both 
BROY # and ROY # are returned 
in the same clock. 
Memory 
areas 
and 
peripheral 
devices 
that 
cannot 
perform 
bursting 
must terminate 
cycles with ROY # . 


7.2.2.2 
Terminating 
Multiple 
and 
Burst Cycle Transfers 


The 
Intel486 
Microprocessor 
drives 
BLAST# 
inac- 
tive for all but the last cycle in a multiple 
cycle trans- 
fer. BLAST# 
is driven 
inactive 
in the first cycle 
to 
inform 
the 
external 
system 
that 
the transfer 
could 
take 
additional 
cycles. 
BLAST # 
is driven 
active 
in 
the last cycle of the transfer 
indicating 
that the next 
time 
BROY # 
or ROY # 
is returned 
the transfer 
is 
complete. 


BLAST # is not valid in the first clock of a bus cycle. 
It should 
be sampled 
only in the second 
and subse- 


,quent clocks 
when ROY # or BROY # is returned. 


The number 
of cycles 
in a transfer 
is a function 
of 
several factors 
including 
the number of bytes the mi- 
croprocessor 
needs to complete 
an internal 
request 
(1, 2, 4, 8, or 16), the state 
of the bus size inputs 
(BS8# 
and BS16#), 
the state of the cache 
enable 
input (KEN #) and alignment 
of the data to be trans- 
ferred. 


When 
the 
Intel486 
Microprocessor 
initiates 
a re- 
quest 
it knows 
how many 
bytes will be transferred 
and if the data is aligned. 
The external 
system 
must 
tell the microprocessor 
whether 
the data is cache- 
able (if the transfer 
is a read) and the width 
of the 
bus by returning 
the state of the KEN #, BS8 # and 
BS16# 
inputs one clock before ROY# 
or BROY# 
is 
retur'led. 
The 
Intel486 
Microprocessor 
determines 
how 
many 
cycles 
a transfer 
will take 
based 
on its 
internal 
information 
and inputs from the external 
sys- 
tem. 


BLAST # is not valid in the first clock of a bus cycle 
because 
the Intel486 
Microprocessor 
cannot 
deter· 
mine the number 
of cycles 
a transfer 
will take until 
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the 
external 
system 
returns 
KEN # , 
BS8 # 
and 
BS16#. 
BLAST# 
should 
only 
be 
sampled 
in the 
second 
and subsequent 
clocks 
of a cycle when the 
external 
system 
returns 
ROY # or BROY #. 


The system 
may terminate 
a burst cycle by returning 
ROY # 
instead 
of 
BROY #. 
BLAST # 
will 
remain 
deasserted 
until 
the 
last 
transfer. 
However, 
any 
transfers 
required 
to complete 
a cache 
line fill will 
follow the burst order, e.g., if burst order was 4, 0, C, 
8 and ROY # was returned 
at after 0, the next trans- 
fers will be from C and 8. 


7.2.2.3 
Non-Cacheable, 
Non-Burst, 
Multiple 
Cycle Transfers 


Figure 7.9 illustrates 
a 2 cycle non·burst, 
non-cache- 
able 
multiple 
cycle 
read. 
This 
transfer 
is simply 
a 
sequence 
of two single cycle transfers. 
The Intel486 
Microprocessor 
indicates 
to the external 
system that 
this is a multiple 
cycle 
transfer 
by driving 
BLAST # 
inactive 
during 
the second 
clock 
of the first cycle. 


The external 
system 
returns 
ROY # active indicating 
that 
it will not burst the data. The external 
system 
also indicates 
that the data is not cacheable 
by reo 
turning 
KEN # 
inactive 
one 
clock 
before 
it returns 
ROY # 
active. 
When 
the 
Intel486 
Microprocessor 
samples 
ROY # active 
it ignores 
BROY # . 


Each 
cycle 
in the 
transfer 
begins 
when 
ADS # 
is 
driven 
active 
and 
the cycle 
is complete 
when 
the 
external 
system 
returns 
ROY # active. 


The Intel486 
Microprocessor 
indicates 
the last cycle 
of the transfer 
by driving 
BLAST # active. 
The next 
ROY # 
returned 
by the external 
system 
terminates 
the transfer. 


7.2.2.4 
Non-Cacheable 
Burst Cycles 


The 
external 
system 
converts 
a multiple 
cycle 
re- 


quest into a burst cycle by returning 
BROY # active 
rather 
than 
ROY # in the first cycle 
of the transfer. 


This is illustrated 
in Figure 7.10. 


There are several features 
to note in the burst read. 


ADS # is only driven 
active 
during 
the first cycle 
of 
the transfer. 
ROY # 
must 
be driven 
inactive 
when 
BROY # is returned 
active. 


BLAST # behaves 
exactly as it does in the non-burst 
read. BLAST # is driven inactive 
in the second 
clock 
of the first cycle of the transfer 
indicating 
more cy- 
cles to follow. 
In the last cycle, 
BLAST# 
is driven 
active telling the external 
memory 
system to end the 
burst after returning 
the next BROY # . 
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Any memory read can become 
a cache fill operation. 
The external 
memory 
system 
can allow 
a read re- 
quest 
to fill a cache 
line by returning 
KEN # active 
one clock 
before 
ROY # or BROY # during the first 
cycle 
of 
the 
transfer 
on 
the 
external 
bus. 
Once 
KEN # is asserted 
and the remaining 
three 
require- 
ments described 
below are met, the Intel486 
Micro- 
processor 
will fetch 
an entire cache 
line regardless 
of the state of KEN #. 
KEN # must be returned 
ac- 
tive in the last cycle of the transfer 
for the data to be 
written 
into the internal 
cache. 
The Intel486 
Micro- 
processor 
will only 
convert 
memory 
reads 
or pre- 
fetches 
into a cache 
fill. 


KEN# 
is ignored during write or 1/0 cycles. 
Memory 
writes 
will 
only 
be stored 
in the 
on-chip 
cache 
if 
there 
is a cache 
hit. I/O 
space 
is never 
cached 
in 
the internal 
cache. 


To transform 
a read or a prefetch 
into a cache 
line 
fill the following 
conditions 
must be met: 


1. The KEN # pin must be asserted 
one clock pri- 
or to ROY # or BROY # being returned 
for the 
first data cycle. 


2. The cycle must be of the type that can be inter- 
nally 
cached. 
(Locked 
reads, 
1/0 
reads, 
and 
interrupt 
acknowledge 
cycles 
are never cach- 
ed). 


3. The 
page 
table 
entry 
must 
have 
the 
page 
cache 
disable 
bit (PCO) set to O. To cache 
a 
page table entry, the page directory 
must have 
PCO = O. To cache 
reads or prefetches 
when 
paging 
is disabled, 
or to cache the page direc- 
tory entry, control 
register 
3 (CR3) must have 
PCO=O. 


4. The cache disable 
(CD) bit in control 
register 0 
(CRO) must be clear. 


External 
hardware 
can determine 
when the Intel486 
Microprocessor 
has transformed 
a read or prefetch 
into a cache 
fill by examining 
the 
KEN#, 
MIIO#, 
O/C#, 
W/R#, 
LOCK#, 
and PCO pins. These 
pins 
convey to the system the outcome 
of conditions 
1-3 
in the above list. In addition, 
the Intel486 
drives PCO 
high whenever 
the CD bit in CRO is set, so that ex- 
ternal 
hardware 
can evaluate 
condition 
4. 


7.2.3.1 
Byte Enables 
during a Cache 
Line Fill 


For the first cycle in the line fill, the state of the byte 
enables 
should 
be 
ignored. 
In 
a 
non-cacheable 
memory 
read, the 
byte 
enables 
indicate 
the 
bytes 
actually 
required 
by the memory 
or code fetch. 


The Intel486 
Microprocessor 
expects 
to receive val- 
id data on its entire bus (32 bits) in the first cycle of a 
cache 
line fill. Data should 
be returned 
with the as- 
sumption 
that all the byte enable 
pins are driven ac- 
tive. 
However 
if BS8 # 
is asserted 
only 
one 
byte 
need be returned 
on data lines 00-07. 
Similarly 
if 
BS16# 
is asserted 
two bytes should 
be returned 
on 
00-015. 


The 
Intel486 
Microprocessor 
will generate 
the ad- 
dresses 
and byte enables 
for all subsequent 
cycles 
in the line fill. The order in which data is read during 
a line fill depends 
on the address 
of the first item 
read. Byte ordering 
is discussed 
in Section 
7.2.4. 


7.2.3.2 
Non-Burst 
Cacheable 
Cycles 


Figure 7.11 shows a non-burst 
cacheable 
cycle. The 
cycle becomes 
a cache fill when the Intel486 
Micro- 
processor 
samples 
KEN # active 
at the end of the 
first 
clock. 
The 
Intel486 
Microprocessor 
drives 
BLAST # inactive in the second 
clock in response 
to 
KEN #. BLAST # is driven inactive 
because 
a cache 
fill 
requires 
3 
additional 
cycles 
to 
complete. 
BLAST # 
remains 
inactive 
until the 
last transfer 
in 
the cache 
line fill. KEN # must be returned 
active 
in 
the last cycle of the transfer 
for the data to be writ- 
ten into the internal 
cache. 


Note that this cycle 
would 
be a single 
bus cycle 
if 
KEN # was not sampled 
active at the end of the first 
clock. 
The subsequent 
three 
reads would 
not have 
happened 
since a cache fill was not requested. 


The BLAST # output 
is invalid 
in the first clock 
of a 
cycle. 
BLAST # may be active 
during the first clock 
due to earlier 
inputs. 
Ignore 
BLAST # until the sec- 
ond clock. 


During the first cycle of the cache 
line fill the exter- 
nal system 
should 
treat the byte enables 
as if they 
are all active. 
In subsequent 
cycles 
in the burst, the 
Intel486 
Microprocessor 
drives 
the 
address 
lines 
and byte enables 
(see Section 
7.2.4.2 for Burst and 
Cache 
Line Fill Order). 
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The external system informs the Intel486 Microproc- 
essor that it will burst the line in by driving BRDY# 
active at the end of the first cycle in the transfer. 


7.2.3.3 Burst Cacheable Cycles 


Figure 7.12 illustrates a burst mode cache fill. As in 
.Figure 7.11, the transfer becomes a cache line fill 
when the external system returns KEN# active at 
the end of the first clock in the cycle. 


Note that during a burst cycle ADS# is only driven 
with the first address. 
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7.2.3.4 
Effect 
of Changing 
KEN# 
during a 
Cache 
Line Fill 


cache line fill. Similarly. 
it uses the value of KEN# 
in 
the last cycle. 
before 
early 
ROY # to load the line 
just 
retrieved 
from 
the 
memory 
into 
the 
cache. 


KEN # is sampled 
every clock. 
it must satisfy 
setup 
and hold time. 


KEN# 
can change 
multiple 
times 
as long as it ar- 
rives at its final value 
in the clock 
before 
ROY # or 
BROY # is returned. 
This is illustrated 
in Figure 7.13. 


Note 
that 
the 
timing 
of 
BLAST # 
follows 
that 
of 
KEN # 
by one clock. 
The 
Intel486 
samples 
KEN # 
every clock and uses the value returned 
in the clock 
before 
ready to determine 
if a bus cycle would be a 


KEN # can also change 
multiple times before a burst 


cycle as long as it arrives at its final value one clock 
before 
ready is returned 
active. 
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Driving BRDY# and ROY# inactive adds a wait 
state to the transfer. A burst cycle where two clocks 
are required for every burst item is shown in Figure 
7.14. 
7.2.4.1 
Adding 
Walt States 
to Burst Cycles 


Burst cycles need not return data on every clock. 
The Intel486 Microprocessor will only strobe data 
into the chip when either ROY# or BRDY# are ac- 
tive. 
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The burst order used by the Intel486 
Microprocessor 
is shown in Table 7.8. This burst order is followed 
by 
any burst cycle 
(cache 
or not), cache 
line fill (burst 
or not) or code prefetch. 


The microprocessor 
presents 
each request 
for data 
in an order 
determined 
by the first 
address 
in the 
transfer. 
For example, 
if the first address 
was 
104 
the 
nex1 three 
addresses 
in the 
burst will be 100, 


10C and 108. 


First 
Second 
Third 
Fourth 
Addr. 
Addr. 
Addr. 
Addr. 


0 
4 
8 
C 
4 
0 
C 
8 
8 
C 
0 
4 
C 
8 
4 
0 


An example 
of burst address 
sequencing 
is shown in 
Figure 7.15. 
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The 
sequences 
shown 
in Table 
7.7 accommodate 
systems 
with 64-bit busses 
as well as systems 
with 
32-bit 
data 
busses. 
The 
sequence 
applies 
to 
all 
bursts, 
regardless 
of whether 
the 
purpose 
of the 
burst is to fill a cache 
line, do a 64-bit read, or do a 
pre-fetch. 
If either 
Bs8# 
or Bs16# 
is returned 
ac- 
tive, 
the 
Intel486 
Microprocessor 
completes 
the 
transfer 
of the current 
32-bit word 
before 
progress- 


ing to the next 32-bit word. 
For example, 
a Bs16# 
burst to address 
4 has the following 
order: 4-6-0-2- 
C-E-8-A. 


7.2.4.3 Interrupted Burst Cycles 


Some memory 
systems 
may not be able to respond 
with burst cycles 
in the order 
defined 
in Table 
7.7. 


To support 
these systems 
the Intel486 
Microproces- 
sor allows a burst cycle to be interrupted 
at any time. 


The Intel486 
Microprocessor 
will automatically 
gen- 


erate another 
normal bus cycle after being interrupt- 


ed to complete 
the data transfer. 
This is called 
an 
interrupted 
burst cycle. The external 
system 
can re- 


spond 
to 
an 
interrupted 
burst 
cycle 
with 
another 


burst cycle. 


The external 
system 
can interrupt 
a burst cycle 
by 
returning 
ROY # instead 
of BROY #. 
ROY # can be 
returned 
after any number of data cycles terminated 
with BROY#. 


An example 
of an interrupted 
burst cycle is shown in 


Figure 
7.16. 
The 
Intel486 
Microprocessor 
immedi- 


ately drives ADS # active to initiate a new bus cycle 
after 
ROY # 
is returned 
active. 
BLAST # 
is driven 
inactive 
one 
clock 
after 
AOs# 
begins 
the second 
bus cycle indicating 
that the transfer 
is not complete. 
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KEN# need not be returned active in the first data 
cycle of the second part of the transfer in Figure 
7.16. The cycle had been converted to a cache fill in 
the first part of the transfer and the Intel486 Micro- 
processor expects the cache fill to be completed. 
Note that the first half and second half of the trans- 
fer in Figure 7.16 are each tWocycle burst transfers. 


The order in which the Intel486 Microprocessor re- 
quests operands during an interrupted burst transfer 
is determined by Table 7.7. Mixing ROY# 
and 
BROY# does not change the order in which oper- 
and addresses are requested by the Intel486 Micro- 
processor. 


An example of the order in which the Intel486 Micro- 
processor requests operands during a cycle in which 
the external system mixes ROY# and BROY# is 
shown in Figure 7.17. The Intel486 Microprocessor 
initially requests a transfer beginning at location 104. 
The transfer becomes a cache line fill when the ex- 
ternal system returns KEN# active. The first cycle of 
the cache fill transfers the contents of location 104 
and is terminated with ROY#. The Intel486 Micro- 
processor drives out a new request (by asserting 
AOS#) to address 100. If the external system termi- 
nates the second cycle with BROY#, the Intel486 
Microprocessor will next request/expect 
address 
10C. The correct order is determined by the first cy- 
cle in the transfer, which may not be the first cycle in 
the burst if the system mixes ROY# with BROY#. 
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The Intel486 
Microprocessor 
supports 
both 16- and 
8-bit external 
busses through 
the BS16# 
and BS8# 
inputs. BS16# 
and BS8# 
allow the external 
system 
to specify, 
on a cycle 
by cycle 
basis, whether 
the 
addressed 
component 
can supply 
8, 16 or 32 bits. 


BS16# 
and BS8# 
can be used in burst cycles 
as 
well as non-burst 
cycles. 
If both BS16# 
and BS8# 
are returned 
active 
for any bus cycle, 
the Intel486 
Microprocessor 
will respond 
as if only 
BS8# 
were 
active. 


The timing of BS16# 
and BS8# 
is the same as that 
of KEN #. BS 16 # and BS8 # must be driven 
active 
before 
the first 
ROY # 
or BRDY # 
is driven 
active. 


Driving the BS16# 
and BS8# 
active 
can force 
the 
Intel486 
Microprocessor 
to run additional 
cycles 
to 
complete 
what would 
have been only a single 32-bit 
cycle. 
BS8 # and BS 16 # may change 
the state 
of 
BLAST # when 
they force 
subsequent 
cycles 
from 
the transfer. 


Figure 
7.18 
shows 
an 
example 
in 
which 
BS8# 
forces 
the Intel486 
Microprocessor 
to run two extra 
cycles 
to complete 
a transfer. 
The 
Intel486 
Micro- 
processor 
issues a request for 24 bits of information. 


The external 
system 
drives 
BS8 # active 
indicating 
that only eight bits of data can be supplied 
per cycle. 
The Intel486 
Microprocessor 
issues two extra cycles 
to complete 
the transfer. 
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Extra cycles forced 
by the BS16# 
and BS8# 
should 
be viewed 
as independent 
bus cycles. 
BS16# 
and 
BS8 # 
should 
be driven 
active 
for each 
additional 
cycle unless the addressed 
device 
has the ability to 
change 
the number 
of bytes 
it can return 
between 


cycles. 
The 
Intel486 
Microprocessor 
will 
drive 
BLAST # 
inactive 
until 
the 
last 
cycle 
before 
the 


transfer 
is complete. 


Refer 
to 
Section 
7.1.3 
for 
the 
sequencing 
of ad- 
dresses 
while BS8# 
or BS16# 
are active. 


BS8# 
and BS16# 
operate 
during burst cycles in ex- 


actly the same manner 
as non-burst 
cycles. 
For ex- 


ample, 
a single non-cacheabfe 
read could be trans- 


ferred 
by the Intel486 
Microprocessor 
as four 8-bit 


burst 
data 
cycles. 
Similarly, 
a single 
32-bit 
write 


could 
be written 
as four 8-bit burst data cycles. 
An 


example 
of a burst 
write 
is shown 
in Figure 
7.19. 


Burst 
writes 
can only 
occur 
if BS8# 
or BS16# 
is 


asserted. 


ClK 


ADS# 
\ 
/ 


ADDR 
X 
: 'C 
SPEC 


I 


BEO-3# 
X 
X 
X 
X 
: 'C 


RDY# 
• 


BRDY# 


BS8# 
\ 
/ 


BLAST# 
X 
/ 
I 
\ 
C 
I 
I 
I 


DATA 
( 
i FROM 
CPU i 
)- 


240440-62 


Locked 
cycles 
are generated 
in software 
for any in- 
struction 
that 
performs 
a read-modify-write 
opera- 
tion. During a read-modify-write 
operation 
the proc- 
essor 
can 
read 
and 
modify 
a variable 
in external 
memory 
and be assured 
that the variable 
is not ac- 
cessed 
between 
the read and write. 


Locked 
cycles 
are automatically 
generated 
during 
certain 
bus transfers. 
The xchg 
(exchange) 
instruc- 
tion generates 
a locked 
cycle when one of its oper- 
ands is memory 
based. 
Locked 
cycles 
are generat- 
ed when a segment 
or page table entry is updated 
and during interrupt 
acknowledge 
cycles. Locked cy- 
cles are also generated 
when the LOCK instruction 
prefix is used with selected 
instructions. 


Locked cycles are implemented 
in hardware 
with the 
LOCK # pin. When 
LOCK # is active, 
the processor 
is performing 
a read-modify-write 
operation 
and the 
external 
bus should 
not be relinquished 
until the cy- 
cle 
is complete. 
Multiple 
reads 
or writes 
can 
be 
locked. 
A 
locked 
cycle 
is shown 
in Figure 
7.20. 


LOCK # goes active with the address and bus defini- 
tion pins at the beginning 
of the first read cycle and 
remains 
active 
until RDY # is returned 
for the last 
write cycle. 
For unaligned 
32 bits read-modify-write 
operation, 
the LOCK # remains 
active 
for the entire 
duration 
of the multiple 
cycle. It will go inactive when 
RDY # is returned 
for the last write cycle. 
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When LOCK# 
is active, the Intel486 
Microprocessor 
will recognize 
address 
hold and backoff 
but will not 
recognize 
bus hold. It is left to the external 
system to 
properly 
arbitrate 
a central 
bus when 
the 
Intel486 
Microprocessor 
generates 
LOCK # . 


Pseudo-locked 
cycles 
assure 
that 
no other 
master 
will be given control 
of the bus during operand 
trans- 
fers which take more than one bus cycle. Examples 
include 
64-bit 
floating 
point 
read and writes, 
64-bit 
descriptor 
loads and cache 
line fills. 


Pseudo-locked 
transfers 
are 
indicated 
by 
the 
PLOCK # 
pin. 
The 
memory 
operands 
must 
be 
aligned for correct 
operation 
of a pseudo-locked 
cy- 
cle. 


PLOCK # need not be examined 
during burst reads. 
A 64-bit 
aligned 
operand 
can 
be retrieved 
in one 
burst (note: this is only valid in systems 
that do not 
interrupt 
bursts). 


The 
system 
must 
examine 
PLOCK # 
during 
64-bit 
writes 
since 
the 
Intel486 
Microprocessor 
cannot 
burst write more than 32 bits. However, 
burst can be 
used 
within 
each 
32-bit 
write 
cycle 
if 
BSS # 
or 
Bs16# 
is asserted. 
BLAST will be deasserted 
in re- 
sponse 
to Bs8# 
or Bs16#. 
A 64-bit 
write 
will be 
driven out as two non-burst 
bus cycles. 
BLAST # is 
asserted 
during both writes since a burst is not pos- 


sible. 
PLOCK # is asserted 
during 
the first write 
to 
indicate 
that another 
write follows. 
This behavior 
is 
shown 
in Figure 7.21. 


The first cycle 
of a 64-bit floating 
point write 
is the 
only case in which 
both PLOCK # and BLAST # are 
asserted. 
Normally 
PLOCK # and BLAST # are the 
inverse 
of each other. 


During all of the cycles where 
PLOCK# 
is asserted, 
HOLD 
is not 
acknowledged 
until 
the 
cycle 
com- 
pletes. 
This results 
in a large HOLD 
latency, 
espe- 
cially when Bs8# 
or Bs16# 
is asserted. 
To reduce 
the HOLD latency 
during these cycles, 
windows 
are 
available 
between 
transfers 
to allow HOLD to be ac- 
knowledged 
during 
non-cacheable, 
non-bursted 
code 
prefetches. 
PLOCK # 
will 
be asserted 
since 
BLAST # is negated, 
but it is ignored 
and HOLD is 
recognized 
during the prefetch. 


PLOCK # can change 
several 
times 
during 
a cycle 
settling 
to its final 
value 
in the 
clock 
ready 
is re- 


turned. 


Invalidate 
cycles 
are needed 
to keep 
the 
Intel486 
Microprocessor's 
internal cache contents 
consistent 
with external 
memory. 
The Intel486 
microprocessor 


contains 
a mechanism 
for listening to writes by other 


devices 
to external 
memory. 
When 
the 
processor 


finds 
a write to a Section 
of external 
memory 
con- 
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tained in its internal cache, the processor's internal 
copy is invalidated. 


Invalidations use two pins, address hold request 
(AHOlD) 
and valid external address (EADS#). 
There are two steps in an invalidation cycle. First, 
the external system asserts the AHOlD input forcing 
the Intel486 Microprocessor to immediately relin- 
quish its address bus. Next, the external system as- 
serts EADS# indicating that a valid address is on 
the Intel486 Microprocessor's address bus. EADS# 
and the invalidation address, Figure 7-22 shows the 
fastest possible invalidation cycle. The Intel486 cy- 
cle CPU recognizes AHOlD on one ClK edge and 
floats the address bus in response. To allow the ad- 
dress bus to float and avoid contention, EADS# and 
the invalidation address should not be driven until 
the following ClK edge. The microprocessor reads 
the address over its address lines. If the microproc- 
essor finds this address in its internal cache, the 
cache entry is invalidated. Note that the Intel486 Mi- 
croprocessor's address bus is input/output unlike 
the 386 Microprocessor's bus, which is output only. 


The Intel486 Microprocessor immediately relinquish- 
es its address bus in the next clock upon assertion 
of AHOlD. 
For example, the bus could be 3 wait 
states into a read cycle. If AHOlD is activated, the 
Intel486 Microprocessor will immediately float its 


address bus before ready is returned terminating the 
bus cycle. 


When AHOLD is asserted only the address bus is 
floated, the data bus can remain active. Data can be 
returned for a previously specified bus cycle during 
address hold (see Figures 7.22, 7.23). 


EADS# is normally asserted when an external mas- 
ter drives an address onto the bus. AHOlD need not 
be driven for EADS# to generate an internal invali- 
date. If EADS# alone is asserted while the Intel486 
Microprocessor is driving the address bus, it is pos- 
sible that the invalidation address will come from the 
Intel486 Microprocessor itself. 


Note that it is also possible to run an invalidation 
cycle by asserting EADS# when HOLD or BOFF# 
is asserted. 


Running an invalidate cycle prevents the Intel486 
Microprocessor cache from satisfying other internal 
requests, so invalidations should be run only when 
necessary. The fastest possible invalidate cycle is 
shown in Figure 7.22, while a more realistic invalida- 
tion cycle is shown in 7.23. Both of the examples 
take one clock of cache access from the rest of the 
Intel486 Microprocessor. 
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Figure 7.22. Fast Internal Cache Invalidation Cycle 
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7.2.8.1 Rate of Invalidate Cycles 


The Intel486 Microprocessor can accept one invali- 
date per clock except in the last clock of a line fill. 
One invalidate per clock is possible as long as 
EAOS# is negated in ONE or BOTH of the following 
cases: 
1. In the clock ROY# or BROY# is returned for 
the last time. 
2. In the clock following ROY# or BROY# being 
returned for the last time. 


This definition allows two system designs. Simple 
designs can restrict invalidates to one every ~t~er 
clock. The simple design need not track bus activity. 
Alternatively, systems can request one invalidate 
per clock provided that the bus is monitored. 


7.2.8.2 Running Invalidate Cycles Concurrently 
with Line Fills 


Precautions are necessary to avoid caching stale 
data in the Intel486 Microprocessor's cache in a sys- 
tem with a second level cache. An example of a 
system with a second level cache is shown in Figure 
7.24. An external device can be writing to main 
memory over the system bus while the Intel486 Mi- 
croprocessor is retrieving data from the second level 
cache. The Intel486 Microprocessor will need to in- 
validate a line in its internal cache if the external 
device is writing to a main memory address also 
contained in the Intel486 Microprocessor's cache. 


A potential problem exists if the external device is 
writing to an address In external memory, and at the 
same time the Intel486 Microprocessor is reading 
data from the same address in the second level 
cache. The system must force an invalidation cycle 
to invalidate the data that the Intel486 Microproces- 
sor has requested during the line fill. 


Second 


Level 
Cache 


Figure 7.24. System with Second Level Cache 


If the system asserts EAOS# before the first data in 
the line fill is returned to the Intel486 Microproces- 
sor, the system must return data consistent with the 
new data in the external memory upon resumption of 
the line fill after the invalidation cycle. This is illus- 
trated by the asserted EAOS# signal labeled 1 in 
Figure 7.25. 


If the system asserts EAOS# at the same time or 
after the first data in the line fill is returned (in the 
same clock that the first ROY# or BROY# is re- 
turned or any subsequent clock in the line fill) the 
data will be read into the Intel486 Microprocessors 
input buffers but it will not be stored in the on-chip 
cache. This is illustrated by asserted EAOS# signal 
labeled 2 in Figure 7.25. The stale data will be used 
to satisfy the request that initiated the cache fill cy- 
cle. 
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NOTES: 
1. Data returned 
must be consistent 
if its address 
equals 
the invalidation 
address 
in this clock 
2. Data returned 
will not be cached 
if its address 
equals 
the invalidation 
address 
in this clock 


The Intel486 Microprocessor provides a bus hold, 
hold acknowledge protocol using the bus hold re- 
quest (HOLD) and bus hold acknowledge (HLDA) 
pins. Asserting the HOLD input indicates that anoth- 
er bus master desires control of the Intel486 Micro- 
processor's bus. The processor will respond by 
floating its bus and driving HLDA active when the 
current bus cycle, or sequence of locked cycles is 
complete. An example of a HOLD/HLDA transaction 
is shown in Figure 7.26a. Unlike the 386 Microproc- 
essor, the Intel486 Microprocessor can respond to 
HOLD by floating its bus and asserting HLDA while 
RESET is asserted. 


Note that HOLD will be recognized during un-aligned 
writes (less than or equal to 32-bits) with BLASTII 
being active for each write. For greater than 32-bit or 
un-aligned write, HOLDII recognition is prevented 
by PLOCKII getting asserted. 


For cacheable and nonbursted or bursted cycles, 
HOLD is acknowledged during backoff only if HOLD 
and BOFFII are asserted during an active bus cycle 
(after ADSII asserted) and before the first RDY# or 
BRDYII has been returned (see Figure 7.26b). The 
order in which HOLD and BOFFII go active is unim- 
portant (so long as both are active prior to the first 
ROYII IBRDY II 
returned by the system). Figure 
7.26b shows the case where HOLD is asserted first; 
HOLD could be asserted simultaneously or after 
BOFFII and still be acknowledged. 


The pins floated during bus hold are: BEOIl-BE3#, 
PCD, 
PWT, 
W/RII, 
D/CII, 
M/IOIl, 
LOCK# , 


PLOCKII , 
ADSII, 
BLASTII , 
00-031, 
A2-A31 , 


DPO-DP3. 


The Intel486 Microprocessor generates interrupt ac- 
knowledge cycles in response to maskable interrupt 
requests generated on the interrupt request input 
(INTR) pin. Interrupt acknowledge cycles have a 
unique cycle type generated on the cycle type pins. 
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An example interrupt acknowledge transaction is 
shown in Figure 7.27. Interrupt acknowledge cycles 
are generated in locked pairs. Data returned during 
the first cycle is ignored. The interrupt vector is re- 
turned during the second cycle on the lower 8 bits of 
the data bus. The Intel486 Microprocessor has 256 
possible interrupt vectors. 
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The state of A2 distinguishes the first and second 
interrupt acknowledge cycles. The byte address 
driven during the first interrupt acknowledge cycle is 
4 (A31-A3 
low, A2 high, BE3# -BEU 
high, and 
BEO# low). The address driven during the second 
interrupt acknowledge cycle is 0 (A31-A2 
low, 
BE3#-BE1 # high, BEO# low). 
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Each of the interrupt acknowledge cycles are termi- 
nated when the external system returns ROY# or 
BRDY#. Wait states can be added by withholding 
ROY# or BRDY#. The Intel486 Microprocessor au- 
tomatically generates four idle clocks between the 
first and second cycles to allow for 8259A recovery 
time. 


The Intel486 Microprocessor provides four special 
bus cycles to indicate that certain instructions have 
been executed, or certain conditions have occurred 
internally. The special bus cycles in Table 7.9 are 
defined when the bus cycle definition pins are in 
the 
following 
state: 
M/lO# = 0, 
D/e# = 0 
and 
WIR # = 1. During these cycles the address bus is 
driven low while the data bus is undefined. 


Two of the special cycles indicate halt or shutdown. 
Another special cycle is generated when the In- 
tel486 Microprocessor executes an INVD (invalidate 
data cache) instruction and could be used to flush 
an external cache. The Write Back cycle is generat- 
ed when the Intel486 Microprocessor executes the 
WBINVD (write-back invalidate data cache) instruc- 
tion and could be used to synchronize an external 
write-back cache. 


The external hardware must acknowledge these 
special bus cycles by returning ROY# or BRDY#. 


BE3# 
BE2# 
BE1# 
BEO# 
Special 
Bus Cycle 


1 
1 
1 
0 
Shutdown 
1 
1 
0 
1 
Flush 
1 
0 
1 
1 
Halt 
0 
1 
1 
1 
Write Back 


7.2.11.1 Halt Indication Cycle 


The Intel486 Microprocessor halts as a result of exe- 
cuting a HALT instruction. Signaling its entrance into 2 
the halt state, a halt indication cycle is performed. 
The halt indication cycle is identified by the bus defi- 
nition signals in special bus cycle state and a byte 
address of 2. BEO# and BE2# are the only signals 
distinguishing halt indication from shutdown indica- 
tion, which drives an address of o. During the halt 
cycle undefined data is driven on 00-031. The halt 
indication cycle must be acknowledged by ROY# or 
BRDY# asserted. 
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A halted Intel486 Microprocessor resumes execu- 
tion when INTR (if interrupts are enabled) or NMI or 
RESET is asserted. 


7.2.11.2 Shutdown Indication Cycle 


The Intel486 Microprocessor shuts down as a result 
of a protection fault while attempting to process a 
double fault. Signaling its entrance into the shut- 
down state, a shutdown indication cycle is per- 
formed. The shutdown indication cycle is identified 
by the bus definition signals in special bus cycle 
state and a byte address of O. 


In a multi-master system another bus master may 
require the use of the bus to enable the Intel486 
Microprocessor to complete its current bus request. 
In this situation the Intel486 Microprocessor will 
need to restart its bus cycle after the other bus mas- 
ter ha~ completed its bus transaction. 


A bus cycle may be restarted if the external system 
asserts the backoff (BOFF#) input. The Intel486 Mi- 
croprocessor samples the BOFF# pin every clock. 
The Intel486 Microprocessor will immediately (in the 
next clock) float its address, data and status pins 
when BOFF# is asserted (see Figure 7.28). Any bus 
cycle in progress when BOFF# is asserted is abort- 


ed and any data returned to the processor is ig- 
nored. The same pins are floated in response to 
BOFF# as are floated in response to HOLD. HLDA 
is not generated in response to BOFF#. BOFF# 
has higher priority than ROY# or BRDY#. If either 
ROY# or BRDY# are returned in the same clock as 
BOFF#, BOFF# takes effect. 


The device asserting BOFF# is free to run any cy- 
cles it wants while the Intel486 Microprocessor bus 
is in its high impedance state. If backoff is requested 
after the Intel486 Microprocessor has started a cy- 
cle, the new master should wait for memory to return 
ROY# or BRDY# before assuming control of the 
bus. Waiting for ready provides a handshake to in- 
sure that the memory system is ready to accept a 
new cycle. If the bus is idle when BOFF# is assert- 
ed, the new master can start its cycle two clocks 
after issuing BOFF# . 


The external memory can view BOFF# in the same 
manner as BLAST#. Asserting BOFF# tells the ex- 
ternal memory system that the current cycle is the 
last cycle in a transfer. 


The bus remains in the high impedance state until 
BOFF# is negated. Upon negation, the Intel486 Mi- 
croprocessor restarts its bus cycle by driving out the 
address and status and asserting ADS#. The bus 
cycle then continues as usual. 


Asserting BOFF# during a burst, BS8# or BS16# 
cycle will force the Intel486 Microprocessor to ig- 
nore data returned for that cycle only. Data from pre- 
vious cycles will still be valid. For example, if 
BOFF# is asserted on the third BRDY# of a burst, 
the Intel486 Microprocessor assumes the data re- 
turned with the first and second BRDY# 's is correct 
and restarts the burst beginning with the third item. 
The same rule applies to transfers broken into multi- 
ple cycle by BS8# or BS16#. 


aborted. There are two possible solutions to this 
problem. The first is to have all devices recognize 
this condition and ignore ADS# until ready comes 
back. The second approach is to use a "two clock" 
backoff: in the first clock AHOLD is asserted, and in 
the second clock BOFF# is asserted. This guaran- 
tees that ADS# will not be floating low. This is only 
necessary in systems where BOFF# may be assert- 
ed in the same clock as ADS#. 


Asserting BOFF# in the same clock as ADS# will 
cause the Intel486 Microprocessor to float its bus in 
the next clock and leave ADS# floating low. Since 
ADS# is floating low, a peripheral may think that a 
new bus cycle has begun even-though the cycle was 


A bus state diagram is shown in Figure 7.30. A de- 
scription of the signals used in the diagram is given 
in Table 7.10. 
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Figure 7.30. Bus State Diagram 


Table 7.10. Bus State Description 


State 
Means 


Ti 
Bus is idle. Address and status signals may be driven to undefined values, or 
the bus may be floated to a high impedance state. 


T1 
First clock cycle of a bus cycle. Valid address and status are driven and 
ADS# is asserted. 


T2 
Second and subsequent clock cycles of a bus cycle. Data is driven if the 
cycle is a write, or data is expected if the cycle is a read. ROY# and BRDY# 
are sampled. 


T1b 
First clock cycle of a restarted bus cycle. Valid address and status are driven 
and ADS# is asserted. 


Tb 
Second and subsequent clock cycles of an aborted bus cycle. 
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The 
Intel486 
Microprocessor 
provides 
two 
options 
for 
reporting 
floating 
point 
errors. 
The 
simplest 
method 
is to 
raise 
interrupt 
16 whenever 
an 
un- 
masked 
floating 
point error occurs. 
This option 
may 
be enabled 
by setting the NE bit in control 
register 
0 
(CRO). 


The 
Intel486 
Microprocessor 
also provides 
the op- 
tion of allowing 
external 
hardware 
to determine 
how 
floating 
point errors are reported. 
This option 
is nec- 
essary 
for 
compatibility 
with 
the 
error 
reporting 
scheme 
used 
in DOS based 
systems. 
The 
NE bit 
must be cleared 
in CRO to enable 
user-defined 
error 
reporting. 
User-defined 
error reporting 
is the default 
condition 
because 
the NE bit is cleared 
on reset. 


Two pins, floating 
point 
error 
(FERR#) 
and ignore 
numeric 
error 
(IGNNE #), are provided 
to direct 
the 
actions 
of hardware 
if user-defined 
error reporting 
is 
used. 
The 
Intel486 
Microprocessor 
asserts 
the 
FERR # output 
to indicate 
that a floating 
point error 
has occurred. 
FERR # corresponds 
to the ERROR # 
pin on the 387 math coprocessor. 
However, 
there is 
a difference 
in the behavior 
of the two. 


In some 
cases 
FERR # 
is asserted 
when 
the next 
floating 
point instruction 
is encountered 
and in other 
cases 
it is asserted 
before 
the 
next floating 
point 
instruction 
is encountered 
depending 
upon the exe- 
cution state of the instruction 
causing the exception. 


The following 
class of floating 
point exceptions 
drive 
FERR # at the time the exception 
occurs (Le., before 
encountering 
the next floating 
point instruction). 


1. The stack 
fault, 
invalid 
operation, 
and denormal 
exceptions 
on all transcendental 
instructions, 
in- 
teger 
arithmetic 
instructions, 
FSQRT, 
FSEALE, 
FPREM(1), 
FXTRACT, 
FBLD, and FBSTP. 


2. Any 
exceptions 
on store 
instructions 
(including 
integer 
store instructions). 


The following 
class of floating 
point exceptions 
drive 
FERR# 
only 
after 
encountering 
the 
next 
floating 
point instruction. 


1. Exceptions 
other 
than 
on all transcendental 
in- 
structions, 
integer 
arithmetic 
instructions, 
FSQRT, 
FSCALE, 
FPREM(1), 
FXTRACT, 
FBLD, 
and FBSTP. 


2. Any exception 
on all basic arithmetic, 
load, com- 
pare, 
and 
control 
instructions 
(Le., all other 
in- 
structions). 


For both 
sets 
of exceptions 
above, 
the 
387 
Math 
Coprocessor 
asserts 
ERROR # when 
the error oc- 
curs 
and 
does 
not wait for the 
next floating 
point 
instruction 
to be encountered. 


asserted, 
the Intel486 
Microprocessor 
will ignore 
a 
user 
floating 
point 
error 
and 
continue 
executing 
floating 
point instructions. 
When 
IGNNE# 
is negat- 
ed, the Intel486 
Microprocessor 
will freeze 
on float- 
ing point instructions 
which get errors (except 
for the 
control 
instructions 
FNCLEX, 
FNINIT, 
FNSAVE, 
FNSTENV, 
FNSTCW, 
FNSTSW, 
FNSTSW 
AX, FNE- 


NI, FNDISI 
and FNSETPM). 
IGNNE# 
may be asyn- 
chronous 
to the Intel486 
clock. 


In systems 
with 
user-defined 
error 
reporting, 
the 
FERR# 
pin is connected 
to the interrupt 
controller. 


When 
an unmasked 
floating 
point 
error 
occurs, 
an 
interrupt 
is raised. 
If IGNNE # is high at the time of 
this interrupt, 
the Intel486 
Microprocessor 
will freeze 
(disallowing 
execution 
of a subsequent 
floating 
point 
instruction) 
until the interrupt 
handler 
is invoked. 
By 
driving the IGNNE # pin low (when clearing 
the inter- 
rupt request), 
the interrupt 
handler 
can allow execu- 
tion of a floating 
point instruction, 
within the interrupt 
handler, 
before 
the 
error 
condition 
is cleared 
(by 
FNCLEX, 
FNINIT, 
FNSAVE 
or FNSTENV). 
If execu- 
tion of a non-control 
floating 
point instruction, 
within 
the floating 
point 
interrupt 
handler, 
is not 
needed, 
the IGNNE# 
pin can be tied HIGH. 


7.2.15 
FLOATING 
POINT 
ERROR 
HANDLING 
IN 
AT COMPATIBLE 
SYSTEMS 


The 
Intel486 
OX Microprocessor 
provides 
special 
features 
to allow the implementation 
of an AT com- 
patible numerics 
error reporting 
scheme. 
These fea- 
tures 
DO NOT replace 
the external 
circuit. 
Logic is 
still required 
that 
decodes 
the 
OUT 
FO instruction 
and 
latches 
the 
FERR # 
signal. 
What 
follows 
is a 
description 
of the use of these 
Intel486 
OX Micro- 
processor 
features. 


The features 
provided 
by the Intel486 
OX Microproc- 
essor are the NE bit in the Machine 
Status Register, 
the IGNNE# 
pin, and the FERR# 
pin. 


The 
NE bit determines 
the action 
taken 
by the 
In- 
tel486 
OX Microprocessor 
when a numerics 
error is 
detected. 
When 
set this 
bit signals 
that 
non-DOS 
compatible 
error 
handling 
will 
be implemented. 
In 
this mode 
the Intel486 
OX Microprocessor 
takes 
a 
software 
exception 
(16) if a numerics 
error is detect- 
ed. 


If the NE bit is reset the Intel486 
OX Microprocessor 
uses the IGNNE # pin to allow an external 
circuit to 
control 
the time 
at which 
non-control 
numerics 
in- 
structions 
are allowed 
to execute. 
Note that floating 
point 
control 
instructions 
such 
as 
FNINIT 
and 
FNSAVE 
can be executed 
during a floating 
point er- 
ror condition 
regardless 
of the state of IGNNE #. 
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To process a floating point error in the DOS environ- 
ment the following sequence must take place: 
1. The error is detected by the Intel486 OX Micro- 
processor which activates the FERRII pin. 
2. FERRII is latched so that it can be cleared by the 
OUT FOinstruction. 
3. The latched FERRII signal activates an interrupt 
at the interrupt controller. This interrupt is usually 
handled on IRQ13. 
4. The Interrupt Service Routine (ISR) handles the 
error and then clears the interrupt by executing an 
OUT instruction to port FO.The address FOis de- 
coded externally to clear the FERRII latch. The 
IGNNEII signal is also activated by the decoder 
output. 


5. Usually the ISR then executes an FNINIT instruc- 
tion or other control instruction before restarting 
the program. FNINIT clears the FERRII output. 


Figure 7.31 illustrates the circuit required to perform 
this function. Note that this circuit has not been test- 
ed. It is included as an example of the required error 
handling logic. 


Note that the IGNNEII input allows non-control in- 
structions to be executed prior to the time the 
FERRII signal is reset by the Intel486 OX Micro- 
processor. This function is implemented to allow ex- 
act compatibility with the AT implementation. Most 
programs reinitialize the floating point unit before 
continuing after an error is detected. The floating 
point unit can be reinitialized using one of the follow- 
ing 
four 
instructions: 
FCLEX, 
FINIT, 
FSAVE, 


FSTENV. 
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Testing the Intel486 Microprocessor can be divided 
into three categories: 
Built-In Self Test 
(BIST), 
Boundary Scan, and external testing. BIST performs 
basic device testing on the Intel486 CPU, including 
the non-random logic, control ROM (CROM), trans- 
lation lookaside buffer (TLB), and on-chip cache 
memory. Boundary Scan provides additional test 
hooks that conform to the IEEE Standard Test Ac- 
cess Port and Boundary Scan Architecture (IEEE 
Std.1149.1). The Intel486 Microprocessor also has a 
test mode in which all of its outputs are tristated. 
Additional testing can be performed by using the test 
registers within the Intel486 CPU. 


8.1 
Built-In Self Test (BIST) 


The BIST is initiated by asserting AHOLD (address 
hold) on the falling edge of RESET. AHOLD is a 
synchronous signal only. It should be asserted in the 
clock prior to RESET going from High to Low to start 
BIST. FLUSH/I must also be asserted (driven low) 
prior to the falling edge of RESET to start BIST. 
FLUSH/I must be deasserted (driven high) during 
BIST. A20M /I must be deasserted (driven high) dur- 
ing the falling edge of RESET to start BIST. The 
BIST takes approximately 2"20 
clocks, or approxi- 
mately 42 milliseconds with a 25 MHz Intel486 mi- 
croprocessor. No bus cycles will be run by the In- 
tel486 Microprocessor until the BIST is concluded. 
Note that for the Intel486 Microprocessor the RE- 
SET must be active for 15 clocks with or without 
BIST being enabled for warm resets. 


The results of BIST is stored in the EAX register. 
The 
Intel486 
Microprocessor 
has 
successfully- 
passed the BIST if the contents of the EAX register 
are zero. If the results in EAX are not zero then the 
BIST has detected a flaw in the microprocessor. The 
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microprocessor performs reset and begins normal 
operation at the completion of the BIST. 


The non-random logic, control ROM, on-chip cache 
and translation lookaside buffer (TLB) are tested 
during the BIST. 


The cache portion of the BIST verifies that the 
cache is functional and that it is possible to read and 
write to the cache. The BIST manipulates test regis- 
ters TR3, TR4 and TR5 while testing the cache. 
These test registers are described in Section 8.2. 


The cache testing algorithm writes a value to each 
cache entry, reads the value back, and checks that 
the correct value was read back. The algorithm may 
be repeated more than once for each of the 512 
cache entries using different constants. 


The TLB portion of the BIST verifies that the TLB is 
functional and that it is possible to read and write to 
the TLB. The BIST manipulates test registers TR6 
and TR7 while testing the TLB. TR6 and TR7 are 
described in Section 8.3. 


8.2 
On-Chip Cache Testing 


The on-chip cache testability hooks are designed to 
be accessible during the BIST and for assembly lan- 
guage testing of the cache. 


The Intel486 Microprocessor contains a cache fill 
buffer and a cache read buffer. For testability writes, 
data must be written to the cache fill buffer before it 
can be written to a location in the cache. Data must 
be read from a cache location into the cache read 
buffer before the microprocessor can access the 
data. The cache fill and cache read buffer are both 
128 bits wide. 
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8.2.1 CACHE TESTING REGISTERS TR3, TR4 
AND TR5 


Figure 8.1 shows the three cache testing registers: 
the Cache Data Test Register (TR3), the Cache 
Status Test Register (TR4) and the Cache Control 
Test Register (TR5). External access to these regis- 
ters is provided through MOV reg,TREG and MOV 
TREG, reg instructions. 


The cache fill buffer and the cache read buffer can 
only be accessed through TR3. Data to be written to 
the cache fill buffer must first be written to TR3. Data 
read from the cache read buffer must be loaded into 
TR3. 


TR3 is 32 bits wide while the cache fill and read 
buffers are 128 bits wide. 32 bits of data must be 
written to TR3 four times to fill the cache fill buffer. 
32 bits of data must be read from TR3 four times to 
empty the cache read buffer. The entry select bits in 
TR5 determine which 32 bits of data TR3 will access 
in the buffers. 


TR4 handles tag, LRU and valid bit information dur- 
ing cache tests. TR4 must be loaded with a tag and 
a valid bit before a write to the cache. After a read 
from a cache entry, TR4 contains the tag and valid 
bit from that entry, and the LRU bits and four valid 
bits from the accessed set. 


TR5 specifies which testability operation will be per- 
formed and the set and entry within the set which 
will be accessed. 


The seven bit set select field determines which of 
the 128 sets will be accessed. 


The functionality of the two entry select bits depend 
on the state of the control bits. When the fill or read 
buffers are being accessed, the entry select bits 
point to the 32-bit location in the buffer being ac- 
cessed. When a cache location is specified, the en- 
try select bits point to one of the four entries in a set. 
Refer to Table 8.1. 


Five testability functions can be performed on the 
cache. The two control bits in TR5 specify the oper- 
ation to be executed. The five operations are: 


1. Write cache fill buffer 
2. Perform a cache testability write 
3. Perform a cache testability read 
4. Read the cache read buffer 
5. Perform a cache flush 


Table 8.1 shows the encoding of the two control bits 
in TR5 for the cache testability functions. Table 8.1 
also shows the functionality of the entry and set se- 
lect bits for each control operation. 


The cache tests attempt to use as much of the nor- 
mal operating circuitry as possible. Therefore when 
cache tests are being performed, the cache must be 
disabled (the CD and NW bits in control register 
must be set to 1 to disable the cache. See Section 
5). 


A testability write to the cache is a two step process. 
First the cache fill buffer must be loaded with 128 
bits of data and TR4 loaded with the tag and valid 
bit. Next the contents of the fill buffer are written to a 
cache location. Sample assembly code to do a write 
is given in Figure 8.2. 


Table 8.1. Cache Control Bit Encoding and Effect of 
Control Bits on Entry select and set select Functionality 


Control Bits 
Operation 
Entry Select Bits 
Set Select Bits 
Bit 1 
BltO 
Function 


0 
0 
Enable 
{ 
Fill Buffer Write 
Select 32-bit location in fill/read 
Read Buffer Read 
buffer 
- 


0 
1 
Perform Cache Write 
Select an entry in set. 
Select a set to write to 


1 
0 
Perform Cache Read 
Select an entry in set. 
Select a set to read from 


1 
1 
Perform Flush Cache 
- 
- 
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Sample 
Assembly 
Code 


An example 
assembly 
language 
sequence 
to perform 
a cache write is: 


eax. 
ebx. 
ecx. edx contain 
the cache 
line to write 
edi contains 
the tag 
information 
to load 
eRO already 
says to enable 
reads/write 
to TR5 


fill the cache 
buffer 
mov 
esi,O 
mov tr5,esi 
mov tr3,eax 
mov 
esi,4 
mov tr5,esi 
mov tr3,ebx 
mov 
esi,8 
mov tr5,esi 
mov 
tr3,ecx 
mov esi,Och 
mov tr5,esi 
mov tr3,edx 


set up command 
load to TR5 
load data 
into 
cache 
fill buffer 


mov 
esi,l 
mov tr5,esi 


mov esi,2 
mov tr5,esi 


read the data 
from the read buffer 


mov esi,O 
mov 
tr5,esi 
mov 
eax,tr3 
mov 
esi,4 
mov tr5,esi 
mov 
ebx,tr3 
mov 
esi,8 
mov tr5,esi 
mov 
ecx,tr3 
mov 
esi,Och 
mov tr5,esi 
mov edx,tr3 


read 
the status 
from TR4 


mov 
edi,tr4 
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Loading the fill buffer is accomplished by first writing 
to the entry select bits in TR5 and setting the control 
bits in TR5 to 00. The entry select bits identify one of 
four 32-bit locations in the cache fill buffer to put 32 
bits of data. Following the write to TR5, TR3 is writ- 
ten with 32 bits of data which are immediately 
placed in the cache fill buffer. Writing to TR3 initiates 
the write to the cache fill buffer. The cache fill buffer 
is loaded with 128 bits of data by writing to TR5 and 
TR3 four times using a different entry select location 
each time. 


TR4 must be loaded with the 21-bit tag and valid bit 
(bit 10 in TR4) before the contents of the fill buffer 
are written to a cache location. 


The contents of the cache fill buffer are written to a 
cache location by writing TR5 with a control field of 
01 along with the set select and entry select fields. 
The set select and entry select field indicate the lo- 
cation in the cache to be written. The normal cache 
LRU update circuitry updates the internal LRU bits 
for the selected set. 


Note that a cache testability write can only be done 
when the cache is disabled for replaces (the CD bit 
is control register 0 is reset to 1). Also note that care 
must be taken when directly writing to entries in the 
cache. If the entry is set to overlap an area of mem- 
ory that is being used in external memory, that 
cache entry could inadvertently be used instead of 
the external memory. Of course, this is exactly the 
type of operation that one would desire if the cache 
were to be used as a high speed RAM. 


A cache testability read is a two step process. First 
the contents of the cache location are read into the 
cache read buffer. Next the data is examined by 
reading it out of the read buffer. Sample assembly 
code to do a testability read is given in Figure 8.2. 


Reading the contents of a cache location into the 
cache read buffer is initiated by writing TR5 with the 
control bits set to 10 and the desired seven-bit set 
select and two-bit entry select. In response to the 
write to TR5, TR4 is loaded with the 21-bit tag field 
and the single valid bit from the cache entry read. 
TR4 is also loaded with the three LRU bits and four 
valid bits corresponding to the cache set that was 
accessed. The cache read buffer is filled with the 
128-bit value which was found in the data array at 
the specified location. 


The contents of the read buffer are examined by 
performing four reads of TR3. Before reading TR3 
the entry select bits in TR5 must loaded to indicate 
which of the four 32-bit words in the read buffer to 


transfer into TR3 and the control bits in TR5 must be 
loaded with 00. The register read of TR3 will initiate 
the transfer of the 32-bit value from the read buffer 
to the specified general purpose register. 


Note that it is very important that the entire 128-bit 
quantity from the read buffer and also the informa- 
tion from TR4 be read before any memory refer- 
ences are allowed to occur. If memory operations 
are allowed to happen, the contents of the read buff- 
er will be corrupted. This is because the testability 
operations use hardware that is used in normal 
memory accesses for the Intel486 microprocessor 
whether the cache is enabled or not. 


The control bits in TR5 must be written with 11 to 
flush the cache. None of the other bits in TR5 have 
any meaning when 11 is written to the control bits. 
Flushing the cache will reset the LRU bits and the 
valid bits to 0, but will not change the cache tag or 
data arrays. 


When the cache is flushed by writing to TR5 the 
special bus cycle indicating a cache flush to the ex- 
ternal system is not run (see Section 7.2.11, Special 
Bus Cycles). The cache should be flushed with the 
instruction INVD (Invalidate Data Cache) instruction 
or the WBINVD (Write-back and Invalidate Data 
Cache) instruction. 


8.3 
Translation 
Lookaside 
Buffer 
(TLB) Testing 


The Intel486 Microprocessor TLB testability hooks 
are similar to those in the 386 Microprocessor. The 
testability hooks have been enhanced to provide 
added test features and to include new features in 
the Intel486 Microprocessor. The TLB testability 
hooks are designed to be accessible during the 
BIST and for assembly language testing of the TLB. 


8.3.1 
TRANSLATION 
LOOKASIDE 
BUFFER 
ORGANIZATION 


The Intel486 Microprocessors TLB is 4-way set as- 
sociative and has space for 32 entries. The TLB is 
logically split into three blocks shown in Figure 8.3. 


The data block is physically split into four arrays, 
each with space for eight entries. An entry in the 
data block is 22 bits wide containing a 20-bit physi- 
cal address and two bits for the page attributes. The 
page attributes are the PCD (page cache disable) bit 
and the PWT (page write-through) bit. Refer to Sec- 
tion 4.5.4 for a discussion of the PCD and PWT bits. 
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The tag block is also split into four arrays, one for 
each of the data arrays. A tag entry is 21 bits wide 
containing a 17-bit linear address and four protec- 
tion bits. The protection bits are valid M, user/su- 
pervisor (U/S), read/write (R/W) and dirty (0). 


TLB is the same as used by the on-chip cache. For a 
description of this algorithm refer to Section 5.5. 


The third block contains eight three bit quantities 
used in the pseudo least recently used (LRU) re- 
placement algorithm. These bits are called the LRU 
bits. The LRU replacement algorithm used in the 


The two TLB test registers are shown in Figure 8.4. 
TR6 is the command test register and TR7 is the 
data test register. External access to these registers 
is provided through 
MOV reg,TREG and 
MOV 


TREG,reg instructions. 
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TR6 contains the tag information and control infor- 
mation used in a TLB test. Loading TR6 with tag and 
control information initiates a TLB write or lookup 
test. 


TR6 contains three bit fields, a 20-bit linear address 
(bits 12-31), seven bits for the TLB tag protection 
bits (bits 5-11) and one bit (bit 0) to define the type 
of operation to be performed on the TLB. 


The 20-bit linear address forms the tag information 
used in the TLB access. The lower three bits of the 
linear address select which of the eight sets are ac- 
cessed. The upper 17 bits of the linear address form 
the tag stored in the tag array. 


The seven TLB tag protection bits are described be- 
low. 
V: 
0,0#: 
U,U#: 


The valid bit for this TLB entry 
The dirty bit for/from the TLB entry 
The user/supervisor bit for/from the TLB 
entry 
W,W#: 
The read/write bit for/from the TLB entry 


Two bits are used to represent the 0, U/S and R/W 
bits in the TLB tag to permit the option of a forced 
miss or hit during a TLB lookup operation. The 
forced miss or hit will occur regardless of the state 
of the actual bit in the TLB. The meaning of these 
pairs of bits is given in Table 8.2. 


The operation bit in TR6 determines if the TLB test 
operation will be a write or a lookup. The function of 
the operation bit is given in Table 8.3. 


TR6 
TLB Operation 
Bit 0 
to Be Performed 


0 
TLB Write 
1 
TLB Lookup 


TR7 contains the information stored or read from the 
data block during a TLB test operation. Before a TLB 


test write, TR7 contains the physical address and 
the page attribute bits to be stored in the entry. After 
a TLB test lookup hit, TR7 contains the physical ad- 
dress, page attributes, LRU bits and entry location 
from the access. 


TR7 contains a 20-bit physical address (bits 12-31), 
two bits for PCO (bit 11) and PWT (bit 10) and three 
bits for the LRU bits (bits 7-9). The LRU bits in TR7 
are only used during a TLB lookup test. The func- 
tionality of TR7 bit 4 differs for TLB writes and look- 
ups. The encoding of bit 4 is defined in Tables 8.4 
and 8.5. Finally TR7 contains two bits (bits 2-3) to 
specify a TLB replacement pointer or the location of 
a TLB hit. 


TR7 
Replacement 
Pointer 
Blt4 
Used on TLB Write 


0 
Pseudo-LRU Replacement Pointer 
1 
Data Test Register Bits 3:2 


TR7 
Meaning 
after TLB 
Bit 4 
Lookup Operation 


0 
TLB Lookup Resulted in a Miss 
1 
TLB Lookup Resulted in a Hit 


A replacement pointer is used during a TLB write. 
The pointer indicates which of the four entries in an 
accessed set is to be written. The replacement 
pointer can be specified to be the internal LRU bits 
or bits 2-3 in TR7. The source of the replacement 
pointer is specified by TR7 bit 4. The encoding of bit 
4 during a write is given by Table 8.4. 


Note that both testability writes and lookups affect 
the state of the internal LRU bits regardless of the 
replacement pointer used. All TLB write operations 
(testability or normal operation) cause the written 
entry to.become the most recently used. For exam- 
ple, during a testability write with the replacement 
pointer specified by TR7 bits 2-3, the indicated en- 
try is written and that entry becomes the most re- 
cently used as specified by the internal LRU bits. 


TR6 Protection 
Bit 
TR6 Protection 
Bit # 
Meaning 
on 
Meaning 
on 
(B) 
(B#) 
TLB Write Operation 
TLB Lookup Operation 


0 
0 
Undefined 
Miss any TLB TAG Bit B 
0 
1 
Write 0 to TLB TAG Bit B 
Match TLB TAG Bit B if 0 
1 
0 
Write 1 to TLB TAG Bit B 
Match TLB TAG Bit B if 1 
1 
1 
Undefined 
Match any TLB TAG Bit B 


•• 


There are two TLB testing operations: write entries 
into the TLB, and perform TLB lookups. One major 
enhancement over TLB testing in the 386 Micro- 
processor is that paging need not be disabled while 
executing testability writes or lookups. 


Note that any time one TLB set contains the same 
linear address in more than one of its entries, look- 
ing up that linear address will give unpredictable re- 
sults. Therefore a single linear address should not 
be written to one TLB set more than once. 


8.3.3 TLB WRITE TEST 


To perform a TLB write TR7 must be loaded fol- 
lowed by a TR6 load. The register operations must 
be performed in this order since the TLB operation is 
triggered by the write to TR6. 


TR7 is loaded with a 20-bit physical address and 
values for PCD and PWT to be written to the data 
portion of the TLB. In addition, bit 4 of TR7 must be 
loaded to indicate whether to use TR7 bits 3-2 or the 
internal LRU bits as the replacement pointer on the 
TLB write operation. Note that the LRU bits in TR7 
are not used in a write test. 


TR6 must be written to initiate the TLB write opera- 
tion. Bit 0 in TR6 must be reset to zero to indicate a 
TLB write. The 20-bit linear address and the seven 
page protection bits must also be written in TR6 to 
specify the tag portion of the TLB entry. Note that 
the three least significant bits of the linear address 
specify which of the eight sets in the data block will 
be loaded with the physical address data. Thus only 
17 of the linear address bits are stored in the tag 
array. 


To perform a TLB lookup it is only necessary to write 
the proper tags and control information into TR6. Bit 
o in TR6 must be set to 1 to indicate a TLB lookup. 
TR6 must be loaded with a 20-bit linear address and 
the seven protection bits. To force misses and 
matches of the individual protection bits on TLB 
lookups, set the seven protection bits as specified in 
Table 8.2. 


A TLB lookup operation is initiated by the write to 
TR6. TR7 will indicate the result of the lookup opera- 
tion following the write to TR6. The hit/miss indica- 
tion can be found in TR7 bit 4 (see Table 8.5). 


TR7 will contain the following information if bit 4 indi- 
cated that the lookup test resulted in a hit. Bits 2-3 
will indicate in which set the match occurred. The 22 
most significant bits in TR7 will contain the physical 
address and page attributes contained in the entry. 


Bits 9-7 will contain the LRU bits associated with 
the accessed set. The state of the LRU bits is previ- 
ous to their being updated for the current lookup. 


If bit 4 in TR7 indicated that the lookup test resulted 
in a miss the remaining bits in TR7 are undefined. 


Again it should be noted that a TLB testability lookup 
operation affects the state of the LRU bits. The LRU 
bits will be updated if a hit occurred. The entry which 
was hit will become the most recently used. 


The Intel486 Microprocessor provides the ability to 
float all its outputs and bidirectional pins. This in- 
cludes all pins floated during bus hold as well as 
pins which are never floated in normal operation of 
the chip (HLDA, BREQ, FERR# 
and PCHK#). 
When the Intel486 microprocessor is in the tri-state 
output test mode external testing can be used to test 
board connections. 


The 
tri-state 
test 
mode 
is 
invoked 
by 
driving 
FLUSH# low for 2 clocks before and 2 clocks after 
RESET going low. The outputs are guaranteed to tri- 
state no later than 10 clocks after RESET goes low 
(see Figure 6.4). The Intel486 Microprocessor re- 
mains in the tristate test mode until the next RESET. 


8.5 
Intel486™ 
Microprocessor 
Boundary Scan (JTAG) 


The Intel486 Microprocessor (50 MHz version only) 
provides additional testability features compatible 
with the IEEE Standard Test Access 
Port and 
Boundary Scan Architecture (IEEE Std.1149.1). The 
test logic provided allows for testing to insure that 
components function correctly, that interconnec- 
tions between various components are correct, and 
that various components interact correctly on the 
printed circuit board. 


The boundary scan test logic consists of a boundary 
scan register and support logic that are accessed 
through a test access port (TAP). The TAP provides 
a simple serial interface that makes it possible to 
test all signal traces with only a few probes. 


The TAP can be controlled via a bus master. The 
bus master can be either automatic test equipment 
or a component (PLD) that interfaces to the four-pin 
test bus. 


The boundary scan test logic contains the following 
elements: 
- 
Test access port (TAP), consisting of input pins 
TMS, TCK, and TOI; and output pin TOO. 
- 
TAP controller, which interprets the inputs on the 
test mode select (TMS) line and performs the 
corresponding operation. The operations per- 
formed by the TAP include controlling the in- 
struction and data registers within the compo- 
nent. 


- 
Instruction register (IR), which accepts instruc- 
tion codes shifted into the test logic on the test 
data input (TOI) pin. The instruction codes are 
used to select the specific test operation to be 
performed or the test data register to be ac- 
cessed. 


- 
Test data registers: The Intel486 Microprocessor 
contains three test data registers: Bypass regis- 
ter (BPR), Device Identification register (DID), 
and Boundary Scan register (BSR). 


The instruction and test data registers are separate 
shift-register paths connected in parallel and have a 
common serial data input and a common serial data 
output connected to the TAP signals, TOI and TOO, 
respectively. 


The Intel486 CPU contains the two required test 
data registers; bypass register and boundary scan 
register. In addition, they also have a device identifi- 
cation register. 


Each test data register is serially connected to TOI 
and TOO, with TOI connected to the most significant 
bit and TOO connected to the least significant bit of 
the test data register. Data is shifted one stage (bit 
position within the register) on each rising edge of 
the test clock (TCK). 


In addition the Intel486 CPU contains a runbist regis- 
ter to support the RUNBIST boundary scan instruc- 
tion. 


8.5.2.1 
Bypass 
Register 


The Bypass Register is a one-bit shift register that 
provides the minimal length,path between TOI and 
TOO. This path can be selected when no test opera- 
tion is being performed by the component to allow 
rapid movement of test data to and from other com- 
ponents on the board. While the bypass register is 
selected, data is transferred from TOI to TOO with- 
out inversion. 


8.5.2.2 
Boundary 
Scan Register 


The Boundary Scan Register is a single shift register 
path containing the boundary scan cells that are 
connected to all input and output pins of the Intel486 
CPU. Figure 8.5 shows the logical structure of the 
boundary scan register. While output cells determine 
the value of the signal driven on the corresponding 
pin, input cells only capture data; they do not affect 
the normal operation of the device. Data is trans- 
ferred without inversion from TOI to TOO through the 
boundary scan register during scanning. The bound- 
ary scan register can be operated by the EXTEST 
and SAMPLE instructions. The boundary scan regis- 
ter order is described in Section 8.5.5. 
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Instruction 
Code 
Instruction 
Name 


0000 
EXTEST 


0001 
SAMPLE 


0010 
IDCODE 


0011 
PRIVATE 


0100 
PRIVATE 


0101 
PRIVATE 


0110 
PRIVATE 


0111 
PRIVATE 


1000 
RUNBIST 


1001 


> 


PRIVATE 


1010 
PRIVATE 


1011 
PRIVATE 


1100 
PRIVATE 


1101 
PRIVATE 


1110 
PRIVATE 


1111 
BYPASS 


EXTEST 
The instruction code is "0000". The EX- 
TEST instruction allows testing of cir- 
cuitry external to the component pack- 
age, typically board interconnects. It 
does so by driving the values loaded 
into the Intel486 CPU's boundary scan 
register out on the output pins corre- 
sponding to each boundary scan cell 
and capturing the values on Intel486 
CPU input pins to be loaded into their 
corresponding boundary scan register 
locations. I/O pins are selected as input 
or output, depending on the value load- 
ed into their control setting locations in 
the 
boundary 
scan 
register. 
Values 
shifted into input latches in the bounda- 
ry scan register are never used by the 
internal logic of the Intel486 CPU. 


NOTE: 
After using the EXTEST instruction, the 
Intel486 CPU must be reset before nor- 
mal (non-boundary scan) use. 


SAMPLE/ 
The instruction code is "0001". 
The 
PRELOAD SAMPLE/PRELOAD has two functions 
that it performs. When the TAP control- 
ler is in the Capture-DR state, the SAM- 
PLE/PRELOAD 
instruction 
allows 
a 


"snap-shot" of the normal operation of 


the component without interfering with 
that normal operation. The instruction 
causes boundary scan register cells as- 
sociated with outputs to sample the val- 
ue being driven by the Intel486 CPU. It 
causes the cells associated with inputs 
to sample the value being driven into 
the Intel486 CPU. On both outputs and 
inputs the sampling occurs on the rising 
edge of TCK. When the TAP controller 
is in the Update-DR state, the SAM- 
PLE/PRELOAD 
instruction 
preloads 


data to the device pins to be driven to 
the board by executing the EXTEST in- 
struction. Data is preloaded to the pins 
from the boundary scan register on the 
falling edge of TCK. 
• 


IDCODE 
The instruction code is "0010". The ID- 
CODE instruction selects the device 
identification register to be connected 
to TDI and TDO, allowing the device 
identification code to be shifted out of 
the device on TDO. Note that the de- 
vice identification register is not altered 
by data being shifted in on TD!. 


BYPASS 
The instruction code is "1111". 
The 


BYPASS instruction selects the bypass 
register to be connected to TDI or TDO, 
effectively bypassing the test logic on 
the Intel486 microprocessor by reduc- 
ing the shift length of the device to one 
bit. Note than an open circuit fault in the 
board level test data path will cause the 
bypass register to be selected following 
an instruction scan cycle due to the 
pull-up resistor on the TDI input. This 
has been done to prevent any unwant- 
ed interference with the proper opera- 
tion of the system logic. 
RUNBIST 
The instruction code is "1000". 
The 


RUNBIST instruction selects the one (1) 
bit runbist register, loads a value of "1" 
into the runbist register, and connects it 
to TDO. It also initiates the built-in self 
test (BIST) feature of the Intel486 CPU, 
which is able to detect approximately 
60% of the stuck-at faults on the In- 
tel486 CPU. The Intel486 CPU AC/DC 
Specifications for Vcc and CLK must be 
met and reset must have been asserted 
at least once prior to executing the 
RUNBIST boundary scan instruction. 
After loading the RUNBIST instruction 
code in the instruction register, the TAP 
controller must be placed in the Run- 
Test/Idle state. BIST begins on the first 
rising edge of TCK after entering the 
Run-Test/Idle 
state. 
The 
TAP 
con- 


troller must remain in the Run-Test/Idle 
state until BIST is completed. It requires 
1.2 million clock (ClK) cycles to com- 
plete BIST and report the result to the 
runbist register. After completing the 1.2 
million clock (ClK) cycles, the value in 
the runbist register should be shifted 
out on TOO during the Shift-DR state. A 
value of "0" being shifted out on TOO 
indicates BIST successfully completed. 
A value of "1" 
indicates a failure oc- 
curred. After executing the RUNBIST in- 
struction, the Intel486 CPU must be re- 
set prior to normal operation. 


8.5.4 
TEST 
ACCESS 
PORT (TAP) 
CONTROLLER 


The TAP controller is a synchronous, finite state ma- 
chine. It controls the sequence of operations of the 
test logic. The TAP controller changes state only in 
response to the following events: 
1. a rising edge of TCK 
2. power-up. 


intel~ 


The value of the test mode state (TMS) input signal 
at a rising edge of TCK controls the sequence of the 
state changes. The state diagram for the TAP con- 
troller is shown in Figure 8.7. Test designers must 
consider the operation of the state machine in order 
to design the correct sequence of values to drive on 
TMS. 


8.5.4.1 
Test-Logic-Reset 
State 


In this state, the test logic is disabled so that normal 
operation of the device can continue unhindered. 
This is achieved by initializing the instruction register 
such that the IOCOOEinstruction is loaded. No mat- 
ter what the original state of the controller, the con- 
troller enters Test-logic-Reset state when the TMS 
input is held high (1) for at least five rising edges of 
TCK. The controller remains in this state while TMS 
is high. The TAP controller is also forced to enter 
this state at power-up. 


A controller state between scan operations. Once in 
this state, the controller remains in this state as long 


as TMS is held low. In devices supporting the 
RUNBIST instruction, the BIST is performed during 
this state and the result is reported in the runbist 
register. For instruction not causing functions to exe- 
cute during this state, no activity occurs in the test 
logic. The instruction register and all test data regis- 
ters retain their previous state. When TMS is high 
and a rising edge is applied to TCK, the controller 
moves to the Select-DR state. 


This is a temporary controller state. The test data 
register selected by the current instruction retains its 
previous state. If TMS is held low and a rising edge 
is applied to TCK when in this state, the controller 
moves into the Capture-DR state, and a scan se- 
quence for the selected test data register is initiated. 
If TMS is held high and a rising edge is applied to 
TCK, the controller moves to the Select-IR-Scan 
state. 


8.5.4.4 Capture-DR State 


In this state, the boundary scan register captures 
input pin data if the current instruction is EXTEST or 
SAMPLE/PRELOAD. The other test data registers, 
which do not have parallel input, are not changed. 


When the TAP controller is in this state and a rising 
edge is applied to TCK, the controller enters the 
Exit1-DR state if TMS is high or the Shift-DR state if 
TMS is low. 


In this controller state, the test data register con- 
nected between TDI and TOO as a result of the cur- 
rent instruction, shifts data one stage toward its seri- 
al output on each rising edge of TCK. 


When the TAP controller is in this state and a rising 
edge is applied to TCK, the controller enters the 
Exit1-DR state if TMS is high or remains in the Shift- 
DR state if TMS is low. 


This is a temporary state. While in this state, if TMS 
is held high, a rising edge applied to TCK causes the 
controller to enter the Update-DR state, which termi- 


nates the scanning process. If TMS is held low and a 
rising edge is applied to TCK, the controller enters 
the Pause-DR state. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


The pause state allows the test controller to tempo- 
rarily halt the shifting of data through the test data 
register in the serial path between TDI and TOO. An 
example of using this state could be to allow a tester 
to reload its pin memory from disk during application 
of a long test sequence. 


The test data register selected by the current in- • 
struction retains its previous value during this state. 
The instruction does not change in this state. 


The controller remains in this state as long as TMS 
is low. When TMS goes high and a rising edge is 
applied to TCK, the controller moves to the Exit2-DR 
state. 


8.5.4.8 Exit2-0R State 


This is a temporary state. While in this state, if TMS 
is held high, a rising edge applied to TCK causes the 
controller to enter the Update-DR state, which termi- 
nates the scanning process. If TMS is held low and a 
rising edge is applied to TCK, the controller enters 
the Shift-DR state. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


8.5.4.9 Update-DR State 


The boundary scan register is provided with a 
latched parallel output to prevent changes at the 
parallel output while data is shifted in response to 
the EXTEST and SAMPLE/PRELOAD instructions. 
When the TAP controller is in this state and the 
boundary scan register is selected, data is latched 
onto the parallel output of this register from the shift- 
register path on the falling edge of TCK. The data 
held at the latched parallel output does not change 
other than in this state. 


All shift-register stages in test data register selected 
by the current instruction retains its previous value 
during this state. The instruction does not change in 
this state. 
. 


This is a temporary controller state. The test data 
register selected by the current instruction retains its 
previous state. If TMS is held low and a rising edge 
is applied to TCK when in this state, the controller 
moves into the Capture-IR state, and a scan se- 
quence for the instruction register is initiated. If TMS 
is held high and a rising edge is applied to TCK, the 
controller moves to the Test-Logic-Reset state. 


8.5.4.11 
Capture-IR 
State 


In this controller state the shift register contained in 
the instruction register loads the fixed value "0001" 
on the rising edge of TCK. 


The test data register selected by the current in- 
struction retains it previous value during this s~ate. 
The instruction does not change in this state. 


When the controller is in this state and a rising edge 
is applied to TCK, the controller enters the Exit1-IR 
state if TMS is held high, or the Shift-IR state if TMS 
is held low. 


In this state the shift register contained in the in- 
struction register is connected between TOI and 
TOO and shifts data one stage towards its serial out- 
put on each rising edge of TCK. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


When the controller is in this state and a rising edge 
is applied to TCK, the controller enters the Exit1-IR 
state if TMS is held high, or remains in the Shift-IR 
state if TMS is held low. 


This is a temporary state. While in this state, if TMS 
is held high, a rising edge applied to TCK causes the 
controller to enter the Update-IR state, which termi- 
nates the scanning process. If TMS is held low and a 


rising edge is applied to TCK, the controller enters 
the Pause-IR state. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


The pause state allows the test controller to tempo- 
rarily halt the shifting of data through the instruction 
register. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


The controller remains in this state as long as TMS 
is low. When TMS goes high and a rising edge is 
applied to TCK, the controller moves to the Exit2-IR 
state. 


This is a temporary state. While in this state, if TMS 
is held high, a rising edge applied to TCK causes the 
controller to enter the Update-IR state, which termi- 
nates the scanning process. If TMS is held low and a 
rising edge is applied to TCK, the controller enters 
the Shift-IR state. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


The instruction shifted into the instruction register is 
latched onto the parallel output from the shift-regis- 
ter path on the falling edge of TCK. Once the new 
instruction has been latched, it becomes the current 
instruction. 


Test data registers selected by the current instruc- 
tion retain the previous value. 


The boundary scan register contains a cell for each 
pin, as well as cells for control of I/O and tristate 
pins. 
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The following is the bit order of the Intel486 CPU 
boundary scan register: (from left to right and top to 
bottom). 


TOI - 
WRCTL 
ABUSCTL 
BUSCTL 
MISCCTL 
AOS# 
BLAST# 
PLOCK# 
LOCK# 
PCHK# 
BROY# BOFF# BS16# BS8# ROY# KEN# 
HOLO AHOLO CLK HLOA WR# BREQ BEO# 
BE1# 
BEU 
BE3# 
MIO# 
OC# 
PWT PCO 
EAOS# A20M# 
RESET FLUSH# INTR NMI 
FERR# IGNNE# 031 030029028027026 
025024 
OP3 023 022 021 020019018017 
016 OP2 015 014 013 012 011 0100908 
OP1 07 06 05 04 03 02 01 00 OPOA31 A30 
A29 A28 A27 A26 A25 A24 A23 A22 A21 A20 
A19 A18 A17 A16 A15 A14 A13 A12 A11 A10 
A9 
A8 
A7 
A6 
RESERVEO 
A5 
A4 
A3 
A2-TOO 


"RESERVEO" corresponds to no connect "NC" sig- 
nals on the Intel486 CPU. 


All the ·CTL cells are control cells that are used to 
select the direction of bidirectional pins or tristate 
output pins. If "1" 
is loaded into the control cell 


(·CTL), the associated pin(s) are tristated or select· 
ed as input. The following lists the control cells and 
their corresponding pins. 


1. WRCTL controls the 031-0 
and OP3-0 pins. 


2. ABUSCTL controls the A31-A2 pins. 
3. BUSCTL 
controls 
the 
AOS#, 
BLAST#, 


PLOCK#, LOCK#, WR#, BEO#, BE1#, BE2#, 
BE3#, MIO#, OC#, PWT, and PCO pins. 
4. MISCCTL controls the PCHK#, HLOA, BREQ, 


and FERR# pins. 


The TAP controller is automatically initialized when a 
device is powered up. In addition, the TAP controller 
can be initialized by applying a high signal level on 
the TMS input for five TCK periods. 


8.5.7 
BOUNDARY 
SCAN 
DESCRIPTION 
LANGUAGE 
(BSDL) 
• 


The Intel486 Microprocessor provides several fea- 
tures which simplify the debugging process. The 
three categories of on-chip debugging aids are: 
1) the code execution breakpoint opcode (OCCH), 
2) the single-step capability provided by the TF bit 
in the flag register, and 
3) the code and data breakpoint capability provided 
by the Debug Registers DRO-3, DR6, and DR? 


A single-byte-opcode breakpoint instruction is avail- 
able for use by software debuggers. The breakpoint 
opcode is OCCH,and generates an exception 3 trap 
when executed. In typical use, a debugger program 
can "plant" the breakpoint instruction at all desired 
code execution breakpoints. The single-byte break- 
point opcode is an alias for the two-byte general 
software interrupt instruction, INT n, where n=3. 
The only difference between INT 3 (OCCh)and INT n 
'is that INT 3 is never IOPL-sensitive but INT n is 
IOPL-sensitive in Protected Mode and Virtual 8086 
Mode. 


9.2 
Single-Step Trap 


If the single-step flag (TF, bit 8) in the EFLAG regis- 
ter is found to be set at the end of an instruction, a 
single-step exception occurs. The single-step ex- 
ception is auto vectored to exception number 1. Pre- 
cisely, exception 1 occurs as a trap after the instruc- 
tion following the instruction which set TF. In typical 
practice, a debugger sets the TF bit of a flag register 
image on the debugger's stack. It then typically 
transfers control to the user program and loads the 
flag image with a signal instruction, the IRET instruc- 
tion. The single-step trap occurs after executing one 
instruction of the user program. 


Since the exception 1 occurs as a trap (that is, it 
occurs after the instruction has already executed), 
the CS:EIP pushed onto the debugger's stack points 
to the next unexecuted instruction of the program 
being debugged. An exception 1 handler, merely by 
ending with an IRET instruction, can therefore effi- 
ciently support single-stepping through a user pro- 
gram. 


The Debug Registers are an advanced debugging 
feature of the Intel486 Microprocessor. They allow 
data access breakpoints as well as code execution 
breakpoints. Since the breakpoints are indicated by 


on-chip registers, an instruction execution break- 
point can be placed in ROM code or in code shared 
by several tasks, neither of which can be supported 
by the INT3 breakpoint opcode. 


The Intel486 Microprocessor contains six Debug 
Registers, providing the ability to specify up to four 
distinct breakpoints addresses, breakpoint control 
options, and read breakpoint status. Initially after re- 
set, breakpoints are in the disabled state. Therefore, 
no breakpoints will occur unless the debug registers 
are programmed. Breakpoints set up in the Debug 
Registers are autovectored to exception number 1. 


9.3.1 
LINEAR 
ADDRESS 
BREAKPOINT 
REGISTERS 
(DRO-DR3) 


Up to four breakpoint addresses can be specified by 
writing into Debug Registers DRO-DR3, shown in 
Figure 9.1. The breakpoint addresses specified are 
32-bit linear addresses. Intel486 Microprocessor 
hardware continuously compares the linear break- 
point addresses in DRO-DR3 with the linear ad- 
dresses generated by executing software (a linear 
address is the result of computing the effective ad- 
dress and adding the 32-bit segment base address). 
Note that if paging is not enabled the linear address 
equals the physical address. If paging is enabled, 
the linear address is translated to a physical 32-bit 
address by the on-chip paging unit. Regardless of 
whether paging is enabled or not, however, the 
breakpoint registers hold linear addresses. 


A Debug Control Register, DR? shown in Figure 9.1, 
allows several debug control functions such as en- 
abling the breakpoints and setting up other control 
options for the breakpoints. The fields within the De- 
bug Control Register, DR?, are as follows: 


A 2-bit LEN field exists for each of the four break- 
points. LEN specifies the length of the associated 
breakpoint field. The choices for data breakpoints 
are: 1 byte, 2 bytes, and 4 bytes. Instruction execu- 
tion breakpoints must have a length of 1 (LENi 
00). Encoding of the LENi field is as follows: 


BREAKPOINT 
0 LINEAR ADDRESS 


BREAKPOINT 
1 LINEAR ADDRESS 


BREAKPOINT 
2 LINEAR ADDRESS 


BREAKPOINT 
3 LINEAR ADDRESS 


Intel reserved. 
Do not define. 


Intel reserved. 
Do not define. 


0 
B 
B B 00 o 0 o 0 o 0 o B B B B 
T 
S D 
3 
210 


LEN I~ IW I LEN I~ IW I LEN I RIW I LEN I~ I W 
0 o G o 0 OG 
L G L G L G 
L G 
L 
333222111000 
D 
E E 3 
3 
2 
2 
1 
100 


31 
16 15 


NOTE: 
o indicates Intel reserved: Do not define; SEE SECTION 2.3.10 


DRO 


DR1 


DR2 


DR3 


DR4 


DR5 


Usage of Least 
LENI 
Breakpoint 
Significant 
Bits In 
Encoding 
Field Width 
Breakpoint 
Address 
Register 
I, (I=0 - 3) 


00 
1 byte 
All 32-bits used to 
specify a single-byte 
breakpoint field. 


01 
2 bytes 
A 1- A31 used to specify 
a two-byte, word- 
aligned breakpoint field. 
AO in Breakpoint 
Address Register is not 
used. 


10 
Undefined- 
do not use 
this encoding 


11 
4 bytes 
A2-A31 
used to specify 
a four-byte, dword- 
aligned breakpoint field. 
AO and A1 in 
Breakpoint Address 
Register are not used. 


The LENi field controls 
the size of breakpoint 
field i 
by controlling 
whether 
all low-order 
linear 
address 
bits in the breakpoint 
address 
register 
are used to 
detect 
the 
breakpoint 
event. 
Therefore, 
all break- 
point fields are aligned; 
2-byte 
breakpoint 
fields be- 
gin 
on 
Word 
boundaries, 
and 
4-byte 
breakpoint 
fields begin on Dword boundaries. 


The following 
is an example 
of various 
size break- 
point fields. Assume 
the breakpoint 
linear address 
in 
DR2 
is 00000005H. 
In that 
situation, 
the following 
illustration 
indicates 
the 
region 
of 
the 
breakpoint 
field for lengths 
of 1, 2, or 4 bytes. 


DR2 = 00000005H; 
LEN2 = OOB 


I" 1 
Ib~tfld21 
0 I::::: 


DR2=00000005H; 
LEN2 = 01B 


31 
I 
0 


I 
00000008H 


00000004H 


OOOOOOOOH 
+- bkpt fld2 -+ 
I 


DR2 = 00000005H; 
LEN2 = 11B 


31 
I 
I 
I 
0 


I 
00000008H 


00000004H 


OOOOOOOOH 


+- 
bkpt fld2 
-+ 
I 
I 


RWi (memory 
access 
qualifier 
bits) 


A 2-bit RW field 
exists 
for each 
of the four 
break- 
points. The 2-bit RW field specifies 
the type of usage 
which must occur in order to activate 
the associated 
breakpoint. 


• 


RW 
Usage 


Encoding 
Causing Breakpoint 


00 
Instruction execution only 
01 
Data writes only 
10 
Undefined--do not use this encoding 


11 
Data reads and writes only 


RW encoding 00 is used to set up an instruction 
execution breakpoint. RW encodings 01 or 11 are 
used to set up write-only or read/write data break- 
points. 


Note that Instruction 
execution 
breakpoints 
are 


taken 
as- faults 
(Le., before the instruction exe- 


cutes), but data 
breakpoints 
are taken 
as traps 


(Le., after the data transfer takes place). 


Using LENi and RWi to Set Data Breakpoint i 


A data breakpoint can be set up by writing the linear 
address into DRi (i = 0-3). For data breakpoints, 
RWi can = 01 (write-only) or 11 (write/read). LEN 
can = 00,01, or 11. 


If a data access entirely or partly falls within the data 
breakpoint field, the data breakpoint condition has 
occurred, and if the breakpoint is enabled, an excep- 
tion 1 trap will occur. 


Using LENi and RWi to Set Instruction Execution 
Breakpoint i 


An instruction execution breakpoint can be set up by 
writing address of the beginning of the instruction 
(including prefixes if any) into DRi (i = 0-3). 
RWi 


must = 00 and LEN must = 00 for instruction exe- 
cution breakpoints. 


If the instruction beginning at the breakpoint address 
is about to be executed, the instruction execution 
breakpoint condition has occurred, and if the break- 
point is enabled, an exception 1 fault will occur be- 
fore the instruction is executed. 


Note that an instruction execution breakpoint ad- 
dress must be equal to the beginning 
byte address 


of an instruction (including prefixes) in order for the 
instruction execution breakpoint to occur. 


The Debug Registers can only be accessed in Real 
Mode or at privilege level 0 in Protected Mode. The 
GO bit, when set, provides extra protection against 
any Debug Register access even in Real Mode or at 
privilege level 0 in Protected Mode. This additional 
protection feature is provided to guarantee that a 
software debugger can have full control over the De- 


bug Register resources when required. The GO bit, 
when set, causes an exception 1 fault if an instruc- 
tion attempts to read or write any Debug Register. 
The GO bit is then automatically cleared when the 
exception 1 handler is invoked, allowing the excep- 
tion 1 handler free access to the debug registers. 


GE and LE (Exact data breakpoint match, global and 
local) 


The breakpoint mechanism of the Intel486 Micro- 
processor differs from that of the 386. The Intel486 
Microprocessor always does exact data breakpoint 
matching, regardless of GE/LE bit settings. Any data 
breakpoint trap will be reported exactly after comple- 
tion of the instruction that caused the operand trans- 
fer. Exact reporting is provided by forcing the In- 
tel486 Microprocessor execution unit to wait for 
completion of data operand transfers before begin- 
ning execution of the next instruction. 


When the Intel486 Microprocessor performs a task 
switch, the LE bit is cleared. Thus, the LE bit sup- 
ports fast task switching out of tasks, that have 
enabled the exact data breakpoint match for their 
task-local breakpoints. The LE bit is cleared by the 
processor during a task switch, to avoid having ex- 
act data breakpoint match enabled in the new task. 
Note that exact data breakpoint match must be re- 
enabled under software control. 


The Intel486 Microprocessor GE bit is unaffected 
during a task switch. The GE bit supports exact data 
breakpoint match that is to remain enabled during all 
tasks executing in the system. 


Note that Instruction 
execution 
breakpoints are al- 


ways reported exactly. 


If either Gi or Li is set then the associated breakpoint 
(as defined by the linear address in DRi, the length 
in LENi and the usage criteria in RWi) is enabled. If 
either Gi or Li is set, and the Intel486 Microproces- 
sor detects the ith breakpoint condition, then the ex- 
ception 1 handler is invoked. 


When the Intel486 Microprocessor performs a task 
switch to a new Task State Segment (TSS), all Li 
bits are cleared. Thus, the Li bits support fast task 
switching out of tasks that use some task-local 
breakpoint registers. The Li bits are cleared by the 
processor during a task switch, to avoid spurious ex- 
ceptions in the new task. Note that the breakpoints 
must be re-enabled under software control. 


All Intel486 Microprocessor Gi bits are unaffected 
during a task switch. The Gi bits support breakpoints 
that are active in all tasks executing in the system. 


9.3.3 
DEBUG 
STATUS 
REGISTER 
(DR6) 


A Debug Status Register, DR6 shown in Figure 9.1, 
allows the exception 1 handler to easily determine 
why it was invoked. Note the exception 1 handler 
can be invoked as a result of one of several events: 
1) DROBreakpoint fault/trap. 
2) DR1 Breakpoint fault/trap. 
3) DR2 Breakpoint fault/trap. 
4) DR3 Breakpoint fault/trap. 
5) Single-step (TF) trap. 
6) Task switch trap. 
7) Fault due to attempted debug register access 
when GD=1. 


The Debug Status Register contains single-bit flags 
for each of the possible events invoking exception 1. 
Note below that some of these events are faults (ex- 
ception taken before the instruction is executed), 
while other events are traps (exception taken after 
the debug events occurred). 


The flags in DR6 are set by the hardware but never 
cleared by hardware. Exception 1 handler software 
should clear DR6 before returning to the user pro- 
gram to avoid future confusion in identifying the 
source of exception 1. 


The fields within the Debug Status Register, DR6, 
are as follows: 


Four breakpoint indicator flags, BO-B3, correspond 
one-to-one with the breakpoint registers in DRO- 
DR3. A flag Bi is set when the condition described 
by DRi, LENi, and RWi occurs. 


If Gi or Li is set, and if the ith breakpoint is detected, 
the processor will invoke the exception 1 handler. 
The exception is handled as a fault if an instruction 
execution breakpoint occurred, or as a trap if a data 
breakpoint occurred. 


IMPORTANT 
NOTE: A flag Bi is set whenever the 
hardware detects a match condition on enabled 
breakpoint i. Whenever a match is detected on at 
least one enabled 
breakpoint i, the hardware imme- 
diately sets all Bi bits corresponding to breakpoint 
conditions matching at that instant, whether enabled 
or not. Therefore, the exception 1 handler may see 
that multiple Bi bits are set, but only set Bi bits corre- 
sponding to enabled 
breakpoints (Li or Gi set) are 
true indications of why the exception 1 handler was 
invoked. 


BD (debug fault due to attempted register access 
when GD bit set) 


This bit is set if the exception 1 handler was invoked 
due to an instruction attempting to read or write to • 
the debug registers when GD bit was set. If such an 
event occurs, then the GD bit is automatically 
cleared when the exception 1 handler is invoked, 
allowing handler access to the debug registers. 


This bit is set if the exception 1 handler was invoked 
due to the TF bit in the flag register being set (for 
single-stepping). 


BT (debug trap due to task switch) 


This bit is set if the exception 1 handler was invoked 
due to a task switch occurring to a task having a 
Intel486 Microprocessor TSS with the T bit set. Note 
the task switch into the new task occurs normally, 
but before the first instruction of the task is execut- 
ed, the exception 1 handler is invoked. With respect 
to the task switch operation, the operation is consid- 
ered to be a trap. 


9.3.4 
USE OF RESUME 
FLAG (RF) IN FLAG 
REGISTER 


The Resume Flag (RF) in the flag word can sup- 
press an instruction execution breakpoint when the 
exception 1 handler returns to a user program at a 
user address which is also an instruction execution 
breakpoint. 


This section describes the Intel486 Microprocessor 
instruction set. Tables 10.1 through 10.3 list all in- 
structions along with instruction encoding diagrams 
and clock counts. Further details of the instruction 
encoding are then provided in Section 10.2, which 
completely describes the encoding structure and the 
definition of all fields occurring within the Intel486 
Microprocessor instructions. 


10.1 
Intel486™ 
Microprocessor 
Instruction 
Encoding and Clock 
Count Summary 


To calculate elapsed time for an instruction, multiply 
the instruction clock count, as listed in Tables 10.1 
through 10.3 by the processor clock period (e.g., 
40 ns for a 25 MHz Intel486 Microprocessor). 


For more detailed information on the encodings of 
instructions, refer to Section 10.2 Instruction Encod- 
ings. Section 10.2 explains the general structure of 
instruction encodings, and defines exactly the en- 
codings of all fields contained within the instruction. 


The Intel486 Microprocessor instruction clock count 
tables give clock counts assuming data and instruc- 
tion accesses hit in the cache. A separate penalty 
column defines clocks to add if a data access miss- 
es in the cache. The combined instruction and data 
cache hit rate is over 90%. 


A cache miss will force the Intel486 Microprocessor 
to run an external bus cycle. The Intel486 Micro- 
processor 32-bit burst bus is defined as r- b- w. 


Where: 
r = The number of clocks in the first cycle of a 
burst read or the number of clocks per data 
cycle in a non-burst read. 
b = The number of clocks for the second and sub- 
sequent cycles in a burst read. 
w = The number of clocks for a write. 


The fastest bus the Intel486 microprocessor can 
support is 2-1 - 2 assuming 0 wait states. The 
clock counts in the cache miss penalty column as- 
sume a 2-1 - 2 bus. For slower busses add r- 2 
clocks to the cache miss penalty for the first dword 
accessed. Other factors also affect instruction clock 
counts. 


Instruction 
Clock Count Assumptions 
1. The external bus is available for reads or writes 
at all times. Else add clocks to reads until the 
bus is available. 
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2. Accesses are aligned. Add three clocks to each 
misaligned access. 


3. Cache fills complete before subsequent access- 
es to the same line. If a read misses the cache 
during a cache fill due to a previous read or pre- 
fetch, the read must wait for the cache fill to 
complete. If a read or write accesses a cache 
line still being filled, it must wait for the fill to 
complete. 
4. If an effective address is calculated, the base 
register is not the destination register of the pre- 
ceding instruction. If the base register is the 
destination register of the preceding instruction 
add 1 to the clock counts shown. Back-to-back 
PUSH and POP instructions are not affected by 
this rule. 
5. An effective address calculation uses one base 
register and does not use an index register. 
However, if the effective address calculation 
uses an index register, 1 clock may be added to 
the clock count shown. 
6. The target of a jump is in the cache. If not, add r 
clocks for accessing the destination instruction 
of a jump. If the destination instruction is not 
completely contained in the first dword read, 
add a maximum of 3b clocks. If the destination 
instruction is not completely contained in the 
first 16 byte burst, add a maximum of another 
r+3b clocks. 
7. If no write buffer delay, w clocks are added only 
in the case in which all write buffers are full. 
Typically, this case rarely occurs. 
8. Displacement and immediate not used together. 


If displacement and immediate used together, 1 
clock may be added to the clock count shown. 
9. No invalidate cycles. Add a delay of 1 clock for 
each invalidate cycle if the invalidate cycle con- 
tends for the internal cache/ external bus when 
the Intel486 CPU needs to use it. 


10. Page translation hits in TLB. A TLB miss will add 
13,21 or 28 clocks to the instruction depending 
on whether the Accessed and/or Dirty bit in nei- 
ther, one or both of the page entries needs to 
be set in memory. This assumes that neither 
page entry is in the data cache and a page fault 
does not occur on the address translation. 
11. No exceptions are detected during instruction 
execution. Refer to Interrupt Clock Counts Ta- 
ble for extra clocks if an interrupt is detected. 


12. Instructions that read multiple consecutive data 
items (Le.task switch, paPA, etc.) and miss the 
cache are assumed to start the first access on a 
16-byte boundary. If not, an extra cache line fill 
may be necessary which may add up to (r+ 3b) 
clocks to the cache miss penalty. 


INSTRUCTION 
FORMAT 
Cache Hit 
Penally II 
Notes 
CacheMln 


INTEGER OPERATIONS 


MOV - 
Move: 


reg1 toreg2 
I 1000100W 
111 
reg1 reg21 
1 


reg2to reg1 
1 1000101w 
11 
reg1 reg21 
1 


memory to reg 
I 1000101w 
mod 
reg 
r/ml 
1 
2 


reg to memory 
1000100w 
mod 
reg 
r/ml 
1 


Immediate 
to reg 
1100011 
w 
11000 
reg I immediate 
data 
1 


or 
1011w reg 
immediate 
data 
1 


Immediate 
to Memory 
1100011 
w 
mod 000 
rIm I ~iSPlac.ement 
1 
Immediate 


Memory 
to Accumulator 
1010000w 
full displacement 
1 
2 


Accumulator 
to Memory 
1010001w 
full displacement 
1 


MOVSX/MOVZX 
~ Move with Sign/Zero 
Extension 


reg2toreg1 
I 00001111 
1 1011 
z11 w 
111 
reg1 reg21 
3 


memory 
to reg 
1 00001111 
I 
10'1 
z 11 w I mod 
reg 
r/ml 
3 
2 


z 
Instruction 


0 
MOVZX 


1 
MOVSX 


PUSH ~ Push 


reg 
1 11111111 
111 
110 
reg 1 
4 


or 
101010 
reg I 
1 


memory 
I 
11111111 
I mod 110 
rim I 
4 
1 
1 


immediate 
I 01101050 
I immediate 
data 
1 


PUSHA - 
Push All 
I 01100000 
I 
11 


POP ~ Pop 


reg 
1 10001111 
111 
000 
regI 
4 
1 


or 
101011 
reg 1 
1 
2 


memory 
I 
10001111 
1mod 000 
rim 1 
5 
2 
1 


POPA - 
Pop All 
I 01100001 
1 
9 
7/15 
16/32 


XCHG - 
Exchsnge 


reg1 with reg2 
1000011w 
111 
reg1 reg21 
3 
2 


Accumulator 
with reg 
10010 
reg 1 
3 
2 


Memory 
with reg 
1000011w 
1mod 
reg 
r/ml 
5 
2 


NOP - 
No Operstlon 
10010000 
1 
1 


LEA ~ Load EA to Register 
10001101 
1mod 
reg 
r/ml 


no index register 
1 
with index register 
2 


•• 


INSTRUCTION 
FORMAT 
CIIcIle Hit 
PeMllylf 
Not•• 
CKheMIu 


INTEGER OPERATIONS (Continued) 


Inatructlon 
TTT 


ADD ~ Add 
000 
AOC ~ Add with Garry 
010 


AND ~ logical AND 
100 


OR ~ logical OR 
001 


SUB = Subtract 
101 


SBB = Subtract with Borrow 
011 


XOR ~ logical Exclusiva OR 
110 


reg1 toreg2 
I OOTTTOOw 
11 
regl 
reg21 
1 


reg2toregl 
I OOTTTOlw 
11 
regl 
reg21 
1 


memory to register 
I OOTTTOlw 
mod 
reg 
rim I 
2 
2 


register to memory 
I OOTTTOOw 
mod 
reg 
rim I 
3 
6/2 
Ull 


immediate to register 
I 100000sw 
11 
TTT 
reg Iimmediate regrster 
1 


immediate to accumulator 
I 00TTT10w 
immediate data 
1 


immediate to memory 
I 100000sw 
mod TTT 
rim Iimmediate data 
3 
6/2 
U/l 


Inatructlon 
TTT 


INC = Increment 
000 
DEe ~ Decrement 
001 


reg 
I l111111w 
111 
TTT 
regI 
1 


or 
101 TTT 
regI 
1 


memory 
I l111111w 
I mod TTT 
r/ml 
3 
6/2 
U/l 


lnatructlon 
TTT 


NOT ~ logical Complement 
010 


NEG = Negate 
011 


reg 
I 1111011w 
111 
TTT 
regI 
1 


memory 
I 1111011 
w I mod TTT 
rim I 
3 
6/2 
U/l 


CMP = Compara 


regl with reg2 
0011100w 
11 
regl 
reg21 
1 


reg2 with regl 
0011101 
w 
11 
regl 
reg21 
1 


memory with register 
0011100w 
mod 
reg 
rim I 
2 
2 


register with memory 
0011101 
w 
mod 
reg 
rim I 
2 
2 


immediate with register 
100000sw 
11 
III 
regI immediate data 
1 


immediate with ace. 
I 0011110w 
immediate data 
1 


immediate with merTlOfY 
I 100000sw 
mod 111 
rIm Iimmediate data 
2 
2 


TEST = logical 
Compara 


regl and reg2 
1000010w 
111 
regl 
reg21 
1 


memory and register 
1000010w 
Imod 
reg 
rim I 
2 
2 


immediate and register 
1111011 
w 
111 
000 
regIimmediate data 
1 


immediate and ace. 
1010100w 
I immediate data 
1 


immediate and memory 
1111011 
w I mod 000 
rim Iimmediate data 
2 
2 


intel~ 


INSTlIucnOH 
FORMAT 
CaclleHlt 
Panallyl! 
Not •• 
CecIleM 
••• 


INTEGER OPERATIONS 
(Continued) 


MUL = MultIply 
(UMIgned) 


ace. with register 
I 1111011w 
111 
100 
reg I 
, 


Multiplier-Byte 
13/18 
MN/MX,3 


Word 
13/26 
MN/MX,3 


Dword 
13/42 
MN/MX,3 


ace. with memory 
I 1111011w 
I mod 
100 
rIm I 


Multiplier·Byte 
13/18 
1 
MN/MX,3 


Word 
13/26 
1 
MN/MX,3 


Dword 
13/42 
1 
MN/MX,3 


IMUL - 
Int_ 
MulUply (eIgnacI) 


ace. with register 
I 111101 
1w 
111 
101 
reg I 


Multiplier·Byte 
13/18 
MN/MX,3 


Word 
13/28 
MN/MX,3 


Dword 
13/42 
MN/MX,3 


ace. with memory 
I 1111011w 
Imod 
101 
rIm I 


Multiplier·Byte 
13/18 
MN/MX,3 


Word 
13/28 
MN/MX,3 


Dword 
13/42 
MN/MX,3 


reg1 with reg2 
I 00001111 
I 10101111 
111 
reg1 
reg21 


Multiplier·Byte 
13/18 
MN/MX,3 


Word 
13/28 
MN/MX,3 


Dword 
13/42 
MN/MX,3 


register with memory 
I 00001111 
I 10101111 
Imod 
reg 
r/ml 


Multiplier·Byte 
13/18 
1 
MN/MX,3 


Word 
13/28 
1 
MN/MX,3 


Dword 
13/42 
1 
MN/MX,3 


reg1 with imm. to reg2 
I 01101 
Os 1 
111 
reg1 
reg2llmmediate 
data 


Multiplier-Byte 
13/18 
MN/MX,3 


Word 
13/28 
MN/MX,3 


Dword 
13/42 
MN/MX,3 


memo with imm. to reg. 
I 011010s1 
I mod 
reg 
rIm I Immediate data 


Multiplier·Byte 
13/18 
2 
MN/MX,3 


Word 
13/26 
2 
MN/MX,3 


Dword 
13/42 
2 
MN/MX,3 
DIY - 
_ 
(unaIgnad) 


ace. by register 
I 111101 
1w 
111 
110 
reg 1 


Divisor·Byte 
16 


Word 
24 


Dword 
40 


ace. by memory 
I 1111011w 
Imod 
110 
rIm I 


Divisor-Byte 
16 


Word 
24 
Dword 
40 


IDlY = ~ 
_ 
(eIgnacI) 


ace. by register 
I 1111011 
w 
111 
111 
reg I 


DMsor-Byte 
19 


Word 
27 
Dword 
43 


• 


INSTRUCTION 
FORMAT 
Cache Hit 
Penalty 
If 
Not •• 
caeheMlaa 


INTEGER OPERATIONS 
(Continued) 


acc. by memory 
11 1 1 1 0 1 1 w 1mod 1 1 1 rIm 1 


Divisor-Byte 
20 


Word 
28 


Dwa<d 
44 


CBW ICWD 
= Convert 
Byte to Wordl 
1100110001 
Convert 
Word to Dword 
3 


CWD/COO 
~ Convert 
Word to Dwordl 
1100110011 
Convert 
Dword 
to 
3 
Quad Word 


Instruction 
TTT 


ROL = Rotate Left 
000 


ROR - 
Rotate Right 
001 


RCL = Rotate through Gerry Left 
010 


RCR = Rotate through Carry Right 
011 


SHL/SAL 
~ Shift Logicall Arithmetic 
Left 
100 


SHR - 
Shift Logical Right 
101 


SAR - 
Shift Arithmetic 
Right 
111 


Not Through 
Carry (ROl, 
ROR, SAl, 
SAR, SHl, and SHR) 


reg by 1 
1101000w 
11 
TTT 
reg I 
3 


memory by 1 
1101000w 
mod TTT 
rIm 1 
4 
6 


regbyCL 
1101001 
w 
11 
TTT 
reg I 
3 


memorybyCL 
1101001 
w 
mod TTT 
rIm I 
4 
8 
, 


reg Iimmediate 8-bit data 
reg by immediate count 
1100000w 
11 
TTT 
2 


mem by immediate count 
I 1100000w 
mod TTT 
rIm Iimmediate 8-bit data 
4 
6 


Through 
carry 
(RCL and RCR) 


regby1 
I 1101000w 
11 
TTT 
reg 1 
3 


memory by 1 
I 1101000w 
mod TTT 
r/ml 
4 
6 


regbyCL 
I 1101001 
w 
11 
TTT 
reg 1 
8/30 
MN/MX,4 


memorybyCL 
I 1101001 
w 
mod TTT 
rlml 
9/31 
MN/MX,5 


reg by immediate count 
I 1100000w 
11 
TTT 
reg Iimmediate 8-bit data 
8/30 
MN/M)(,4 


mern by immediate count 
I 1100000w 
mod TTT 
rIm Iimmediate 8-bit data 
9/31 
MN/MX,5 


lnatrvctlon 
TTT 


SHLD = Shift Laft Double 
100 


SHRD = Shift Right Double 
101 


register with immediate 
00001111 
10TTT100 
11 
reg2 
reg1 1imm 8-bit data 
2 


memory by immediate 
00001111 
10TTT100 
mod 
reg 
rim Iimm 8-bit data 
3 
8 


registar by CL 
00001111 
10TTT101 
11 
reg2 
reg11 
3 


memorybyCL 
00001111 
10TTT101 
mod 
reg 
rIm 1 
4 
5 


BSWAP = Byte Swap 
00001111 
11001 
reg 
1 


XADD = EXchenge 
and Add 


reg1,reg2 
I 00001111 
11100000W 
111 
reg2 
regl I 
3 


memory, reg 
I 00001111 
11100000w 
1mod 
reg 
r/ml 
4 
8/2 
UlL 


CMPXCHG 
= Compa •• and Exchenge 


reg 1, reg2 
1 00001111 
11011000w 
111 
reg2 
reg11 
6 


memory,reg 
1 00001111 
11011000w 
Imod 
reg 
r/ml 
7110 
2 
6 
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INSTRUCTION 
FORMAT 
C8Che Hit 
Penllty 
It 
Not •• 
CacheMI •• 


CONTROL TRANSFER 
(within 
IIllment) 


NOTE: Times are jump taken/not 
taken 
I 


Jccc = Jump on ccc 


8-M displacement 
I 
0111 
tttn 
I 
8-bildisp. 
I 
3/1 
TINT,23 


full displacement 
I 00001111 
I 
1000tttn 
Ifull displacement 
3/1 
T/NT,23 


NOTE: Times are jump taken/not 
taken 


SETcccc 
= SIt Byte on cccc (Tlmel 
Ire 
cccc true/fel 
•• ) 


reg 
I 00001111 
I 
1001tttn 
111 
000 
reg 1 
4/3 


memory 
I 00001111 
I 
1001tttn 
1mod 
000 
r/ml 
3/4 


Mnemonic 
Condltlon 
tttn 
cccc 
0 
Overllowססoo 


NO 
No Overflow 
0001 


B/NAE 
BelowlNot 
Above or Equal 
0010 


NB/AE 
Not Below/ Above or Equal 
0011 


E/Z 
Equal/Zero 
0100 


NEINZ 
Not EquallNot 
Zero 
0101 


BEINA 
Below or EquallNot 
Above 
OlIO 


NBE/A 
Not Below or Equal/ Above 
0111 


S 
Sign 
1000 


NS 
Not Sign 
1001 


PIPE 
Parity/Parity 
Even 
1010 


NP/PO 
Not Parity/Parity 
Odd 
1011 


LINGE 
Less Than/Not 
Greater 
or Equal 
1100 


NL/GE 
Not Less Than/Greater 
or Equal 
1101 
. 


LE/NG 
Less Than or Equal/Greater 
Than 
1110 


NLE/G 
Not Less Than or Equal/Greater 
Than 
1111 


LOOP - 
LOOP CX Tlmel 
I 
11100010 
8-bitdisp, 
7/6 
LlNL,23 


LooPZlLooPE 
- 
Loop with 
I 11100001 
8-bitdisp. 
9/6 
LINL,23 


Zero/Eq 
•• 1 


LOOPNZILOOPNE 
= Loop while 
I 11100000 
8-bitdisp. 
9/6 
LINL,23 


Not Zero 


JeXZ 
- 
Jump on CX Zero, 
I 11100011 
8-bildisp. 
8/5 
T/NT,23 


JECXZ = Jump on ECX Zero 
I 
11100011 
8-bitdisp. 
I 
8/5 
T/NT,23 


(Address Size Prefix Differentiates 
JCXZ for JECXZj 


JMP = Uncondltlonll 
Jump (within 
IIllment) 


Short 
I 
11101011 
I 
8-bildisp. 
I 
3 
7,23 


Direct 
I 
11101001 
I full displacement 
3 
7,23 


Register Indirect 
I 11111111 
111 
100 
reg 1 
5 
7,23 


Memory Indirect 
I 11111111 
I mod 
100 
r/ml 
5 
5 
7 


CALL = Con (within 
eegment) 


Direct 
I 11101000 
Ifull displacement 
3 
7,23 


Register Indirect 
I 11111111 
111 
010 
reg I 
5 
7,23 


Memory Indirect 
I 11111111 
Imod 
010 
r/ml 
5 
5 
7 


RET - 
Retum 
from CALL (within 
IIllment) 
I 11000011 
I 
5 
5 


Adding Immediate to SP 
I 11000010 
I 
16-bit disp. 
I 
5 
5 


INSTRUCTION 
FORMAT 
ClIche 
Hit 
Peneltylf 
N_ 
ClIche Mia 


CONTROL TRANSFER (within MgIMllt) 
(Continued) 


ENTER = Enter~ 
•• 
1 11001000 
h6-llilcisp.,6-lliI18V8I 


Level - 
0 
14 


Level = 1 
17 


Level(L) > 1 
17+3L 
8 


LEAVE = Lnve~ 
•• 
I 11001001 
I 
5 
1 


MUL TIPLE-llEGMENT 
INSTRUCTIONS 


MaY = Move 


reg. to segment reg. 
1 10001110 
111 
O(eg3 
reg I 
3/9 
0/3 
RVlP,9 


memory to segment reg. 
1 10001110 
Imod O(eg3 rim I 
3/9 
2/5 
RV/P, 9 


segment reg. to reg. 
1 10001100 
111 
O(eg3 
reg I 
3 


segment reg. to memory 
I 10001100 
Imod O(eg3 rim I 
3 


PUSH - 
Push 


segment reg. 
10000(eg21101 
3 


(ES, CS, SS, or OS) 


segment reg. (FS or GS) 
! 
00001111 
!10 
0(8930001 
3 


POP = Pop 


segment 
reg. 
10000(eg2111 
3/9 
2/5 
RV/P, 9 
(ES, SS, or OS) 


segment reg. (FS or GS) 
I 00001111 
10 
0(893001! 
3/9 
2/5 
RVlP, 9 


LOS = LoIId Pointer to OS 
I 11000101 
mod 
reg 
r/ml 
6/12 
7110 
RV/P, 9 


LES - 
LoIId P_to 
ES 
I 11000100 
mod 
reg 
rim! 
6/12 
7/10 
RV/P, 9 


LFS = LoIId Pointer to FS 
! 
00001111 
10110100 
I mod 
reg 
rim! 
6/12 
7/10 
RV/P, 9 


LOS - 
LoIId PoInter to OS 
I 00001111 
10110101 
I mod 
reg 
r/ml 
6/12 
7110 
RV/P, 9 


LSS - 
LoIId Pointer to SS 
I 00001111 
10110010 
I mod 
reg 
rim! 
6/12 
7/10 
RV/P, 9 


CALL - 
celt 


Direct intersegment 
I 10011010 
I uns~ned full offset. selector 
18 
2 
R,7,22 


to same level 
20 
3 
P,9 
thru Gate to same level 
35 
6 
P,9 


to inner lev~. no parameters 
69 
17 
P,9 
to inner level, x parametar (d) words 
77+4X 
17+n 
P,11,9 
toTSS 
37+TS 
3 
P,10,9 
thru Task Gate 
38+TS 
3 
P,10,9 


Indirect intersegment 
1 11111111 
1mod 011 
rim I 
17 
8 
R,7 


to same level 
20 
10 
P,9 
thru Gate to same level 
35 
13 
P,9 


to inner level, no parameters 
69 
24 
P,9 
to innar level, x parameter (d) words 
77+4X 
24+n 
P,11,9 
toTSS 
37+TS 
10 
P,10,9 
thru Task Gate 
36+TS 
10 
P,10,9 
RET = Return !rom CALL 


intersegment 
I 11001011 
! 
13 
8 
R,7 


to same level 
17 
9 
P,9 
to outar level 
35 
12 
P,9 


intarsegment adding 
I 11001010 
I 
16-bi1disp. I 
imm.toSP 
14 
8 
R,7 
to same level 
18 
9 
P,9 
to outer level 
36 
12 
P,9 
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INSTRUCTION 
FORMAT 
Cache Hit 
P.nally 
If 
Not •• 
Cach.MI 
•• 


MULTIPLE·SEGMENT 
INSTRUCTIONS 
(ContinUed) 


JMP ~ Unconditional 
Jump 


Direct intersegment 
I 11101010 
I unsigned full offset, selector 
17 
2 
R,7,22 


to same level 
19 
3 
P,9 


thru Call Gate to same level 
32 
6 
P,9 


lhru TSS 
42+TS 
3 
P,10,9 


thru Task Gate 
43+TS 
3 
P,10,9 


Indirect int",segment 
I 11111111 
I mod 1 01 
rIm 1 
13 
9 
R,7,9 


to same level 
18 
to 
P,9 
thru Call Gate to same level 
31 
13 
P,9 


lhruTSS 
41+TS 
10 
P,10,9 
thru Task Gate 
42+TS 
10 
P,10,9 


BIT MANIPULATION 


BY = T•• t bit 


regtster, immediate 
I 00001111 
I 10111010 
111 
100 
regIimm. 8-bit data 
3 


memory 
I immediate 
I 00001111 
I 10111010 
Imod 100 
rIm I imm. 8..bft data 
3 
1 


reg1, reg2 
I 00001111 
110100011 
111 
reg2 reg11 
3 


memo<y,reg 
I 00001111 
1 10100011 
1mod 
reg 
r/ml 
8 
2 


lnatructton 
TIT 


BTS = Test Bhnd5et 
101 
BTR ~ Test Bit and Reset 
110 
BTe ~ Test B~ and Compliment 
111 


register, 
immediate 
I 00001111 
1 10111010 
111 
TTT 
reg Ilmm. 8·bit data 
6 


memory, immediate 
I 00001111 
1 10111010 
1mod TTT 
rIm Ilmm. 8-bit data 
8 
2/0 
UlL 


reg1,reg2 
100001111 
I 10TTT011 
111 
reg2 reg11 
6 


memory,reg 
I 00001111 
I 10TTT011 
I mod 
reg 
rIm 1 
13 
3/1 
U/L 


BSF = SCan BIt Forwent 


reg1, reg2 
I 00001111 
I 
10111100 
111 
reg2 reg11 
8/42 
MN/MX,12 


memory,reg 
I 00001111 
1 10111100 
1mod 
reg 
r/ml 
7/43 
2 
MN/MX,13 


BSR-SCanBltRevene 


reg1,reg2 
I 00001111 
I 10111101 
111 
reg2 reg11 
6/103 
MN/MX,14 


memory,reg 
I 00001111 
I 10111101 
Imod 
reg 
rIm 1 
7/104 
1 
MN/MX,15 


STRING INSTRUCTIONS 


CMPS = CclnlpeN 
Byt. Word 
I 1010011w 
I 
8 
6 
16 


LODS = L_ 
Byt./Word 
I 1010110w 
I 
5 
2 
to ALl AX/EAX 


MOYS ~ Mov. 
BytelWord 
I 1010010w 
1 
7 
2 
18 


SCAS ~ SCan Byt./Word 
I 1010111 
w 1 
6 
2 


STOS - 
Sto •.• Byt./Word 
I 1010101w 
I 
5 
from All AX/EX 


XLAT = T•.•••••• t. String 
I 11010111 
I 
4 
2 


II 
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INSTRUCTION 
FORMAT 
Clche Hit 
Plnllty 
If 
Notel 
CecheMI •• 


REPEATEDSTRINGINSTRUCTIONS 


Repeated by Count inCX or ECX (C- Count inCX or ECX) 


REPE CMPS = Compere String 
I 11110011 
I 1010011 
w I 
(Find Non-Match) 
C=O 
5 
C>O 
7+7c 
16,17 


REPNE CMPS - Compere String 
I 11110010 
I 1010011 
w I 
- 


(Find Match) 
C-O 
5 
C>O 
7+7c 
16,17 


REP LODS = Laid String 
I 11110011 
I 1010110w 
I 
C-O 
5 
C>O 
7+4c 
16,18 


REP MOVS = Move String 
I 11110011 
I 1010010w 
I 
C=O 
5 
C -1 
13 
1 
16 
C> 
1 
12+3c 
16,19 


REPE SCAS - SCln String 
I 11110011 
I 1010111 
w I 
(Find Non-ALlAX/EAX) 
C-O 
5 
C>O 
7+5c 
20 


REPNE SCAS - SCln String 
I 11110010 
I 1010111w 
I 
(FindALlAX/EAX) 
C=O 
5 
C>O 
7+5c 
20 


REP STOS - Store String 
I 11110011 
I 1010101w 
I 
C=O 
5 
C>O 
7+4c 


FLAGCONTROL 


CLC - Clelr Corry FIIg 
I 11111000 
2 


STC = Set Corry FIIg 
I 11111001 
2 


CMC - Complement Clrry FIIg 
I 11110101 
2 


CLD = Clelr Direction Flog 
I 11111100 
2 


STD = Set Dlrectlon Flog 
I 11111101 
2 


CLI = Cllir Interrupt 
I 11111010 
5 
Eneble FIIg 


STI = Set Interrupt 
I 11111011 
5 
Eneble Rag 


LAHF - Lood AHInto F"'g 
I 1 00 11111 
I 
3 


SAHF = Store AHInto FIege 
I 10011110 
I 
2 


PUSHF - Pueh FIIIlI 
I 10011100 
I 
4/3 
RVIP 


POPF = Pop FIIgl 
I 10011101 
I 
9/6 
RV/P 


DECIMALARITHMETIC 


AAA = ASCIIAdJult for AdcI 
I 00110111 
I 
3 


AAS - ASCIIAdJultfor 
I 00111111 
I 
3 
Subtrlet 


AAM = ASCIIAdluet for 
I 11010100 
I 00001010 
I 
15 
Multlply 
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INSTRUCTION 
FORMAT 
ClIChe Hit 
P•••• 1tyII 
Not•• 


/ 
ClICheMI •• 


DECIMAL ARITHMETIC (Continued) 


MD 
~ ASCII Adjuatlor 
I 11010101 
I 00001010 
I 
14 
. 


DIvide 


OM 
~ DecI••••1AdJuat lor Add 
I 00100111 
I 
2 


DAS ~ Dect••••1Adjuat lor SUbIrllCt I 00101111 
I 
2 


PROCESSORCONTROL INSTRUCTIONS 


HLT = Halt 
I 11110100 
I 
4 


MOV = Move To and From Conlrol/DebulllT 
•• t Reglat••• 


CROfrom register 
00001111 
00100010 
11 
000 
regI 
17 
2 


CR2/CR3 from register 
00001111 
00100010 
11 
eee 
regI 
4 


Reg from CRO-3 
00001111 
00100000 
11 
eee 
regI 
4 


ORO-3 from register 
00001111 
00100011 
11 
eee 
regI 
10 


OR6-7 from register 
00001111 
00100011 
11 
eee 
regI 
10 


Register from OR6-7 
I 00001111 
00100001 
11 
eee 
regI 
9 


Register from ORO-3 
I 00001111 
00100001 
11 
eee 
regI 
9 


TR3 from register 
I 00001111 
00100110 
11 
011 
regI 
4 


TR4-7 from register 
I 00001111 
00100110 
11 
eee 
regI 
4 


Register from TR3 
I 00001111 
00100100 
11 
011 
regI 
3 


Register from TR4-7 
I 00001111 
00100100 
11 
eee 
regI 
4 


CLTS ~ CI.ar Teak Switched Fleg 
I 00001111 
00000110 
7 
2 


INVD = Inv.l_ 
Date ClIChe 
I 00001111 
00001000 
I 
4 


WBINVD = Writ_lICk 
and Invelld.lte I 00001111 
00001001 
I 
5 
Date ClIChe 


INVLPG = Invalld.lt. 
TLB Entry 


INVLPGmemory 
I 00001111 
I 00000001 
I mod 
111 
rIm I 
12/11 
HINH 


PREAX BYTES 


Add •••• 
Slz. Prell. 
I 01100111 
I 
1 


LOCK = Bua Lock _. 
I 11110000 
I 
1 


Operand SIz. Prefl. 
I 01100110 
I 
1 


Segment Ov.n1cle_. 


cs: 
I 00101110 
1 


OS: 
I 00111110 
1 


ES: 
I 00100110 
1 


FS: 
I 01100100 
1 


GS: 
I 01100101 
1 


55: 
I 00110110 
1 


• 


INSTRUCTION 
FORMAT 
Cache Hit 
Penally I! 
Notee 
CacheMI •• 


PROTECTION CONTROL 


ARPL = Adjuet Requeeted Privilege Level 


From register 
I 01100011 
111 
reg1 reg21 
9 


From memory 
I 01100011 
Imod 
reg 
r/ml 
9 


LAR ~ Loed Acceu 
Righte 


From register 
I 00001111 
I 00000010 
111 
reg1 reg21 
11 
3 


From memory 
1 00001111 
1 00000010 
Imod 
reg 
r/ml 
11 
5 


LGOT = Loed Globel Deecrlptor 


Table regist", 
I 00001111 
I 00000001 
Imod010 
r/ml 
12 
5 


L10T ~ Loed Interrupt 
Deecrlptor 


Table register 
I 00001111 
I 00000001 
Imod 011 
rlml 
12 
5 


LLDT = Loed Local Deecrlptor 


Table regist", from reg. 
I 00001111 
I 00000000 
111 
010 
regI 
11 
3 


Table 
register 
from memo 
1 00001111 
1 00000000 
1mod 010 
rim I 
11 
6 


LMSW ~ Load Machine Statue Word 


From register 
1 00001111 
1 00000001 
111 
110 
reg 1 
13 


From memory 
I 00001111 
1 00000001 
1mod 110 
rim 1 
13 
1 


LSL = Loed segment 
Umlt 


From register 
I 00001111 
1 00000011 
111 
reg1 reg21 
10 
3 


From memory 
I 00001111 
I 00000011 
I mod 
reg 
rim 1 
10 
6 


LTR = Loed Taak Regleter 


From Register 
I 00001111 
1 00000000 
111 
011 
regI 
20 


From Memcry 
I 00001111 
I 00000000 
Imod 011 
rim I 
20 


SOOT ~ Store GlobeI Deecrlptor Table 


I 00001111 
1 00000001 
Imod 000 
rim I 
10 


StOT ~ Store Interrupt 
Deecrlptor Table 
I 00001111 
1 
00000001 
1 mod 001 
rim 1 
10 


SLOT = Store Local Deecrlptor T_ 


Toregist", 
1 00001111 
I 00000000 
111 
000 
reg 1 
2 


To memory 
1 00001111 
1 00000000 
1mod 000 
rim I 
3 


SMSW = Store Machine Statue Word 


To register 
I 00001111 
I 00000001 
111 
100 
reg 1 
2 


To memory 
I 00001111 
I 00000001 
1mod 100 
rim 1 
3 


STR ~ Store Teek Regleter 


To register 
I 00001111 
1 00000000 
111 
001 
reg 1 
2 


To memory 
I 00001111 
I 00000000 
Imod 001 
rim I 
3 


VERR = Verity Reed Acceu 


Regist", 
I 00001111 
I 00000000 
111 
100 
rim 1 
11 
3 


Memcry 
1 00001111 
1 00000000 
Imod 100 
r/ml 
11 
7 


VERW = Verity Write Acce •• 


To register 
I 00001111 
I 00000000 
111 
101 
regI 
11 
3 


To memory 
I 00001111 
1 
00000000 
1mod 101 
rim I 
11 
7 


INSTRUCTION 
FORMAT 
eachaHII 
...".Ity 
If 
Nol •• 
<:acheMI •• 


INTERRUPT INSTRUCTIONS 


INT n - 
Inlllmlpl 
Type n 
I 11001101 
I 
type 
I 
INT+4/0 
RVlP,21 


INT 3 - 
Inlerrupt 
Type 3 
I 11001100 
I 
INT+O 
21 


INTO = Intenupl4 
If 
I 11001110 
I 
Ov_w 
FllIg s.t 


Taken 
INT+2 
21 
Not Taken 
3 
21 


BOUND ~ Inlerrupl51f 
Detect 
I 01100010 
I mod 
reg 
rIm I 
Value Out Range 


Ifinrange 
7 
7 
21 


If out of range 
INT+24 
7 
21 


IRET = Inlerrupl 
Return 
I 11001111 
I 


Real Mode/Virtuel 
Mode 
15 
& 
Protec1ed Mode 


To same level 
20 
11 
9 
To outer level 
36 
19 
9 
To nested task (EFLAGS.NT - 
1) 
TS+32 
4 
9,10 


Extemallnlllmlpl 
INT+11 
21 


NMI = Non-M_bIe 
Inlllmlpl 
INT+3 
21 


Page Faull 
INT+24 
21 


YMll8 Excap_ 


ell 
INT+& 
21 
STI 
INT+& 
21 


INTn 
INT+9 


PUSHF 
INT+9 
21 
POPF 
INT+& 
21 


IRET 
INT+9 


IN 
Fixed Port 
INT+50 
21 
Variable Port 
INT+51 
21 


OUT 
Fixed Port 
INT+SO 
21 
Variable Port 
INT+51 
21 
INS 
INT+50 
21 
OUTS 
INT+50 
21 
REPINS 
INT+S1 
21 
REPOUTS 
INT+S1 
21 


Task Switch Clock Counts Table 


Method 
Value for TS 


Cache Hit 
Miss Penalty 


VM/intel486 
CPU/286 
TSS To Intel486 CPU TSS 
162 
55 
VM/intel486 
CPU/286 
TSS To 286 TSS 
143 
31 
VM/intel486 
CPU/286 
TSS To VM TSS 
140 
37 


• 


Interrupt 
Clock Counts Table 


Method 
Value for INT 


Cache Hit 
Miss Penalty 
Notes 


Real Mode 
26 
2 


Protected 
Mode 


1 


Interrupt/Trap 
gate, same level 
44 
6 
9 
Interrupt/Trap 
gate, different 
level 
71 
17 
9 
Task Gate 
37 + TS 
3 
9, 10 


Virtual Mode 
Interrupt/Trap 
gate, different 
level 
82 
17 
Task gate 
37 + TS 
3 
10 


Abbreviations 
16/32 
U/L 
MN/MX 
LINL 
RV/P 
R 
P 
T/NT 
H/NH 


Definition 
16/32 bit modes 
unlocked/locked 
minimum/maximum 
loop/no loop 
real and virtual mode/protected mode 
real mode 
protected mode 
taken/not taken 
hit/no hit 


NOTES: 
1. Assuming that the operand address and stack address fall in different cache sets. 
2. Always locked, no cache hit case. 
3. Clocks = 10 + max(log2(lml),n) 
m = multiplier value (min clocks for m= 0) 
n = 3/5 for ±m 
4. Clocks = {quotient(count/operand length)I'7+9 
= 8 if count,;; 
operand length (8/16/32) 
5. Clocks = {quotient(count/operand length)I'7+9 
= 9 if count,;; 
operand length (8/16/32) 
6. Equal/not equal cases (penalty is the same regardless of lock). 
7. Assuming that addresses for memory read (for indirection), stack push/pop, and branch fall in different cache sets. 
8. Penalty for cache miss: add 6 clocks for every 16 bytes copied to new stack frame. 
9. Add 11 clocks for each unaccessed descriptor load. 
10. Refer to task switch clock counts table for value of TS. 
11. Add 4 extra clocks to the cache miss penalty for each 16 bytes. 
For notes 12-13: (b = 0-3, non-zero byte number); 
(i = 0-1, non-zero nibble number); 
(n = 0-3, non bit number in nibble); 
12. Clocks = 8+4 
(b+ 1) + 3(i+ 1) + 3(n+ 1) 
= 6 if second operand = 0 
13. Clocks = 9+4(b+ 
1) + 3(i+ 1) + 3(n+ 1) 
= 7 if second operand = 0 . 


For notes 14-15: (n = bit position 0-31) 
14. Clocks = 7 + 3(32-n) 
6 if second operand = 0 
15. Clocks = 8 + 3(32-n) 
7 if second operand = 0 
16. Assuming that the two string addresses fall in different cache sets. 
H. Cache miss penalty: add 6 clocks for every 16 bytes compared. Entire penalty on first compare. 
18. Cache miss penalty: add 2 clocks for every 16 bytes of data. Entire penalty on first load. 
19. Cache miss penalty: add 4 clocks for every 16 bytes moved. 
(1 clock for the first operation and 3 for the second) 
20. Cache miss penalty: add 4 clocks for every 16 bytes scanned. 


(2 clocks each for first and second operations) 
21. Refer to interrupt clock counts table for value of INT 
22. Clock count includes one clock for using both displacement and immediate. 
23. Refer to assumption 6 in the case of a cache miss. 


intel~ 


Re.1 
Protected 
Protected 
VirlUIIl8ll 
INSTRUCTION 
FORMAT 
Mode 
Mode 
Mode 
Mod. 
Not •• 
(CPL<:IOPL) 
(CPL>IOPL 


1/0 INSTRUCTIONS 


IN - 
Inpullrom: 


FIXed Port 
1111001 
Ow I 
port number I 
14 
9 
29 
27 


Variable Port 
1111011 
Ow I 
14 
8 
28 
27 


OUT = Output 
to: 


Fixed Port 
11110011 
wi 
port number I 
18 
11 
31 
29 


Variable Port 
11110111W 
I 
16 
10 
30 
29 


INS = Input Byte/Word 
1011011 
Ow I 
17 
10 
32 
30 


lromOX 
Port 


OUTS = Output 
Byte/Word 
10110111 
w I 
17 
10 
32 
30 
1 


to OX Port 


REP INS = Input Siring 
I 11110011 
1011011 
Ow I 
16+8c 
10+8c 
30+8c 
29+8c 
2 


REP OUTS = Output 
Siring 
I 11110011 
10110111 
wi 
17+5c 
11+5c 
31+5c 
3O+5c 
3 


NOTES: 
1. Two clock 
cache 
miss penalty 
in all cases. 


2. c = count 
in CX or ECX. 


3. Cache 
miss penalty 
in all modes: 
Add 2 clocks 
for every 
16 bytes. 
Entire penalty 
on second 
operation. • 


Ce_Hlt 
Concu ••••••t 


P_1ty1f 
Ex_ 


INSTRUCTION 
FORMAT 
Avg(Lllwer 
c.cheMIu 
Avg(Lllwer 
Not •• 


Renge ••• 
Renge .•• 
Upper Rengel 
Upper~ng.) 


DATA TRANSFER 


FLD ~ ,R.el LOlId to ST(O) 


32-bit memory 
111011 
OOdmod 
000 
rim I 
s-I-bl dlsp. 
I 
3 
2 


54-bitmemory 
111011 
1011modOOO 
r/ml 
s-I-bl dlsp. 
I 
3 
3 


80-blt memory 
111011 
oldmod 
101 
r/ml 
s-I-b/dlsp. 
I 
6 
4 


ST(I) 
111011 
00dl1000 
ST(I)I 
4 


FILD = Integ.r 
Llled to ST(O) 


16-blt memory 
111011 
l111mod 
000 
r/ml 
s-i-b/disp. 
I 
14.5(13-16) 
2 
4 


32-Mmemory 
111011 
0111 
mod 
000 
r/ml 
s-I-bl disp. 
I 
11.5(9-12) 
2 
4(2-4) 


54·b" memory 
111011 
llllmod 
101 
r/ml 
s·l-b/disp. 
I 
16.8(10-18) 
3 
7.8(2-8) 


FBLD = BCD Loed to ST(O) 
111011 
llllmod 
100 
rim I 
s-i-bl disp. 
I 
75(70-103) 
4 
7.7(2-8) 


FST - 
Store R•• I from ST(O) 


32-Mmemory 
111011 
oOllmod 
010 
rim I 
s-i-b/disp. 
I 
7 
1 


54-bit memory 
111011 
1011 
mod 
010 
rlml 
s-i-b/dlsp. 
I 
8 
2 


ST(I) 
111011 
101111010 
ST(i) I 
3 


FSTP = Store R•• I from ST(O) end Pop 


32-bit memory 
111011 
oOllmod 
011 
rim I 
s-i-bl dlsp. 
I 
7 
1 


54-bit memory 
111011 
10dmod 
011 
rim I 
s-I-b/dlsp. 
I 
8 
2 


8O-Mmemory 
111011 
0111 
mod 
111 
rim I 
s-I-b/dlsp. 
I 
8 


ST(I) 
111011 
101111001 
ST(I)I 
3 


FIST = Store Integer from ST(Ol 


18-bit memory 
111011 
l111mod 
010 
rim I 
s-I-b/dlsp. 
I 
33.4(29-34) 


32-bit memory 
111011 
0111 
mod 
010 
rim I 
s-I-b/dlsp. 
I 
32.4(28-34) 


FISTP = Store Integer from ST(O) end Pop 


18-bit memory 
111011 
111 
mod 
011 
rim I 
s-I-b/dlsp. 
I 
33.4(29-34) 


32-bit memory 
111011 
011 
mod 
011 
rlml 
s-i-b/dlsp. 
I 
33.4(29-34) 


54-Mmemory 
111011 
111 
mod 
111 
rlml 
s-i-b/disp. 
I 
33.4(29-34) 
. 


FBSTP - 
Store BCD from 
111011 
111 
mod 
110 
r/ml 
s-I-b/disp. 
I 
175(172-178) 
ST(Ol end Pop 


FXCH = Excheng. 
ST(O) end ST(I) 
111011 
001 
11001 
ST(I)I 
4 


COMPARISON INSTRUCTIONS 


FCOM = Compere ST(O) with R.el 


32-bit memory 
111011 
0001 
mod 
010 
rim I 
s-I-b/dlsp. 
I 
4 
2 
1 


54-bit memory 
111011 
100lmod 
010 
rim I 
s-i-b/dlsp. 
I 
4 
3 
1 


ST(I) 
111011 
000111010 
ST(I)I 
4 
1 


FCOMP - 
Compere ST(O) with R.el end Pop 


32-blt memory 
111011 
oooimod 
011 
r/ml 
s-I-bl disp. 
I 
4 
2 
1 


54-bit memory 
111011 
100lmodOl1 
rlml 
s-I-b/dlsp. 
I 
4 
3 
1 


ST(I) 
111011 
000111011 
ST(I)I 
4 
1 


c.eheHIt 
Concurrent 


Pen8/ty1f 
Execution 


INSTRUCTION 
FOR••AT 
AVlI(LowIlf 
c.ehe 
••••• 
Avg(LowIlf 
Not •• 
Range ••• 
Range ••• 
Upper Range) 
Uppar Ranga) 


CO•• PARISON INSTRUCTIONS (Continued) 


FCOMPP ~ Compa •• ST(O)with 
111011 
11 O! 1101 
1001 
1 
5 
1 


ST(1) and Pop Twice 


FICO" 
= Compara ST(O)with Inlagar 


16-bil merT10lY 
111011 
1101 
mod 010 
rim! 
s-i--b/disp. 
I 
18(16-20) 
2 
1 


32-M merT10lY 
111011 
010lmod 
010 
r/ml 
.-i-b/disp. 
I 
16.5(15-17) 
2 
1 


FICOMP - 
Compa •• ST(O)with Inl_ 


16-bit memory 
111011 
1101 
mod 011 
r/ml 
.-i-bl disp. 
1 
18(16-20) 
2 
1 


32-bitmemory 
111011 
010 
mod 011 
rim 1 
.-i-b/disp. 
I 
16.5(15-17) 
2 
1 


FTST ~ Compa •• ST(O)with 0.0 
111011 
001 
1110 
01001 
4 
1 


FUCO" 
~ Unordllfed 
compara 
111011 
101 
11100 
ST(i) I 
4 
1 


ST(O)w1th ST(I) 


FUCO•• P = Unordered 
compa •• 
111011 
101 
11101 
ST~)I 
4 
1 


ST(O)with ST(I) and Pop 


FUCO•• PP ~ Unordllt'ed compa •• 
111011 
01011110 
1001 I 
5 
1 


ST(O)with ST(I) and Pop Twice 


FXA" 
~ Examine ST(O) 
111011 
00111110 
01011 
8 


CONSTANTS 


FLOZ = Load +0.0 Inlo ST(O) 
11011 
001 
1110 
11101 
4 


FL01 = Loed + 1.0 Inlo ST(O) 
11011 
001 
1110 
10001 
4 


FLOPI = Loed.". Into ST(O) 
11011 
001 
1110 
10111 
8 
2 


FLOUT 
- 
Loed 1olI2(10) Into ST(O) 
11011 
001 
1110 
1001 I 
8 
2 


FLOL2E = Load logia) 
Into ST(O) 
11011 
001 
1110 
10101 
8 
2 


FLOLG2 ~ Loed k'II10(2) Into ST(O) 
111011 
001 
1110 
11001 
8 
2 


FLOLN2 - 
Load 10lI0(2) Inlo ST(O) 
111011 
001 
1110 
1101 I 
8 
2 


ARITH •• ETIC 


FADD ~ Add RN' with ST(O) 


ST(O)+- ST(O) + 32-M memory 
111011 
0001 
mod 000 
r/ml 
.·i-bl disp. 
1 
10(8-20) 
2 
7(5-17) 


ST(O)+- ST(O) + 84-bit merT10lY 
111011 
100lmod 
000 
r/ml 
s-i--b/disp. 
I 
10(8-20) 
3 
7(5-17) 


ST(d) +- ST(O) + STeil 
111011 
dOO!11000 
ST(i)I 
10(8-20) 
7(5-17) 


FADDP = Add •• al with ST(O)and 
111011 
110111000 
ST(ill 
10(8-20) 
7(5-17) 
Pop (ST(I) +- ST(O) + ST(Il) 


FSUB = SUbtract •• alfrom 
ST(O) 


ST(O)+- ST(O) - 
32-b~ merT10lY 
111011 
0001 
mod 
100 
rim I 
s-i-b/disp. 
I 
10(8-20) 
2 
7(5-17) 


ST(Ol +- ST(O) - 
84~ 
merT10lY 
111011 
100lmod 
100 
rim I 
s-i-b/disp. 
I 
10(8-20) 
3 
7(5-17) 


ST(d) +- ST(Ol - 
STeil 
111011 
dool1110d 
ST(ill 
10(8-20) 
7(5-17) 


FSUBP ~ SUblrllct •• alfrom 
ST(O) 
111011 
110111101 
ST(QI 
10(8-20) 
7(5-17) 
and Pop (ST(I) +- ST(O) - 
ST(Il) 


• 


caetIe 
Hit 
Concu ••.•••t 


Penalty 
If 
Execution 


INSTRUCTION 
FORMAT 
AVll(Low.r 
caetleMI 
•• 
AVll(Low.r 
Not •• 


RlIIlll·· 
•• 
R.IllI" 
•.• 


Upper RlIIlll.) 
UpperR.nge) 


ARITHMETIC 
(Continued) 


FSUBR ~ SUbtrect 
real revereed 
(SUbtrect 
ST(O) from 
••• 1) 


ST(O) +- 32-bil memory 
- 
ST(O) 
111011 
0001 
mod 
101 
r/ml 
s-i-b/disp. 
I 
10(8-20) 
2 
7(5-17) 


ST(O) +- 64-bit memory 
- 
ST(O) 
111011 
100lmod 
101 
rim! 
•..i-b/disp. 
I 
10(8-20) 
3 
7(5-17) 


ST(d) +- ST(i) - 
ST(O) 
111011 
dOO!1110d 
ST(i) I 
10(8-20) 
7(5-17) 


FSUBRP = SUb""ct 
re.1 rev.reed 
111011 
110111100 
ST(i) I 
10(8-20) 
7(5-17) 


and Pop (ST(I) +- ST(I) - 
ST(O)) 


FMUL = Multlply 
••• 1with ST(O) 


ST(O) +- ST(O) x 32-bit memory 
111011 
OOOlmod 
001 
rim I 
s-i-b/disp. 
I 
11 
2 
8 


ST(O) +- ST(O) x 64-bit memory 
111011 
100lmod 
001 
r/ml 
•..i-b/disp. 
I 
14 
3 
11 


ST(d) +- ST(O) x ST(i) 
111011 
dool11001 
ST(i) I 
16 
13 


FMULP ~ Multiply 
ST(O) with ST(I) 
111011 
11 0!11 
001 
ST(i) I 
16 
13 


and Pop (ST(I) +- ST(O) x ST(I)) 


FDIV ~ Dlvtde ST(O) by R•• I 


ST(O) +- ST(0)/32-bil 
memory 
111011 
oooimod 
110 
rim I 
s·i-b/disp. 
I 
73 
2 
70 
3 


ST(O) +- ST(O)/64-bit 
memory 
111011 
100lmod 
110 
r/ml 
s-i-bl disp. 
I 
73 
3 
70 
3 


ST(d) +- ST(O)/ST(i) 
111011 
dool1111 
d 
ST(i) I 
73 
70 
3 


FDIVP = Divide 
ST(O) by ST(I) .nd 
111011 
110111111 
ST(ill 
73 
70 
3 


Pop (ST(I) +- ST(O)/ST(I)) 


FDIVR = Dlvld. 
real rev.reed 
(R.aIlST(O)) 


ST(O) +- 32-bil memoryIST(O) 
111011 
oooimod 
111 
rim I 
s-i-b/disp. 
I 
73 
2 
70 
3 


ST(O) +- 64-bit memoryIST(O) 
111011 
100lmod 
111 
r/ml 
s-i-bl disp. 
I 
73 
3 
70 
3 


ST(d) +- ST(i)/ST(O) 
111011 
dOO 11111 
d 
ST(i) I 
73 
70 
3 


FDIVRP = Divide real revereed 
and 
111011 
110 
11111 
0 
ST(i) I 
73 
70 
3 


Pop (ST(I) +- ST(I)/ST(O) 


FIADD ~ Add Integer 
to ST(O) 


ST(O) +- ST(O) + 16-bit memory 
111011 
1101 
mod 
000 
r/ml 
s-i-b/disp. 
I 
24(20-35) 
2 
7(5-17) 


ST(O) +- ST(O) + 32-bit memory 
111011 
ololmod 
000 
r/ml 
s-i-b/disp. 
I 
22.5(19-32) 
2 
7(5-17) 


FISUB = Subtract 
Integ.r 
from ST(O) 


ST(O) +- ST(O) - 
18-brt memory 
111011 
1101 
mod 
100 
rim I 
s-i-b/disp. 
I 
24(20-35) 
2 
7(5-17) 


ST(O) +- ST(O) - 
32-bil memory 
111011 
010lmod 
100 
r/ml 
s-i-bl disp. 
I 
22.5(19-32) 
2 
7(5-17) 


FISUBR 
~ Intell.r 
SUbtract 
Revereed 


ST(O) +- 16-bit memory 
- 
ST(O) 
111011 
11 O! mod 
101 
rim! 
•..i·b/disp. 
I 
24(20-35) 
2 
7(5-17) 


ST(O) +- 32-bit memory 
- 
ST(O) 
111011 
010lmod 
101 
r/ml 
s-i-bl disp. 
I 
22.5(19-32) 
2 
7(5-17) 


FIMUL ~ Multiply 
Integer 
with ST(O) 


ST(O) +- ST(O) x 16-bit memory 
!11 
0 11 
1101 
mod 
001 
r/ml 
s-i-b/disp. 
I 
25(23-27) 
2 
8 


ST(O) +- ST(O) x 32·bil memory 
111011 
010lmod 
001 
rim! 
s-i-b/disp. 
I 
23.5(22-24) 
2 
8 


FIDIV = Intell.r 
Divide 


ST(O) +- ST(0)/16-bit 
memory 
111011 
1101 
mod 
110 
r/ml 
•..i-b/disp. 
I 
87(85-89) 
2 
70 
3 


ST(O) +- ST(0)/32·bit 
memory 
111011 
010!mod 
110 
rim I 
•..i-b/disp. 
I 
85.5(64-86) 
2 
70 
3 


intel~ 


Cach.HIt 
Concurrent 


P.naltyll 
Execution 


INSTRUCTION 
FORMAT 
Avg(Lower 
each. 
MIlS 
Avg(Lower 
Not •• 


R.nge •.. 
R.ng ••.• 


Upper R.ng.) 
Upper R.nge) 


ARITHMETIC (Continued) 


FIOIVR - 
Integ.r 
Dlvld. 
R.v.'- 


ST(O)+- 16-b~ memory/ST(O) 
1,10" 
1101 
mod 
111 
rlml 
s-i-bl disp. 
I 
87(85-89) 
2 
70 
3 


ST(O)+- 32-b~ memory/ST(O) 
1,101, 
010 
mod 
111 
rim I 
s-i-b/disp. 
I 
85.5(84-86) 
2 
70 
3 


FSQRT - 
Squ.r. 
Root 
1,101, 
001 
1111 
10101 
85.5(83-87) 
70 


FSCALE - 
Scal. ST(O)by ST(I) 
1,101, 
001 
1111 
11011 
31(30-32) 
2 


FXTRACT ~ Extreet componente 
1" 
0 11 
001 
1111 
01001 
19(16-20) 
4(2-4) 


oIST(O) 


FPREM = P.rtI.1 R.mlnder 
11011 
001 
1111 
1000 I 
84(70-138) 
2(2-8) 


FPREMI - 
P.rtI.1 R.mlnd.r 
(IEEE) 
11011 
001 
1111 
01011 
94.5(72-167) 
5.5(2-18) 


FRNDINT ~ Round ST(O)to Integ.r 
11011 
001 
1111 
11001 
29.1(21-30) 
7.4(2-8) 


FABS - 
Abeolut. 
v.lu. 
01ST(O) 
11011 
001 
1110 
oood 
3 


FCHS ~ Change olgn 01 ST(O) 
11011 
001 
1110 
00001 
8 


TRANSCENDENTAL 


FCOS ~ Coaln. 
01 ST(O) 
11011 
001 
1111 
11111 
241(193-279) 
2 
8.7 


FPTAN = P.rtlel 
teng.nt 
01ST(O) 
11011 
001 
1111 
00101 
244(200-273) 
70 
6,7 


FPATAN = P.rtl.1 arctang.nt 
11011 
001 
1111 
00111 
289(218-303) 
5(2-17) 
6 


FSIN - 
Sin. of ST(O) 
11011 
001 
1111 
11101 
241(193-279) 
2 
8,7 


FSINCOS - 
Sin. _ 
coaln. 
01ST(O) 
11011 
001 
1111 
10111 
291(243-329) 
2 
8,7 


F2XMl 
- 
2"T(O) 
- 
1 
11011 
001 
1111 
00001 
242(140-279) 
2 
8 


FYL2X = ST(I) 
X log:z(ST(O» 
11011 
001 
1111 
00011 
311(186-329) 
13 
6 


FYL2XPI = ST(I) X Iog:z(ST(O)+ 1.0) 
11011 
001 
1111 
10011 
313(171-326) 
13 
8 


PROCESSOR CONTROL 


FINIT = Inltl.llz. 
FPU 
111011 
01111110 
00111 
17 
4 


111011 
1111,1,0 
0000 I 


, 


FSTSW AX - 
Store atetua word 
3 
5 


Into AX 


FSTSW = Slore atetua word 
1,101, 
101 
mod 
111 
rim! 
s-i-b/disp. 
I 
3 
5 


Intom.mory 


FLDCW - 
Load control 
word 
1,101, 
001 
mod 
101 
rim! 
s-i-b/disp. 
I 
4 
2 


FSTCW = Store control 
word 
111011 
001 
mod 
111 
rlml 
s·i·b/disp. 
I 
3 
5 


FCLEX - 
CI.ar .xc.ptlons 
111011 
011 
1110 
00101 
7 
4 


FSTENV - 
Store .nvlronment 
111011 
001 
mod 
110 
rlml 
s-i-bl disp. 
I 
Real and Virtual modes 16-bit Address 
67 
4 


Reel and Virtual modes 32-bit Address 
67 
4 


Protected mode 16-bit Address 
56 
4 


Protected mode 32-bit Address 
56 
4 


FLDENV = Load environment 
1" 
0 11 
oOllmod 
100 
rim! 
s-i-b/disp. 
I 
Real and Virtual modes 16-bit Address 
44 
2 
Real and Virtual modes 32-bit Address 
44 
2 
Protected mode 16-bit Address 
34 
2 
Protected mode 32-bit Address 
34 
2 


• 


CllcheHIt 
Concurrent 


•••••• Ity II 
Execution 


INSTRUCTION 
FORMAT 
Ayo(Lower 
C_MIu 
Ayo(Lower 
Not. 
R.nge 
••• 
Range ••• 
Upper Ranoa) 
Upper Range) 


PROCESSOR CONTROL 
(Continued) 


FSAVE = saya atata 
111011 
1011mod 
110 
rlml 
s-~b/disp. 
I 
Real and Virtual modes 16-bil Address 
154 
4 


Real and Virtual modes 32-bil Address 
154 
4 


Protected mode 16-bil Address 
143 
4 


Protected mode 32-bil Address 
143 
4 


FRSTOR 
~ R.t 
••••• atat. 
111011 
1011 
mod 
100 
r/ml 
s-i-bl 
I 
Real and Virtual modes 16-bil Addrass 
131 
23 
Real and VIrtUal modes 32·bil Address 
131 
27 
Protected mode 16-bil Addrass 
120 
23 
Protected mode 32-bil Addrass 
120 
27 


FINCSTP = Increment 
Stack Pointer 
111011 
001 
1111 
01111 
3 


FDECSTP = Dacramant 
Stack Pointer I 1 1 0 1 1 
001 
1111 
01101 
3 


FFREE = F••• 
ST(I) 
111011 
101 
11000 
ST(ill 
3 


FNOP = No_tiona 
111011 
001 
1101 
00001 
3 


WAIT 
= Walt until FPU raady 
I 
10011011 


(Minimum/Maximum) 
1/3 


NOTES: 
1. If operand 
is 0 clock 
counts 
= 27. 


2. If operand 
is 0 clock 
counts 
= 28. 


3. If CW.PC indicates 
24 bit precision 
then subtract 
38 clocks. 


If CW.PC 
indicates 
53 bit precision 
then subtract 
11 clocks. 


4. If there 
is a numeric 
error 
pending 
from a previous 
instruction 
add 17 clocks. 


5. If there 
is a numeric 
error 
pending 
from a previous 
instruction 
add 18 clocks. 


6. The INT pin is polled 
several 
times while this instruction 
is executing 
to assure 
short 
interrupt 
latency. 


7. If ABS(operand) 
is greater 
than 1r/4 then add n clocks. 
Where 
n = (operand/(1r/4». 
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10.2.1 
OVERVIEW 


All instruction encodings are subsets of the general 
instruction format shown in Figure 10.1. Instructions 
consist of one or two primary opcode bytes, possibly 
an address specifier consisting of the "mod rim" 
byte and "scaled index" byte, a displacement if re- 
quired, and an immediate data field if required. 


Within the primary opcode or opcodes, smaller en- 
coding fields may be defined. These fields vary ac- 
cording to the class of operation. The fields define 
such information as direction of the operation, size 
of the displacements, register encoding, or sign ex- 
tension. 


Almost all instructions referring to an operand in 
memory have an addressing mode byte following 
the primary opcode byte(s). This byte, the mod rim 
byte, specifies the address mode to be used. Certain 
encodings of the mod rim byte indicate a second 


addressing byte, the scale-index-base byte, follows 
the mod rim byte to fully specify the addressing 
mode. 


Addressing modes can include a displacement im- 
mediately following the mod rim byte, or scaled in- 
dex byte. If a displacement is present, the possible 
sizes are S, 16 or 32 bits. 


If the instruction specifies an immediate operand, 
the immediate operand follows any displacement 
bytes. The immediate operand, if specified, is always 
the last field of the instruction. 


Figure 10.1 illustrates several of the fields that can 
appear in an instruction, such as the mod field and 
the rim field, but the Figure does not show all fields. 
Several smaller fields also appear in certain instruc- 
tions, sometimes within the opcode bytes them- 
selves. Table 10.4 is a complete list of all fields ap- 
pearing in the Intel4S6 Microprocessor instruction 
set. Further ahead, following Table 10.4, are de- 
tailed tables for each field. 
• 


ITTTTTTTTI 
TTTTTTTTI 
modTTTr/m 
I ssindexbase 
Id321161s1 
nonedata32!16!S! 
none 
? 
07 
0}\765}20}\765.320j\ 
)\ 
) 


register and address 
mode specifier 


opcode 
(one or two bytes) 
(T represents an 
opcodebit.) 


"mod rim" 
byte 
address 
displacement 
(4, 2, 1 bytes 
or none) 


immediate 
data 
(4, 2, 1 bytes 
or none) 


.. 


Field Name 
Description 
Number 
of Bits 


w 
Specifies if Data is Byte or Full Size (Full Size is either 16 or 32 Bits 
1 
d 
Specifies Direction of Data Operation 
1 
s 
Specifies if an Immediate Data Field Must be Sign-Extended 
1 
reg 
General Register Specifier 
3 
mod rim 
Address Mode Specifier (Effective Address can be a General Register) 
2 for mod; 
3 for rim 
ss 
Scale Factor for Scaled Index Address Mode 
2 
index 
General Register to be used as Index Register 
3 
base 
General Register to be used as Base Register 
'3 
sreg2 
Segment Register Specifier for CS, SS, OS, ES 
2 
sreg3 
Segment Register Specifier for CS, 55, OS,ES, FS, GS 
3 
tttn 
For Conditional Instructions, Specifies a Condition Asserted 
or a Condition Negated 
4 


NOTE: 
Tables 
10.1-10.3 
show encoding 
of individual 
instructions. 


10.2.2 32-BIT EXTENSIONS OF THE 
INSTRUCTION SET 


With the Intel486 Microprocessor, the 8086/801861 
80286 instruction set is extended in two orthogonal 
directions: 32·bit forms of all 16·bit instructions are 
added to support the 32-bit data types, and 32-bit 
addressing modes are made available for all instruc- 
tions referencing memory. This orthogonal instruc- 
tion set extension is accomplished having a Default 
(D) bit in the code segment descriptor, and by hav· 
ing 2 prefixes to the instruction set. 


Whether the instruction defaults to operations of 16 
bits or 32 bits depends on the setting of the D bit in 
the code segment descriptor, which gives the de· 
fault length (either 32 bits or 16 bits) for both oper· 
ands and effective addresses when executing that 
code segment. In the Real Address Mode or Virtual 
8086 Mode, no code segment descriptors are used, 
but a D value of 0 is assumed internally by the In- 
tel486 
Microprocessor when operating in those 
modes (for 16-bit default sizes compatible with the 
8086/80186/80286). 


Two prefixes, the Operand Size Prefix and the Effec- 
tive Address Size Prefix, allow overriding individually 
the Default selection of operand size and effective 
address size. These prefixes may precede any op- 
code bytes and affect only the instruction they pre- 
cede. If necessary, one or both of the prefixes may 
be placed before the opcode bytes. The presence of 
the Operand Size Prefix and the Effective Address 
Prefix will toggle the operand size or the effective 
address size, respectively, to the value "opposite" 
from the Default setting. For example, if the default 
operand size is for 32-bit data operations, then pres- 
ence of the Operand Size Prefix toggles the instruc- 
tion to 16-bit data operation. As another example, if 
the default effective address size is 16 bits, pres- 
ence of the Effective Address Size prefix toggles the 
instruction to use 32-bit effective address computa- 
tions. 


These 32-bit extensions are available in all Intel486 
Microprocessor modes, including the Real Address 
Mode or the Virtual 8086 Mode. In these modes the 
default is always 16 bits, so prefixes are needed to 
specify 32-bit operands or addresses. For instruc- 
tions with more than one prefix, the order of prefixes 
is unimportant. 


Unless specified otherwise, instructions with 8-bit 
and 16-bit operands do not affect the contents of 
the high-order bits of the extended registers. 


10.2.3 ENCODING OF INTEGER 
INSTRUCTION FIELDS 


Within the instruction are several fields indicating 
register selection, addressing mode and so on. The 
exact encodings of these fields are defined immedi· 
ately ahead. 


For any given instruction performing a data opera- 
tion, the instruction is executing as a 32·bit operation 
or a 16-bit operation. Within the constraints of the 
operation size, the w field encodes the operand size 
as either one byte or the full operation size, as 
shown in the table below. 


Operand Size 
Operand Size 
wFleld 
During 16-Blt 
During 32·Blt 
Data Operations 
Data Operations 


0 
8 Bits 
8 Bits 
1 
16 Bits 
32 Bits 


10.2.3.2 Encoding of the General 
Register (reg) Field 


The general register is specified by the reg field, 
which may appear in the primary opcode bytes, or as 
the reg field of the "mod rim" byte, or as the rim 
field of the "mod rim" byte. 


Encoding of reg Field When w Field 
Is not Present In Instruction 


Register Selected 
Register selected 
reg'Fleld 
During 16-Blt 
During 32-Blt 
Data Operations 
Data Operations 


000 
AX 
EAX 
001 
CX 
ECX 
010 
DX 
EDX 
011 
BX 
EBX 
100 
SP 
ESP 
101 
BP 
EBP 
110 
SI 
ESI 
111 
DI 
EDI 
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Encoding 
of reg Field When w Field 
Is Present 
In Instruction 


Register 
Specified 
by reg Field 
DurIng 16-Blt Data Operations: 


Function 
of w Field 
reg 
(whenw 
= 0) 
(whenw 
= 1) 


000 
AL 
AX 


001 
CL 
CX 


010 
OL 
OX 


011 
BL 
BX 


100 
AH 
SP 


101 
CH 
BP 


110 
OH 
SI 


111 
BH 
01 


Register 
Specified 
by reg Field 
During 32-Blt Data Operations 


Function 
of w Field 
reg 


(whenw 
= 0) 
(whenw 
= 1) 


000 
AL 
EAX 
001 
CL 
ECX 


010 
OL 
EOX 
011 
BL 
EBX 


100 
AH 
ESP 


101 
CH 
EBP 


110 
OH 
ESI 


111 
BH 
EOI 


10.2.3.3 
Encoding 
of the Segment 
Register 
(sreg) Field 


The sreg field in certain instructions is a 2·bit field 
allowing one of the four 80286 segment registers to 
be specified. The sreg field in other instructions is a 
3-bit field, allowing the Intel486 Microprocessor FS 
and GS segment registers to be specified. 


2-Blt 
Segment 


sreg2 Field 
Register 
selected 


00 
ES 
01 
CS 


10 
SS 
11 
OS 


3-Blt 
Segment 


sreg3Fleid 
Register 
selected 


000 
ES 


001 
CS 


010 
SS 


011 
OS 


100 
FS 


101 
GS 


110 
do not use 
111 
do not use 


10.2.3.4 
Encoding 
of Address 
Mode 


Except for special instructions, such as PUSH or 
POP,where the addressing mode is pre-determined, 
the addressing mode for the current instruction is 
specified by addressing bytes following the primary 
opcode. The primary addressing byte is the "mod 
rim" byte, and a second byte of addressing informa- 
tion, the "s-i-b" 
(scale-index-base) byte, can be 


specified. 


The s-i-b byte (scale-index-base byte) is specified 
when using 32-bit addressing mode and the "mod 
rim" byte has rim = 100 and mod = 00,01 or 10. 
When the sib byte is present, the 32-bit addressing 
mode is a function of the mod, ss, index, and base 
fields. 


The primary addressing byte, the "mod rim" 
byte, 


also contains three bits (shown as TTT in Figure 
10.1) sometimes used as an extension of the pri- 
mary opcode. The three bits, however, may also be 
used as a register field (reg). 


When calculating an effective address, either 16-bit 
addressing or 32-bit addressing is used. 16-bit ad- 
dressing uses 16-bit address components to calcu- 
late the effective address while 32-bit addressing 
uses 32-bit address components to calculate the ef- 
fective address. When 16-bit addressing is used, the 
"mod rim" byte is interpreted as a 16-bit addressing 
mode specifier. When 32-bit addressing is used, the 
"mod rim" byte is interpreted as a 32-bit addressing 
mode specifier. 


Tables on the following three pages define all en- 
codings of all 16-bit addressing modes and 32-bit 
addressing modes. 
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mod rIm 
Effective Address 


00000 
OS:[BX+SI] 
00001 
OS:[BX+OI] 
00010 
SS:[BP+SI] 
00011 
SS:[BP+OI] 
00100 
OS:[SI] 
00101 
OS:[01] 
00110 
OS:d16 
00111 
OS:[BX] 


01000 
OS:[BX+ 81+ dB] 
01001 
08: [BX + 01+ dB] 
01 010 
88: [BP+ 81+ dB] 
01 011 
SS:[BP+ 01+ dB] 
01100 
08:[81+dB] 
01 101 
08:[01+dB] 
01 110 
S8:[BP+dB] 
01 111 
08:[BX+dB] 


Register Specified by rIm 
During 16-Blt Data Operations 


mod rIm 
Function of w Field 


(whenw=O) 
(whenw 
=1) 


11000 
AL 
AX 
11 001 
CL 
CX 
11 010 
OL 
OX 
11 011 
BL 
BX 
11 100 
AH 
SP 
11 101 
CH 
BP 
11110 
OH 
81 
11111 
BH 
01 


mod rIm 
Effective Address 


10000 
OS:[BX+SI+d16] 
10001 
OS:[BX+ 01+ d16] 
10010 
SS:[BP+81+d16) 
10011 
SS:[BP+01+d16] 
10100 
OS:[SI+d16] 
10101 
OS:[01+d16] 
10110 
SS:[BP+d16] 
10111 
OS:[BX+d16] 


11000 
register-see 
below 
11 001 
register-see 
below 
11 010 
register-see 
below 
11 011 
register-see 
below 
11100 
register-see 
below 
11 101 
register-see 
below 
11110 
register-see 
below 
11111 
register-see 
below 


R'Glster Specified by rIm 
During 32-Blt Data Operations 


mod rIm 
Function of w Field 


(whenw=O) 
(whenw 
=1) 


11000 
AL 
EAX 
11 001 
CL 
ECX 
11 010 
OL 
EOX 
11 011 
BL 
EBX 
11100 
AH 
ESP 
11 101 
CH 
EBP 
11110 
OH 
ESI 
11111 
BH 
EOI 


mod rIm 
Effective Address 


00000 
OS:[EAX] 
00001 
OS:[ECX] 
00010 
OS:[EoX] 
00011 
OS:[EBX] 
00100 
s-i-b is present 
00101 
oS:d32 
00110 
OS:[ESI] 
00111 
OS:[Eol] 


01000 
oS:[EAX+d8] 
01001 
oS:[ECX+d8] 
01 010 
oS:[EoX+d8] 
01 011 
oS:[EBX+d8] 
01100 
s-i-b is present 
01 101 
SS:[EBP+d8] 
01 110 
oS:[ESI+d8] 
01 111 
oS:[Eol+ 
d8] 


Register Specified by reg or rIm 
during 16-Blt Data Operations: 


mod rIm 
Function of w field 


(whenw=O) 
(whenw=1) 


11000 
AL 
AX 
11 001 
CL 
CX 
11 010 
oL 
OX 
11 011 
BL 
BX 
11100 
AH 
SP 
11 101 
CH 
BP 
11110 
oH 
SI 
11111 
BH 
01 


mod rIm 
Effective Address 


10000 
OS:[EAX + d32] 
10001 
OS:[ECX+ d32] 
10010 
OS:[EoX + d32] 
10011 
oS:[EBX + d32] 
10100 
s-i-b is present 
10101 
SS:[EBP+ d32] 
10110 
OS:[ESI+ d32] 
10111 
oS:[Eol + d32] 


11000 
register-see 
below 
11 001 
register-see 
below 
11 010 
register-see 
below 
11 011 
register-see 
below 
11 100 
register-see 
below 
11 101 
register-see 
below 
11110 
register-see 
below 
11111 
register-see 
below 


Register Specified by reg or rIm 
during 32-Blt Data Operations: 


mod rIm 
Function of w field 


(whenw=O) 
(whenw= 
1) 


11000 
AL 
EAX 
11 001 
CL 
ECX 
11 010 
oL 
EoX 
11 011 
BL 
EBX 
11100 
AH 
ESP 
11 101 
CH 
EBP 
11110 
oH 
ESI 
1'1111 
BH 
Eol 


• 
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mod base 
Effective 
Address 


00000 
DS: [EAX + (scaled index)) 
00001 
DS: [ECX + (scaled index)) 
00010 
DS: [EDX + (scaled index)) 
00011 
DS: [ESX + (scaled index)) 
00100 
SS: [ESP + (scaled index)) 
00101 
DS: [d32 + (scaled index)) 
00110 
DS: [ESI + (scaled index)) 
00111 
DS: [ED I+ (scaled index)) 


01000 
DS: [EAX + (scaled index) + dB) 
01001 
DS: [ECX + (scaled index) + dB) 
01010 
DS: [EDX + (scaled index) + dB) 
01 011 
DS: [ESX + (scaled index) + dB) 
01 100 
SS: [ESP + (scaled index) + dB) 
01 101 
SS: [ESP + (scaled index) + dB) 
01 110 
DS: [ESI + (scaled index) + dB) 
01 111 
DS: [EDI + (scaled index) + dB) 


10000 
DS: [EAX + (scaled index) + d32) 
10001 
DS: [ECX + (scaled index) + d32) 
10010 
DS: [EDX + (scaled index) + d32) 
10011 
DS: [ESX + (scaled index) + d32) 
10100 
SS: [ESP + (scaled index) + d32) 
10 101 
SS: [ESP + (scaled index) + d32) 
10110 
DS: [ESI + (scaled index) + d32) 
10111 
DS: [EDI + (scaled index) + d32) 


NOTE: 
Mod field in "mod rim" 
byte; ss, index, base fields in 


"s-i-b" byte. 


ss 
Scale Factor 


00 
x1 
01 
x2 
10 
x4 
11 
xB 


Index 
Index Register 


000 
EAX 
001 
ECX 
010 
EDX 
011 
ESX 
100 
no index reg·· 
101 
ESP 
110 
ESI 
111 
EDI 


""IMPORTANT 
NOTE: 
When index field is 100, indicating "no index register," then 
ss field MUST equal 00. If index is 100 and ss does not 
equal 00, the effective address is undefined. 
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10.2.3.5 
Encoding 
of Operation 
Direction 
(d) Field 


In many two-operand 
instructions 
the d field is pres- 
ent 
to 
indicate 
which 
operand 
is considered 
the 
source 
and which 
is the destination. 


d 
Direction 
of Operation 


0 
Register/Memory 
<- - Register 
"reg" 
Field Indicates 
Source Operand; 
"mod r/m" 
or "mod ss index base" 
Indicates 
Destination 
Operand 


1 
Register <- - Register/Memory 
"reg" 
Field Indicates 
Destination 
Operand; 
"mod r/m" 
or "mod ss index base" 
Indicates 
Source Operand 


10.2.3.6 
Encoding 
of Sign-Extend 
(s) Field 


The s field 
occurs 
primarily 
to instructions 
with im- 
mediate 
data fields. The s field has an effect 
only if 
the size of the immediate 
data is 8 bits and is being 
placed 
in a 16-bit or 32-bit destination. 


Effect 
on 
Effect 
on 
s 
Immediate 
Immediate 
Data8 
Data 16132 


0 
None 
None 


1 
Sign-Extend 
Data8 to Fill 
None 
16-Bit or 32-Bit Destination 


10.2.3.7 
Encoding 
of Conditional 
Test 
(tttn) 
Field 


For the 
conditional 
instructions 
(conditional 
jumps 
and set on condition), 
tttn is encoded 
with n indicat- 
ing to use the condition 
(n = 0) or its negation 
(n = 1), 
and ttt giving the condition 
to test. 


Mnemonic 
Condition 
tttn 
0 
Overflow 
0000 
NO 
No Overflow 
0001 
B/NAE 
Below/Not 
Above or Equal 
0010 
NB/AE 
Not Below/Above 
or Equal 
0011 
E/Z 
Equal/Zero 
0100 
NE/NZ 
Not Equal/Not Zero 
0101 
BE/NA 
Below or Equal/Not Above 
0110 
NBE/A 
Not Below or Equal/Above 
0111 
S 
Sign 
1000 
NS 
Not Sign 
1001 
PIPE 
Parity/Parity Even 
1010 
NP/PO 
Not Parity/Parity Odd 
1011 
lINGE 
Less Than/Not 
Greater or Equal 
1100 
NL/GE 
Not Less Than/Greater 
or Equal 
1101 
LE/NG 
Less Than or Equal/Greater 
Than 
1110 
NLE/G 
Not Less or Equal/Greater 
Than 
1111 


10.2.3.8 
Encoding 
of Control 
or Debug 
or Test 
Register 
(eee) 
Field 


For the 
loading 
and 
storing 
of the 
Control, 
Debug 
and Test registers. 


eeeCode 
Reg Name 


000 
CRO 
010 
CR2 
011 
CR3 


Do not use any other encoding 


eee Code 
Reg Name 


000 
DRO 
001 
DR1 
010 
DR2 


I 
011 
DR3 
110 
DR6 
111 
DR? 


Do not use any other encoding 


eee Code 
Reg Name 


011 
TR3 
100 
TR4 
101 
TRS 
110 
TR6 
111 
TR? 


Do not use any other encoding 


• 


Instruction 
Optional 


First Byte 
Second 
Byte 
Fields 


11011 
OPA 
1 
mod 
1 I 
OPB 
rim 
s-i-b 
I 
disp 


11011 
MF 
OPA 
mod 
OPB 
rim 
s-i-b I 
disp 


11011 
d 
P 
OPA 
1 
1 
OPB 
ST(i) 


11011 
0 
0 
1 
1 
1 
1 I 
OP 


11011 
0 
1 
1 
1 
1 
1 I 
OP 


10.2.4 
ENCODING 
OF FLOATING 
POINT 
INSTRUCTION 
FIELDS 
d = Destination 
O-Destination 
is ST(O) 
1-Destination 
is ST(i) 
Instructions for the FPU assume one of the five 
forms shown in the following table. In all cases, in- 
structions are at least two bytes long and begin with 
the bit pattern 11011B. 


R XOR d = Q-Destination 
(op) Source 
R XOR d = 1-Source 
(op) Destination 


OP = Instruction opcode, possible split into two 
fields OPA and OPB 


ST(i) = Register stack element; 
000 = Stack top 
001 = Second stack element 


MF = Memory Format 
0Q-32-bit 
real 
01-32-bit 
integer 
1Q--{)4-bit real 
11-16-bit 
integer 


•• 
111 = Eighth stack element 


mod (Mode field) and rim (Register/Memory specifi- 
er) have the same interpretation as the correspond- 
ing fields of the integer instructions. 
P = Pop 
Q-Do 
not pop stack 
1-Pop 
stack after operation 
s-i-b (Scale Index Base) byte and disp (displace- 
ment) are optionally present in instructions that have 
mod and rim fields. Their presence depends on the 
values of mod and rim, as for integer instructions. 
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11.0 
DIFFERENCES 
BETWEEN 
THE 
Intel486™ 
MICROPROCESSOR 
AND THE 386 
MICROPROCESSOR 
PLUS THE 


387 MATH COPROCESSOR 
EXTENSION 


The differences between the Intel486 Microproces- 
sor and the 386 Microprocessor are due to perform- 
ance enhancements. The differences between the 
microprocessors are listed below. 
1. Instruction clock counts have been reduced to 


achieve higher performance. See Section 10. 


2. The Intel486 Microprocessor bus is significantly 


faster than the 386 Microprocessor bus. Differ- 
ences include a 1X clock, parity support, burst 
cycles, cacheable cycles, cache invalidate cycles 
and 8-bit bus support. The Hardware Interface 
and Bus Operation Sections (Sections 6 and 7) of 
the data sheet should be carefully read to under- 
stand the Intel486 Microprocessor bus function- 
ality. 


3. To support the on-chip cache new bits have been 


. added to control register 0 (CD and NW) (Section 


2.1.2.1), new pins have been added to the bus 
(Section 6) and new bus cycle types have been 
added (Section 7). The on-chip cache needs tei 
be enabled after reset by clearing the CD and 
NW bit in CRO. 
4. The complete 387 math coprocessor instruction 


set and register set have beer added. No 1/0 
cycles are performed during Floating Point in- 
structions. The instruction and data pointers are 
set to 0 after FINITIFSAVE. Interrupt 9 can no 
longer occur, interrupt 13 occurs instead. 


5. The Intel486 Microprocessor supports new float- 
ing point error reporting modes to guarantee 
DOS compatibility. These new modes required a 
new bit in control register 0 (NE) (Section 2.1.2.1) 
and new pins (FERRI' 
and IGNNE#) (Section 


6.2.13 and 7.2.14). 
6. In some cases FERRII is asserted when the next 


floating point instruction is encountered and in 
other cases it is asserted before the next floating 
point instruction is encountered, depending upon 


the execution state the instruction causing ex- 
ception (see Sections 6.2.13 and 7.2.14). For 
both of these cases, the 387 Math Coprocessor 
asserts ERROR# when the error occurs and 
does not wait for the next floating point instruc- 
tion to be encountered. 


7. Six new instructions have been added: 


Byte Swap (BSWAP) 
Exchange-and-Add (XADD) 
Compare and Exchange (CMPXCHG) 
Invalidate Data Cache (INVD) 
Write-back 
and 
Invalidate 
Data 
Cache 


(WBINVD) 
Invalidate TLB Entry (INVLPG) 


8. There are two new bits defined in control regis- 2 


ter 3, the page table entries and page directory 
entries (PCD and PWT) (Section 4.5.2.5). 


9. A new page protection feature has been added. 


This feature required a new bit in control register 
o (WP) (Section 2.1.2.1 and 4.5.3). 


10. A new Alignment Check feature has been add- 


ed. This feature required a new bit in the flags 
register (AC) (Section 2.1.1.3) and a new bit in 
control register 0 (AM) (Section 2.1.2.1). 


11. The replacement algorithm for the translation 


lookaside buffer has been changed from a ran- 
dom algorithm to a pseudo least recently used 
algorithm like that used by the on-chip cache. 
See Section 5.5 for a description of the algo- 
rithm. 


12. Three new testability registers, TR3, TR4 and 


TR5, have been added for testing the on-chip 
cache. TLB testability has been enhanced. See 
Section 8. 
13. The prefetch queue has been increased from 16 


bytes to 32 bytes. A jump always needs to exe- 
cute after modifying code to guarantee correct 
execution of the new instruction. 


14. After reset, the 10 in the upper byte of the OX 


register is 04. The contents of the base regis- 
ters including the floating point registers may be 
different after reset. 


12.0 
OVERDRIVE 
PROCESSOR 
SOCKET 


Inclusion of the OverDrive Processor Socket in sys- 
tems based on Intel486 OX Microprocessors pro- 
vides the end-user with an easy and cost-effective 
way to increase system performance. The paradigm 
of simply installing an additional component into an 
empty OverDrive Processor Socket to achieve en- 
hanced system performance is familiar to the mil- 
lions of end-users and dealers who have purchased 
Intel Math CoProcessor upgrades to boost system 
floating point performance. The OverDrive Proces- 
sor provides an overall performance increase for 
systems based on Intel486 OX Microprocessors. 


As a new system architectural feature, the provision 
of the OverDrive Processor Socket as a means for 
PC users to take advantage of the ever more rapid 
advances in software and hardware technology will 
help to maintain the competitiveness of X86 PC- 
compatible systems over other architectures into the 
future. 


The majority of upgrade installations which take ad- 
vantage of the OverDrive Processor Socket will be 
performed by end-users and resellers. Therefore, it 
is important that the design be "end-user easy", and 
that the amount of training and technical expertise 
required to install the OverDrive Processor be mini- 
mized. Upgrade installation instructions should be 
clearly described in the system user's manual. In ad- 
dition, by making installation simple and foolproof, 
PC manufacturers can reduce the risk of system 
damage, warranty claims and service calls. Feed- 
~ack from Intel's Math CoProcessor customers high- 
lights three main characteristics of end-user easy 
designs: accessible OverDrive Processor Socket lo- 
cation, clear indication of component orientation, 
and minimization of insertion force. 


OverDrive 
Processor 
Socket 
Location: The Over- 
Drive Processor Socket for Intel486 OXand Intel486 
SX Microprocessor based systems is an empty 
socket which can be located on either the mother- 
board or modular CPU card. The OverDrive Proces- 
sor Socket should be easily accessible for installa- 
tion and readily visible when the PC case is re- 
moved. The OverDrive Processor Socket should not 
be located in a position that requires removal of any 
other hardware (such as hard disk drives) in order to 
install the OverDrive Processor. Since Math CoProc- 
essor sockets are typically found near the CPU 
socket on the motherboard, similarly locating the 
OverDrive Processor Socket near the CPU further 
adds to the ease of installation. 


Component 
Orientation: 
The most common mis- 
take made by end-users and resellers when install- 
ing Math CoProcessor upgrades is incorrect orienta- 
tion of the chip. This can result in irreversible dam- 
age to the chip and/or the PC. To solve this prob- 
lem, Intel has designed the OverDrive Processor 
with a 169 pin Pin Grid Array (PGA) pinout, with the 
169th pin as a non-electrical "key pin" used to en- 
sure proper orientation of the OverDrive Processor 
by the PC user. The OverDrive Processor Socket 
should, therefore, be a 169 pin PGA socket compati- 
ble with the OverDrive Processor pinout.<1)In addi- 
tion, the location of the key pin should be clearly 
marked on the motherboard or CPU card, for exam- 
ple by silk screeni~g. 


Insertion 
Force: The third major concern voiced by 
end-users refers to how much pressure should be 
exerted on the chip and PC board for proper installa- 
tion without damage. This becomes even more of a 
concern with the larger 169 pin components which 
require up to 150 pounds of pressure for insertion 
into a standard screw machine socket. This level of 
pressure can easily result in cracked traces and 
stress to solder joints. To minimize the risk of sys- 
tem damage, it is recommended that a Zero Inser- 
tion Force (ZIF) socket be used for the OverDrive 
Processor Socket. Designing with a ZIF socket elimi- 
nates the need to design in additional structural sup- 
port to prevent flexing of the PC board during instal- 
lation, and results in improved end-user and reseller 
product satisfaction due to easy "drop-in" installa- 
tion. 


12.1 
OverDrive 
Processor 
Overview 


The Intel OverDrive Processor is essentially an en- 
hanced Intel486 Microprocessor. There are three 
functional differences between the Intel OverDrive 
Processor and Intel486 Microprocessors. First, the 
Intel OverDrive Processor has an internal clock dou- 
bling circuit which decreases the time required to 
execute instructions. Second, the Intel OverDrive 
Processor does not support the JTAG boundary 
scan test feature (available with the PQFPversion of 
the Intel486 OX Microprocessor). Third, the Intel 
OverDrive Processor has a different CPU revision 
identification than the Intel486 OXCPU. These three 
differences are described in the following sections 
according to how they effect the CPU functionality. 


The Intel OverDrive Processor bus has been de- 
signed to be identical with the Intel486 Microproces- 
sor bus. Although the external clock is internally 
doubled and data and instructions are manipulated 
in the CPU core at twice the external frequency, the 
external bus is functionally identical with the Intel486 
CPU. 


The four boundary scan test signals (TCK, Test 
clock; TMS, Test Mode select; TDI, Test Data Input; 
TOO, Test Data Output), defined for the PQFP Intel 
486 SX CPU, are not specified for the Intel Over- 
Drive Processor. 


The UPI' (Upgrade Present) signal, which is defined 
as an input for the PQFP Intel486 CPU, is an output 
signal on the Intel OverDrive Processor. The UPI' 
pin on the Intel OverDrive Processor provides a logi- 
cal low output signal which can be used to enable 
logic to recognize and configure the system for the 
Intel OverDrive Processor. 


The OX register always contains the component 
identifier at the conclusion of RESET. The Intel 
OverDrive Processor has a different revision identifi- 
er in the DL register than the Intel486 OX Microproc- 
essor. When the OverDrive Processor is installed in 
a system the component identifier is supplied by the 
OverDrive Processor, rather than the original CPU. 
The stepping identification portion of the component 
identification will change with different revisions of 
the OverDrive Processor. The designer should only 
assume that the component identification for the 
OverDrive Processor will be 043xH, where 'x' is the 
stepping identifier. 


As detailed in Section 13.1.1, the Intel OverDrive 
Processor does not support the JTAG boundary 
scan testability feature. 


The Intel OverDrive Processor supports all Intel486 
extensions to the 
8086/80186/80286 
instruction 
set. In general, instructions will execute faster on the 
Intel OverDrive Processor than the Intel486 Micro- 
processor. Specifically, an instruction that only uses 
memory from the on-chip cache executes at the full 
core clock rate while all bus accesses execute at the 
bus clock rate. To calculate the elapsed time of an 
instruction, the number of clock counts for that in- 
struction must be multiplied by the clock period for 
the system. The instruction set clock count summary 
tables from Section 10.0 can be used for the Over- 
Drive Processor witth the following modifications: 
- 
Clock counts for a cache hit: This value repre- 
sents the number of internal CPU core clocks for 
an instruction that requires no external bus ac- 
cesses or the base core clocks for an instruction 
requiring external bus accesses. 


- 
Penalty clock counts for a cache miss: This value 
represents the worst-case approximation of the 
additional number of external clock counts that 
are required for an instruction which must access 


the external bus for data (a cache miss). This num- 
ber must be multiplied by 2 to convert it to an equal 
number of internal CPU core clock counts and add- 
ed to the base core clocks to compute the total 
number of core clocks for this instruction. 


The actual number of core clocks for an instruction 
with a cache miss may be less than the base clock 
counts (from the cache hit column) plus the penalty 
clock counts (2 times the cache miss column num- 
ber). The clock counts in the cache miss penalty 
column can be a cumulative value of external bus 
clocks (for data reads) and internal clocks for manip- 
ulating the data which has been loaded from the ex- 
ternal bus. The number of clocks which are related 
to external bus accesses are correctly represented 
in terms of internal core clocks by multiplying by two. 
However, the clock counts related to internal data 
manipulation should not be multiplied by two. There- 
fore the total number of CPU core clock counts for 
an instruction with a cache miss represents a worst- 
case approximation. 


To calculate the execution time for an OverDrive 
Processor instruction, multiply the total CPU core 
clock counts by the core clock period. For example, 
in a 25 MHz system the core clock period is 50 ns 
(1/50 MHz). 


Additionally, 
the 
assumptions 
specified 
below 
should be understood in order to estimate instruc- 
tion execution time. 


A cache miss will force the OverDrive Processor to 
run an external bus cycle. The Intel486 OX micro- 
processor 32-bit burst bus is defined as r-b-w. 


Where: 
r = 
The number of bus clocks in the first cycle of a 
burst read or the number of clocks per data 
cycle is a non-burst read. 


b = 
The number of bus clocks for the second and 
subsequent cycles in a burst read. 


w = The number of bus clocks for a write. 


The fastest bus the OverDrive Processor can sup- 
port is 2 - 1- 2 assuming 0 wait states. The clock 
counts in the cache miss penalty column assume a 
2 -1 - 2 bus. For slower busses add r- 2 clocks to 
the cache miss penalty for the first dword accessed. 
Other factors also affect instruction clock counts. 


Instruction 
Clock Count Assumptions 
1. The external QUsis available for reads or writes at 
all times. Else add bus clocks to reads until the 
bus is available 
2. Accesses are' aligned. Add three core clocks to 
each misaligned access. 


• 


3. Cache fills complete before subsequent accesses 


to the same line. If a read misses the cache dur- 
ing a cache fill due to a previous read or prefetch, 
the read must wait for the cache fill to complete. If 
a read or write accesses a cache line still being 
filled, it must wait for the fill to complete. 


4. If an effective address is calculated, the base reg- 


ister is not the distiriation register of the preceding 
instruction. If the base register is the destination 
register of the preceding instruction add 1 to the 
core clock counts shown. Back-to-back PUSH 
and POP instructions are ~ot affected by this rule. 


5. An effective address calculation uses one base 


register and does not use an index register. 
However, if the effective address calculation 
uses an index register. 1 core clock may be add- 
ed to the clock shown. 


6. The target of a jump is in the cache. If not, add r 
clocks for accessing the destination instruction 
of a jump. If the destination instruction is not 
completely contained in the first dword read, add 
a maximum of 3b bus clocks. If the destination 
instruction is not completely contained in the first 
16 byte burst, add a maximum of another r+ 3b 
bus clocks. 


7. If no write buffer delay, w bus clocks are added 
only in the case in which all write buffers are full. 


8. Displacement and immediate not used together. 


If displacement and immediate used together, 1 
core clock may be added to the core clock count 
shown. 


9. No invalidate cycles. Add a delay of 1 bus clock 


for each invalidate cycle if the invalidate cycle 
contends for the internal cache/external 
bus 


when the OverDrive Processor needs to use it. 


10. Page translation hits in TLB. A TLB miss will add 


13, 21 or 28 bus clocks + 1 possible core clock 
to the instruction depending on whether the Ac- 
cessed and/or Dirty bit in neither, one or both of 
the page entries needs to be set in memory. This 
assumes that neither page entry is in the data 
cache and a page fault does not occur on the 
address translation. 
11. No exceptions are detected during instruction 


execution. Refer to interrupt core,Clock Counts 
Table for extra clocks if an interrupt is detected. 


12. Instructions that read multiple consecutive data 


items (Le.,task switch, POPA, etc.) and miss the 
cache are assumed to start the first access on a 
16-byte boundary. If not, an extra cache line fill 
may be necessary which may add up to (r+3b) 
bus clocks to the cache miss penalty. 
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12.2 
Intel OverDrive™ 
Processor 
Circuit Design 


Figure 12.1 shows the 
interface circuit for the 
Intel486 DX CPU and the OverDrive Processor sock- 
et. This circuit allows Intel486 DX CPU-based sys- 
tems to be upgraded with the OverDrive Processor. 


12.2.1 
UPGRADE 
CIRCUIT 
FOR PGA INTEL486 
OX BASED 
SYSTEMS 


The Intel OverDrive Processor Socket Circuit for In- 
tel486 DX CPU based systems allows the Intel486 
DX CPU complete control of the system when the 
Intel OverDrive Processor Socket is unpopulated. 
The HLDA signal from the Intel OverDrive Processor 
Socket should be tied low through a resistor while 
the UP# and FERR# signals from the Intel Over- 
Drive Processor Socket should be tied high through 
a resistor to insure that the Intel486 DX CPU func- 
tions correctly when an Intel OverDrive Processor 
Socket component is not installed. 


When the Intel OverDrive Processor is installed, the 
Upgrade Present output, UP# 
pin, causes the 
FLUSH# and BOFF# signals to be driven active to 
the Intel486 DX CPU. When the Intel486 DX CPU 


DATA 
ADDR 
CTRL 


CLK 
HLDA 


FLUSH# 


Intel 


OVERDRIVE'" 
UP# 


PROCESSOR 


samples FLUSH# active during reset, the Intel486 
DX CPU enters tri-state output test mode after reset, 
which causes the Intel486 DX CPU to float all of its 
output signals. To float most of the Intel486 DX 
CPU's output pins before the end of reset, BOFF# 
is also driven active to the Intel486 DX CPU. BOFF# 
immediately causes all output signals to float except 
PCHK#, BREQ, HLDA and FERR#. 


In addition to floating the Intel486 DX CPU's outputs, 
the Intel486 DX CPU's HLDA and FERR# signals 
must be gated to prevent potential bus contention 
with the Intel OverDrive Processor's HLDA and 
FERR# signals during reset. During reset the In- 
tel486 DX CPU may not recognize HOLD active be- 
cause BOFF# is driven active to the Intel486 DX 
CPU by the Intel OverDrive Processor. If the Intel486 
DX CPU does not recognize HOLD active, it will not 
drive HLDA active. However, the Intel OverDrive 
Processor will recognize HOLD active and drive 
HLDA. By gating the HLDA signals from the Intel486 
DX CPU and Intel OverDrive Processor Socket, bus 
contention is avoided if HOLD is driven active during 
reset. Because the state of FERR# is undefined 
during reset, bus contention is also avoided by gat- 
ing FERR#. 


FLUSH# 
PGA 
1486™DX 


BOFF# 


• 


This section 
discusses 
three 
aspects 
for the Over- 
Drive Processor 
Socket: 
size, upgradability, 
and ven- 
dors. 


The OverDrive 
Processor 
Socket for Intel486 
DX mi- 
croprocessor-based 
systems 
is equivalent 
to a stan- 
dard 169-lead 
PGA package. 


The 
OverDrive 
Processor 
will 
be 
provided 
with 
a 
heat 
sink 
attached 
(see 
Figure 
12-2), 
to dissipate 
heat. 


The 
maximum 
and 
minimum 
dimensions 
of 
the 
OverDrive 
Processor 
package 
with the heat sink are 
shown 
in Table 
12-1. 
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Table 12.1. OverDrive 
Processor, 
169·Pin, PGA 
Package 
Dimensions 
with Heat Sink Attached 


Dimension 
(Inches) 
Minimum 
Maximum 


A. 
Heat Sink Width 
1.520 
1.550 


B. PGA Package 
1.735 
1.765 
Width 


C. Heat Sink Edge 
0.065 
0.155 
Gap 


D. Heat Sink Height 
0.212 
0.260 


E. Adhesive 
0.008 
0.012 
Thickness 


F. Package 
Height 
0.140 
0.180 
from Stand-Offs 


. G. Total Height from 
0.360 
0.452 
Stand-Offs 
to Top 
of Heat Sink 


B 
1t 
,. 
A 


OVERDRIVE 
PROCESSOR 
r 


Ot.4NI-DIRECTIONAL 
HEAT 
SINK 
D 
G 


./ADHESIVE 
U 


UPGRADE 
PROCESSOR, 
169 
PIN, 
PGA 
PACKAGE 
Tf 


I 
I 
1 
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450 
CHAMFER 


(INDEX 
CORNER) 


SEATING 


PLANE I 


0B 
(ALL 
PINS) I 
Fc=:Oij 


SWAGGED 
PIN 
DETAIL 


SWAGGED 
PIN 


(4 PL) 


Al 


BASE 
PLANE 


Family: Ceramic 
Pin Grid Array Package 


Symbol 
MIllimeters 
Inches 


Mln 
Max 
Notes 
Mln 
Max 
Notes 


A 
3.56 
4.57 
0.140 
0.180 


A1 
0.64 
1.14 
SOLID LID 
0.025 
0.045 
SOLID LID 


A2 
2.8 
3.5 
SOLID LID 
0.110 
0.140 
SOLID LID 


A3 
1.14 
1.40 
0.045 
0.055 


B 
0.43 
0.51 
0.017 
0.020 


D 
44.07 
44.83 
1.735 
1.765 


D1 
40.51 
40.77 
1.595 
1.605 


81 
2.29 
2.79 
0.090 
0.110 


L 
2.54 
3.30 
0.100 
0.130 


N 
169 
169 


81 
1.52 
2.54 
0.060 
0.100 


ISSUE 
IW8 
REVX 
7/15/88 
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Letter or 
Description of Dimensions 
Symbol 


A 
Distance 
from seating plane to highest point of body 


A1 
Distance 
between 
seating plane and base plane (lid) 


A2 
Distance 
from base plane to highest point of body 


A3 
Distance 
from seating plane to bottom of body 


B 
Diameter 
of terminal 
lead pin 


D 
Largest overall package 
dimension 
of length 


D1 
A body length dimension, 
outer lead center to outer lead center 


e1 
Linear spacing 
between 
true lead position 
centerlines 


L 
Distance 
from seating plane to end of lead 


81 
Other body dimension, 
outer lead center to edge of body 


NOTES: 
1. Controlling dimension: millimeter. 
2. Dimension "el" 
("e") is non-cumulative. 
3. Seating plane (standoff) is defined by P.C. board hole size: 0.0415-0.0430 inch. 
4. Dimensions "B", "Bl" 
and "C" are nominal. 
5. Details of Pin 1 identifier are optional. 
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5ECTION 
A-A 


PC buyers value easy and safe upgrade 
installation. 
PC manufacturers 
can make the Intel OverDrive 
Proces- 
sor installation 
in the Intel OverDrive 
Processor 
Socket 
simple and foolproof 
for the end user and reseller 
by 
implementing 
the suggestions 
listed in Table 
12-3. 


"End User Easy" 
Implementation 
Feature 


Visible OverDrive 
Processor 
Socket 
The Intel OverDrive 
Processor 
Socket should be easily visible when 
the PC's cover is removed. 
Label the. Intel OverDrive 
Processor 
Socket 
and the location 
of pin 1 by silk screening 
this information 
on the PC 
board. 


Accessible 
Overdrive 
Processor 
Make the Intel OverDrive 
Processor 
Socket easily accessible 
to the 
Socket 
end user (I.e., do not place the Intel OverDrive 
Processor 
Socket 
under 
a disk drive). If a Low Insertion 
Force (L1F) or screw machine 
socket is 
used, position 
the Intel OverDrive 
Processor 
Socket on the PC board 
such that there is ample clearance 
around the socket. 


Foolproof 
Chip Orientation 
Intel packages 
all Intel OverDrive 
Processors 
in a 169-pin, PGA 
package. 
The 169th pin is called the "key" 
pin and insures that the 
Intel OverDrive 
Processor 
fits into a 169-pin socket 
in only the correct 
orientation. 
Supplying 
a 169-pin socket as the Intel OverDrive 
Processor 
Socket eliminates 
the possibility 
of end users or resellers 
damaging 
the PC board or Intel OverDrive 
Processor 
by powering 
up 
the system with the Intel OverDrive 
Processor 
incorrectly 
oriented. 


Zero Insertion 
Force 
The high pin count of the Intel OverDrive 
Processor 
makes the 
Upgrade Socket 
insertion 
force required for installation 
in a screw machine 
PGA socket 
excessive 
for end users or resellers. 
Even most Low Insertion 
Force 
(L1F) sockets 
often require more than 60 Ibs. of insertion 
force. A Zero 
Insertion 
Force (ZIF) socket insures that the chip insertion 
force does 
not damage the PC board. If the ZIF socket 
has a handle, be sure to 
allow enough clearance 
for the socket 
handle. If a L1F or screw 
machine 
socket is used, additional 
PC board support 
is recommended. 


"Plug and Play" 
Jumper or switch changes 
should not be needed to electrically 
configure 
the system for the Intel OverDrive 
Processor. 


Thorough 
Documentation 
Describe 
the Intel OverDrive 
Processor's 
installation 
procedure 
in the 
PC's User's Manual. 
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The following lists provide examples of sockets 
which can be used as the Intel OverDrive Socket for 
Intel486 OX CPU based systems. 


NOTE: 
This is not a comprehensive list. Intel has not test- 
ed the sockets listed below and cannot guarantee 
that these sockets will meet every PC manufactur- 
er's specific requirements. 


Zero Insertion 
Force 
Upgrade 
Sockets 
and 
Vendors: 


1. AMP Inc. 


P.O. Box 3608 
Harrisburg, PA 17105-3608 
Tel: (800) 522-6752 
Part Number: 55287-3 
Contact: Rick Simonic, New Product Manager 
(717) 561-6143 


2. Aries Electronics 
P.O. Box 130 
Frenchtown, NJ 08825 
Tel: (908) 996-6841 
Part Number: 169-PRS17012-10 
Contact: Frank Folmsbee, Marketing Manager 
(908) 996-6841 


3. JAE 
599 N. Mathilda Ave., Suite 8 
Sunnyvale, CA 94086 
Tel: (408) 733-0493 
Part Number: PCPS-169-002 
Contact: Bob Gerleman, Western Sales Manager 
(408) 733-0493 


4. Thomas and Betts 
200 Executive Center Drive 
P.O. Box 24901 
Greenville, SC 29616-2401 
Tel: (803) 676-2900 
Part Number: PGA169A17-S-1AC 
Contact: Scott Roland, 
Product Marketing Manager 
(803) 676-2910 


5. Yamaichi Electronics 
1420 Koll Circle, Suite B 
San Jose, CA 95112 
Tel: (408) 452-0797 
Part Number: NP111-16911-G4 
Contact: Jim Bennett, Sales Manager 
(408) 452-0797 


1. AMP Inc. 
P.O. Box 3608 
Harrisburg, PA 17105-3608 
Tel: (800) 522-6752 
Part Number: 
(Premium Base Material) 55589-5 
(Standard Base Material) 916227-3 


2. Thomas and Betts 
200 Executive Center Drive 
P.O. Box 24901 
Greenville, SC 29616-2401 
Tel: (803) 676-2900 
Part Number: LPG169A17-S-1AC 


12.4 Thermal Management 


The OverDrive Processor Socket must be designed 
to dissipate the heat generated by the OverDrive 
Processor. In the following Sections the airflow re- 
quired over the OverDrive Processor Socket is cal- 
culated for a hypothetical system design. 


12.4.1 
THERMAL 
CALCULATIONS 
FOR 
HYPOTHETICAL 
SYSTEM 


The maximum temperature specification for the 
OverDrive Processor is 85°C (with heat sink at- 
tached). Therefore, the temperature of the heat sink 
surface (TS) cannot exceed 85°C under the worst 
case specified operating conditions for the system. 
The variables which affect the heat sink temperature 
include ambient temperature inside the system box 
(TA), Vee, and Ice. An equation for the approximate 
OverDrive Processor temperature (Ts) is: 


In the above equation, the variables under worst 
case conditions are specified as follows: 
. 


TS: 
Specified as 85°C for the OverDrive Processor 
(See Figure 12-5). 


TA: 
Specified by the PC manufacturer for the 
worst case system operating conditions. 


Vee: Specified for the OverDrive Processor as 5V. 
Ice: 
Specified for the OverDrive Processor and re- 
lated to clock frequency. 


8SA: 
8SA = 8JA - 
8JS· 
8JA and 8JS are specified in Table 13-4. 


Figure 12.5. Heat Sink Measurement (0.005" Dla. Thermocouple) 
on the Center of Heat Sink with a goo 
Angle Adhesive Bond Through a Hole Drilled Through the Center of the Name Plate. 


The OverDrive Processor for Intel486 OX CPU- 
based systems will be provided with a heat sink. The 


(JJSand (JJAvalues for the OverDrive Processor with 
a heat sink are shown in Table 12-4. The maximum 
TA values for the 25 MHz and 33 MHz OverDrive 
Processor are shown in Table 12-5. The maximum 
TA values shown in Table 12-5 were calculated us- 
ing TS = 85°C, VCC= 5V, the maximum Icc values, 
and the (JJA and (JJS values shown in Table 12-4. 


NOTE: 
°The thermal 
resistance 
from the junction 
to ambient 
(/lJAl 
in static 
air is actually 
a linear function 
of power dissipation. 
The 
value 
shown 
in the table 
(14.0 
°C/W) 
represents 
the 
worst 
case expected 
value. 


Table 12.5. Maximum TA for 25 MHz and 
33 MHz OverDrive Processor 


OverDrive 
Linear Airflow (ft/mln) 
Processor 
wIth Heat 
felK 
Sink 
(MHz) 
0 
200 
400 
600 
800 


TA 
25 
30 
49 
61 
67 
70 
(0G) 
33 
16 
40 
55 
63 
66 


12.4.2 HEAT SINKS 


The OverDrive Processor is shipped with a heat sink • 
attached. Because of the heat sink, it is vital that 
vertical clearance is provided for the OverDrive 
Processor Socket. The height of the package and 
the heat sink is shown in Table 12-1 in Section 
12.2.1. 


The following should be considered when designing 
the Upgrade Socket for a Intel486 DX2 microproces- 
sor-based system. 


12.5.1 INTEL OVERDRIVE PROCESSOR 
DETECTION 


The 
component 
identifier 
and 
stepping/revision 
identifier for the Intel OverDrive Processor is read- 
able in the DH and DL registers respectively, imme- 
diately after RESET, where 


DH = 15h 


DL = 30h-3Fh 


As it is difficult to differentiate betwee the Intel486 
DX CPU and the Intel OverDrive Processor in soft- 
ware, it is recommended that the BIOS save the 
contents of the DX register, immediately after RE- 
SET, so that this information can be used later, if 
required, to identify an Intel OverDrive Processor in 
the system. 


The Intel OverDrive Processor executes instructions 
at twice the frequency of the input clock. Thus soft- 
ware (or instuction based) timing loops will execute 
faster on the Intel OverDrive Processor than on the 
Intel486 DX or Intel486 SX CPU (at the same input 
clock frequency). Instructions such as NOP, LOOP, 
and JMP$ + 2, have been used by BiOS to imple- 
ment timing loops that are required, for example, to 
enforce recovery time between consecutive access- 
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es for I/O devices. These instruction based, timing 
loop implementations may require modification for 
systems intended to be upgradable with the Intel 
OverDrive Processor. 


In order to avoid any incompatibilities, it is recom- 
mended that timing requirements be iimplemented in 
hardware rather than in software. This provides 
transparency and also does not require any change 
in BIOS or I/O device drivers in the future when 
moving to higher processor clock speeds. As an ex- 
ample, a timing routine may be implemented as fol- 
lows: The software performs a dummy I/O instruc- 
tion to an unused I/O port. The hardware for the bus 
controller logic recognizes this I/O instruction and 
delays the termination of the I/O cycle to the CPU 
by keeping RDYII or BRDYII deasserted for the ap- 
propriate amount of time. 


A 
B 
C 
D 
E 
F 
G 
H 
J 
K 
L 
M 
N 
P 
Q 
R 
S 


1 
/020 
019 
011 
09 
VSS 
OP1 
VSS 
VSS 
VCC 
VSS 
VSS 
VSS 
02 
DO 
A~ 1 
A28 
A27 
1 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


2 
022 
021 
018 
01~ 
VCC 
08 
VCC 
O~ 
05 
Vec 
06 
Vec 
01 
A29 
VSS 
A25 
A26 
2 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


3 
NC 
Vss 
CLK 
017 
010 
015 
012 
OP2 
016 
014 
07 
04 
OPO 
A~O 
A17 
Vee 
A2~ 
3 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


4 
02~ 
Vss 
VCC 
KEY 
A19 
VSS 
NC 
4 
0 
0 
0 
0 
0 
0 
0 


5 
OP~ 
Vss 
Vce 
A21 
A18 
AI4 
5 
0 
0 
0 
0 
0 
0 


6 
024 
025 
027 
A24 
Vee 
Vss 
6 
0 
0 
0 
0 
0 
0 


7 
Vss 
VCC 
026 
A22 
A15 
A12 
7 
0 
0 
0 
0 
0 
0 


8 
029 
0~1 
028 
A20 
Vec 
Vss 
8 
0 
0 
0 
0 
0 
0 


9 
Vss 
Vce 
Intel 
OVERDRIVETM 
PROCESSOR 
Vee 
Vss 
9 
o~o 
A16 
0 
0 
0 
PIN 
SIDE 
VIEW 
0 
0 
0 


10 
NC 
Ne 
NC 
Al~ 
Vec 
Vss 
10 
0 
0 
0 
0 
0 
0 


11 
Vss 
Vce 
NC 
A9 
Vee 
Vss 
11 
0 
0 
0 
0 
0 
0 


12 
NC 
NC 
NC 
AS 
All 
vss 
12 
0 
0 
0 
0 
0 
0 


13 
FERR" 
NC 
NC 
A7 
A8 
AID 
13 
0 
0 
0 
0 
0 
0 


14 
HC 
UP" 
NC 
A2 
Vee 
Vss 
14 
0 
0 
0 
0 
0 
0 


15 
IGNNE" 
HIoU 
FLUSH" 
A20t.4" 
HOLD 
KEN" 
NC 
BROY" 
BE2" 
BED" 
PWT 
o/c" 
LOCK" 
HLOA 
BREQ 
A~ 
A6 
15 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


16 
INTR 
HC 
RESET 
BS8" 
Vcc 
ROY" 
Vce 
Vec 
BEl" 
Vec 
Vce 
Vce 
t.4/10" 
Vee 
PLOCK" 
BLAST" 
A4 
16 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


17 
AHOLO 
[ADS" 
8516" 
sorr" 
Vss 
BU" 
VSS 
VSS 
PCO 
Vss 
Vss 
VSS 
W/R" 
VSS 
PCHK" 
NC 
ADS" 
17 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


A 
B 
C 
D 
E 
F 
G 
H 
J 
K 
L 
M 
N 
P 
Q 
R 
S 


240440-A2 
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S 
R 
Q 
P 
N 
M 
L 
K 
J 
H 
G 
F 
E 
0 
C 
B 
A 


1 
A27 
A28 
A31 
00 
02 
VSS 
VSS 
VSS 
Vee 
VSS 
VSS 
OPI 
VSS 
09 
011 
019 
02~ 
1 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


2 
A26 
A25 
Vss 
A29 
01 
Vee 
06 
Vee 
05 
03 
Vee 
08 
Vee 
013 
018 
021 
022 
2 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


3 
A23 
Vee 
Al7 
A30 
OPO 
04 
07 
014 
016 
OP2 
012 
015 
010 
017 
CLK 
VSS 
NC 
3 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


4 
NC 
Vss 
AI9 
KEY 
Vee 
VSS 
023 
4 
0 
0 
0 
0 
0 
0 
0 


5 
AU 
A18 
A21 
Vee 
VSS 
OP3 
5 
0 
0 
0 
0 
0 
0 


6 
VSS 
Vee 
A24 
027 
025 
024 
6 
0 
0 
0 
0 
0 
0 


7 
A12 
A15 
An 
026 
Vee 
Vss 
7 
0 
0 
0 
0 
0 
0 


8 
Vss 
Vee 
A20 
028 
031 
029 
8 
0 
0 
0 
0 
0 
0 


9 
Vss 
Vee 
A16 
Intel 
OVERORIVETM 
PROCESSOR 
030 
Vee 
Vss 
9 
0 
0 
0 
0 
0 
0 


Vss 
Vee 
TOP 
SIDE 
VIEW 


10 
A13 
NC 
NC 
NC 
10 
0 
0 
0 
0 
0 
0 


1 1 
Vss 
Vee 
A9 
NC 
Vee 
Vss 
1 1 
0 
0 
0 
0 
0 
0 


12 
vss 
All 
AS 
NC 
NC 
NC 
12 
0 
0 
0 
0 
0 
0 


13 
AID 
A8 
A7 
NC 
NC 
fERR. 
13 
0 
0 
0 
0 
0 
0 


14 
Vss 
Vee 
A2 
NC 
UP. 
NC 
14 
0 
0 
0 
0 
0 
0 


15 
A6 
A3 
BREQ 
HLOA 
LOCK. 
o/c. 
PWT 
BED· 
BE2· 
BROY· 
NC 
KEN· 
HOLD 
A20M. 
flUSH. 
NMI 
IGNNE. 
15 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


16 
A4 
BLAST. 
PLOCK. 
Vee 
M/IO. 
Vee 
Vee 
Vee 
BEl. 
Vee 
Vee 
ROY. 
Vee 
Bs8· 
RESET 
NC 
INTR 
16 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


17 
ADS· 
NC 
PCHK· 
Vss 
W/R. 
Vss 
Vss 
Vss 
PCO 
Vss 
Vss 
BE3. 
Vss 
serf. 
8516. 
(AOS. 
AHOLO 
17 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


S 
R 
Q 
P 
N 
M 
L 
K 
J 
H 
G 
F 
E 
0 
C 
B 
A 


~ 
240440-A3 


Address 
Data 
Control 
HIC 
Vcc 
Vss 


A2 
014 
Do 
P1 
A20M# 
015 
A10 
87 
A7 
A3 
R15 
01 
N2 
A08# 
817 
A12 
89 
A9 
A4 
816 
02 
N1 
AHOLO 
A17 
A14 
811 
A11 
A5 
012 
03 
H2 
8EO# 
K15 
812 
C4 
83 
A6 
815 
04 
M3 
8EU 
J16 
813 
C5 
84 
A7 
013 
05 
J2 
8E2# 
J15 
C10 
E2 
85 
A8 
R13 
06 
L2 
8E3# 
F17 
C13 
E16 
E1 
A9 
011 
07 
L3 
8LA8T# 
R16 
G15 
G2 
E17 
AlO 
813 
08 
F2 
80FF# 
017 
R17 
G16 
G1 
A11 
R12 
09 
01 
8ROY# 
H15 
84 
H16 
G17 
A12 
87 
010 
E3 
8REO# 
015 
A3 
J1 
H1 
A13 
010 
011 
C1 
888# 
016 
810 
K2 
H17 
A14 
85 
012 
G3 
8816# 
C17 
816 
K16 
K1 
A15 
R7 
013 
02 
CLK 
C3 
C11 
l16 
K17 
A16 
09 
014 
K3 
O/C# 
M15 
C12 
M2 
L1 
A17 
03 
015 
F3 
OPO 
N3 
C14 
M16 
L17 
A18 
R5 
016 
J3 
OP1 
F1 
P16 
M1 
A19 
04 
017 
03 
OP2 
H3 
R3 
M17 
A20 
08 
018 
C2 
OP3 
A5 
R6 
P17 
A21 
05 
019 
81 
EA08# 
817 
R8 
02 
A22 
07 
020 
A1 
FERR# 
A13 
R9 
R4 
A23 
83 
021 
82 
FLU8H# 
C15 
R10 
86 
A24 
06 
022 
A2 
HLOA 
P15 
R11 
88 
A25 
R2 
023 
A4 
HOLD 
E15 
R14 
89 
A26 
82 
024 
A6 
IGNNE# 
A15 
810 
A27 
81 
025 
86 
INTR 
A16 
811 
A28 
R1 
026 
C7 
KEN# 
F15 
812 
A29 
P2 
027 
C6 
LOCK# 
N15 
814 
A30 
P3 
028 
C8 
MIIO# 
N16 
A31 
01 
029 
A8 
NMI 
815 
030 
C9 
PCO 
J17 
031 
88 
PCHK# 
017 
PWT 
L15 
PLOCK # 
016 
ROY# 
F16 
RE8ET 
C16 
UP# 
814 
W/R# 
N17 
KEY 
04 
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Symbol 
Type 
Name and Function 


Intel486 
DX2 CPU INTERFACE 


UP# 
0 
The Upgrade Present 
pin is used to signal the Intel486 
OX microprocessor 
to float its 
outputs 
and get-off the bus. It is active low and is never floated. 
UP# 
is driven low at 
power-up 
and remains active for the entire duration 
of the Upgrade 
Processor 
operation. 


KEY PIN 


KEY 
The Key pin is an electrically 
non-functional 
pin which is used to ensure correct 
orientation 
for 169-pin upgrade products. 


OverDrive 
Processor 
is compatible 
to the maximum 
ratings 
and A.C. Specifications 
of the 
Intel486 
OX 
Microprocessor. 
Table 
12-8 provides 
the D.C. Oper- 


ating Conditions 
for the OverDrive 
Processor. 


The electrical 
specifications 
in this section 
represent 
the electrical 
interface 
of the Upgrade 
Processor 
for 
a Intel486 
OX microprocessor-based 
system. 
The 


Table 
12·8. Intel OverDrive™ 
Processor 
Socket 
D.C. Parametric 
Values(1) 


Symbol 
Parameter 
Min 
Max 
Unit 
Notes 


Vil 
Input low 
Voltage 
-0.3 
+0.8 
V 


VIH 
Input High Voltage 
2.0 
Vcc 
+ 0.3 
V 


VOL 
Output low 
Voltage 
0.45 
V 
(Note 2) 


VOH 
Output High Voltage 
2.4 
V 
(Note 3) 


ICC 
Power Supply Current 
ClK 
= 25 MHz 
950 
mA 
(Note 4) 
ClK 
= 33 MHz 
1200 


III 
Input leakage 
Current 
±15 
p..A 
(Note 5) 


IIH 
Input leakage 
Current 
200 
p..A 
(Note 6) 


III 
Input leakage 
Current 
-400 
p..A 
(Note 7) 


ILO 
Output 
leakage 
Current 
±15 
p..A 


CIN 
Input Capacitance 
13 
pF 
Fc = 1 MHz(8) 


Co 
I/O or Output Capacitance 
17 
pF 
Fc = 1 MHz(8) 


CClK 
ClK 
Capacitance 
15 
pF 
Fc = 1 MHz(8) 


NOTES: 
1. Functional operating range: Vcc = 5V; Ts = O'C to +85'C. 
2. This parameter is measured at: 


- 
Address, Data, BEn 
4.0 mA 
- 
Definition, Control 
5.0 mA 
3. This parameter is measured at: 


- 
Address, Data, BEn 
-1.0 
mA 
- 
Definition, Control 
-0.9 
mA 
4. Typical supply current: 


775 mA @ ClK = 25 MHz 
975 mA @ ClK = 33 MHz 
5. This parameter is for inputs without pullups or pulldowns and 0 ,;; VIN ,;; Vcc. 
6. This parameter is for inputs with pulldowns and VIH = 2.4V. 
7. This parameter is for inputs with pullups and VIL = 0.45V. 
8. Not 100% tested. 
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The following sections describe recommended elec- 
trical connections for the Intel486 Microprocessor, 
and its electrical specifications. 


The 
Intel486 Microprocessor 
is implemented in 
CHMOS IV technology and has modest power re- 
quirements. However, its high clock frequency out- 
put buffers can cause power surges as multiple out- 
put buffers drive new signal levels simultaneously. 
For clean on-chip power distribution at high frequen- 
cy, 24 Vcc and 28 Vss pins feed the Intel486 Micro- 
processor. 


Power and ground connections must be made to all 
external Vcc and GND pins of the Intel486 Micro- 
processor. On the circuit board, all Vcc pins must be 
connected on a Vcc plane. All Vss pins must be 
likewise connected on a GND plane. 


13.1.2 
POWER 
DECOUPLING 
RECOMMENDATIONS 


Liberal decoupling capacitance should be placed 
near the Intel486 Microprocessor. The Intel486 Mi- 
croprocessor driving its 32-bit parallel address and 
data busses at high frequencies can cause transient 
power surges, particularly when driving large capaci- 
tive loads. 


Low inductance capacitors and interconnects are 
recommended for best high frequency electrical per- 
formance. Inductance can be reduced by shortening 
circuit board traces between the Intel486 Microproc- 
essor and decoupling capacitors as much as possi- 
ble. Capacitors specifically for PGA packages are 
also commercially available. 


13.1.3 
OTHER 
CONNECTION 
RECOMMENDATIONS 


For reliable operation, always connect unused in- 
puts to an appropriate signal level. Active LOW in- 
puts should be connected to VCCthrough a pullup 
resistor. Pullups in the range of 20 Kfi are recom- 2 
mended. Active HIGH inputs should be connected to 
GND. 


Table 13.1 is a stress rating only, and functional op- 
eration at the maximums is not guaranteed. Function 
operating conditions are given in 13.3 D.C. Specifi- 
cations and 13.4 A.C. Specifications. 


Extended exposure to the Maximum Ratings may af- 
fect device reliability. Furthermore, although the 
Intel486 Microprocessor contains protective circuitry 
to resist damage from static electric discharge, al- 
ways take precautions to avoid high static voltages 
or electric fields. 


Voltage 
on Any Pin with 
Respect 
to Ground 
- 0.5 to Vee + 0.5V 


Supply Voltage 
with 
Respect to Vss 
- 0.5V to + 6.5V 


Case Temperature 
under Bias 
-65°C 
to + 110°C 


Storage Temperature 
-65°C 
to + 150°C 


13.3 D.C. Specifications 


Functional 
Operating 
Range: Vee = 5V ±5%; 
TeASE = O°C to +85°C 


Table 
13.2. Intel486™ 
OX Microprocessor 
DC Parametric 
Values (for PGA Package) 


Symbol 
Parameter 
Mln 
Max 
Unit 
Notes 


VIL 
Input low 
Voltage 
-0.3 
+0.8 
V 


VIH 
Input High Voltage 
2.0 
Vee +0.3 
V 


VOL 
Output 
low 
Voltage 
0.45 
V 
(Note 1) 


VOH 
Output High Voltage 
2.4 
V 
(Note 2) 


Ice 
Power Supply Current (50 MHz) 
1000~ 
mA 
(Note 3) 
Power Supply Current (33 MHz) 
~~S 
Power Supply Current (25 MHz) 
A' 


III 
Input leakage 
Current 
•..~l"' 
L.~~ 
15 
/LA 
(Note 4) 


IIH 
Input leakage 
Current 
~~ 
;)':'01 00 
/LA 
(Note 5) 


IlL 
Input leakage 
Current 
;~,.dr'4" .• -400 
/LA 
(Note 6) 


ILO 
Output leakage 
Current 
,,(1..' 
"l.l!l~\ 
±15 
/LA 


CIN 
Input Capacitance 
• 
~U' 


(25 MHz and 33 MHz) 
, 
20 
pF 
Fe = 1 MHz (Note 7) 


(50 MHz) 
~.f.t* 
13 
pF 
Fe = 1 MHz (Note 7) 


Co 
I/O or Output capaci~~' 


(25 MHz and 33 MHz 
20 
pF 
Fe = 1 MHz (Note 7) 


(50 MHz) 
17 
pF 
Fe = 1 MHz (Note 7) 


CeLK 
ClK 
Capacitance 
(25 MHz and 33 MHz 
20 
pF 
Fe = 1 MHz (Note 7) 


(50 MHz) 
15 
pF 
Fe = 1 MHz (Note 7) 


NOTES: 
1. This parameter is measured at: 


Address. Data. BEn 
4.0 mA 
Definition, Control 
5.0 mA 
2. This parameter is measured at: 


Address, Data, BEn 
-1.0 
mA 
Definition, Control 
- 0.9 mA 
3. Typical supply current: 


550 mA @ 25 MHz 
700 mA @ 33 MHz 
BOO mA @ 50 MHz 
4. This parameter is for inputs without internal pullups or pulldowns and 0 ~ VIN ~ Vcc. 
5. This parameter is for inputs with internal pulldowns and VIH = 2.4V. 
6. This parameter is for inputs with internal pullups and VIL = 0.45V. 
7. Not 100% tested. 
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The AG. specifications, given in Table 13.3, consist 
of output delays, input setup requirements and input 
hold requirements. All AG. specifications are rela- 
tive to the rising edge of the GlK signal. 


AG. specifications measurement is defined by Fig- 
ures 13.1-13.7. 
All timings are referenced to 1.5V 


unless otherwise specified. Inputs must be driven to 
the voltage levels indicated by Figure 13.3 when 


AG. specifications are measured. Intel486 Micro- 
processor output delays are specified with minimum 
and maximum limits, measured as shown. The mini- 
mum Intel486 Microprocessor delay times are hold 
times provided to external circuitry. Intel486 Micro- 
processor input setup and hold times are specified 
as minimums, defining the smallest acceptable sam- 
pling window. Within the sampling window, a syn- 
chronous input signal must be stable for correct 
Intel486 Microprocessor operation. 
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Table 
13.3.25 
MHz Intel486™ 
Microprocessor 
A.C. Characteristics 
(PGA) 


Vcc = 5V ± 5%; TCASE = O°C to + 85°C; CL = 50 pF unless otherwise 
specified 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


Frequency 
8 
25 
MHz 
1X ClK 
to Intel486 


t1 
ClK 
Period 
40 
125 
ns 
13.1 


t1a 
ClK 
Period Stability 
0.1% 
t:. 
Adjacent 
Clocks 


t2 
ClK 
High Time 
14 
ns 
13.1 
at2V(1) 


t3 
ClK 
low 
Time 
14 
ns 
13.1 
at 0.8V(1) 


t4 
ClK 
Fall Time 
4 
ns 
13.1 
(2V - 
0.8V)(1) 


t5 
ClK 
Rise Time 
4 
ns 
13.\.' 
(0.8V - 
2V)(1) 


tEl 
A2-A31, 
PWT, PCO, BEO-3#, 
3 
22 . 
ns 
1.s.~"'" 


M/IO#, 
O/C#, 
W/R#, 
AOS#, 
-~~~ 


lOCK#,FERR#,BREQ, 
HlOA 
;'''' 
Valid Delay 
~ 


t7 
A2-A31, 
PWT, PCO, BEO-3#, 
r; 30 1l 
13.6 
(Note 1) 
M/IO#, 
O/C#, 
W/R#, 
AOS#, 
.~I~~ ~ ~~. 
lOCK # Float Delay 
.(" 
~, 
,.,-,' ~:..;; 


fa 
PCHK# 
Valid Delay 
0'1' 
3 
. 27V, \JI 
13.4 


t6a 
BLAST #, PLOCK # Valid Dela~ 
.-.3". 
~'27/ 
ns 
13.5 


t9 
BLAST #, PLOCK # FI 
, 
~ " 
A3p~) 
13.6 
(Note 1) 
ns 


t10 
00-031, 
OPO-3 w~6 
Valid 
)<;~ 
ns 
13.5 
Delay 
/ 
~(\.. 


t11 


• 
L.~..•..•., 
", •• 
30 
ns 
13.6 
(Note 1) 
00-031, 
OPO-3 Write oa'7"/ 


Delay 
~ 


t12 
EAOS# 
Setup TimEl" 
::>.~" 
8 
ns 
13.2 


t13 
EAOS # Hold Time 
,," 
'\.., 
..... 
3 
ns 
13.2 


t14 
KEN # , BS1,e'4. SS8 # Setup Time 
8 
ns 
13.2 


t15 
KEN # ,~6#, 
BS8# 
Hold Time 
3 
ns 
13.2 


t16 
ROY # ~SQoY # Setup Time 
8 
ns 
13.3 


t17 
ROY # , BROY #. Hold Time 
3 
ns 
13.3 


t16 
HOLD, AHOlO, 
BOFF # Setup Time 
10 
ns 
13.2 


t19 
HOLD, AHOlO, 
BOFF # Hold Time 
3 
ns 
13.2 


t20 
RESET, FlUSH#, 
A20M#, 
NMI, 
10 
ns 
13.2 
INTR, IGNNE# 
Setup Time 


t21 
RESET, FLUSH #, A20M #, NMI, 
3 
ns 
13.2 


INTR, IGNNE# 
Hold Time 


t22 
00-031, 
OPO-3, A4-A31 
Read 
5 
ns 
13.2,13.3 
Setup Time 


t23 
00-031, 
OPO-3, A4-A31 
Read 
3 
ns 
13.2,13.3 
Hold Time 


NOTE: 
1. Not 100% 
tested. 
Guaranteed 
by design 
characterization. 
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IIIU""OO 
'M Microprocessor 
A.C. Characteristics 
(PGA) 


Vcc = 5V ± 5%; TCASE = O°C to + 85°C; CL = 50 pF unless otherwise 
specified 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


Frequency 
8 
33 
MHz 
1X ClK 
to Intel486 


t1 
ClK 
Period 
30 
125 
ns 
13.1 


t1a 
ClK 
Period Stability 
0.1% 
Ii. 
Adjacent 
Clocks 


t2 
ClK 
High Time 
11 
ns 
13.1 
at2V(1) 


t3 
ClK 
low 
Time 
11 
ns 
13.1 
at 0.8V(1) 


t4 
ClK 
Fall Time 
3 
ns 
13.1 
(2V - 
0.8V)(1) 


t5 
ClK 
Rise Time 
3 
ns 
13.\. " 
""(0.8V 
- 
2V)l1) 


t6 
A2-A31, 
PWT, PCD, BEO-3#, 
3 
16 
• 
ns rt!~ 


M/IO#, 
D/C;II, 
W/R;II, 
ADS;II, 


~ 


lOCK#,FERR;II,BREQ, 
HlDA 


..•. ~ 
Valid Delay 
,'" 


t7 
A2-A31, 
PWT, PCD, BEO-3;11, 
.•. 
~•• 20 1?~ 


13.6 
(Note 1) 


M/IO#, 
D/C;II, 
W/R;II, 
ADS;II, 
~.''''';.\\ 
~ 


~, 
lOCK;II 
Float Delay 
" 
t8 
PCHK;II Valid Delay 
~~ ·'3 
.•. 22' 
."~,,,) 
13.4 


tea 
BLAST;II, 
PlOCK;II 
Valid Del~ 
1~t~o/ 
n§ 
13.5 


t9 
BLAST;II, 
PlOCK;II 
FI~Welft'Y 
~~.) A~' 
ns 
13.6 
(Note 1) 


tlO 
00-031, 
DPO-3 
w~ra 
Valid 
r}f ~,. 
ns 
13.5 
Delay 
,/ 
A~ 
A 


t11 
00-031, 
DPO-3 
write 
D~~ 
I~ 
20 
ns 
13.6 
(Note 1) 


Delay 


t12 
EADS;II Setup Tim~ 
::>~~ 
5 
ns 
13.2 


t13 
EADS;II 
Hold 1Jme 
'\."> 
3 
ns 
13.2 


t14 
KEN;II, BSlt4.as8;11 
Setup Time 
5 
ns 
13.2 


t15 
KEN;II ,BS.t6#, 
BS8;11Hold Time 
3 
ns 
13.2 


t16 
RDY#~S90Y;II 
Setup Time 
5 
ns 
13.3 


t17 
RDY;II, BRDY # Hold Time 
3 
ns 
13.3 


t18 
HOLD, AHOlD, 
Setup Time 
6 
ns 
13.2 


t18a 
BOFF;II 
Setup Time 
8 
ns 
13.2 


t19 
HOLD, AHOlD, 
BOFF;II 
Hold Time 
3 
ns 
13.2 


t20 
RESET, FlUSH;II, 
A20M;II, 
NMI, 
5 
ns 
13.2 
INTR, IGNNE;II 
Setup Time 


t21 
RESET, FlUSH;II, 
A20M;II, 
NMI, 
3 
ns 
13.2 
INTR, IGNNE;II 
Hold Time 


t22 
00-031, 
DPO-3, 
A4-A31 
Read 
5 
ns 
13.2,13.3 
Setup Time 


t23 
00-031, 
DPO-3, 
A4-A31 
Read 
3 
ns 
13.2,13.3 
Hold Time 


NOTE: 
1. Not 100% tested. Guaranteed by design characterization. 
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Table 13.5. 50 MHz Intel486™ Microprocessor 
A.C. Specifications 


VCC = 5V ± 5%; TCASE = O·C to + 85·C; CL = See Note 2 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


Frequency 
16 
50 
MHz 
1X CLK to Intel486 


t1 
CLK Period 
20 
62.5 
ns 
13.1 


t1a 
CLK Period Stability 
0.1% 
Adjacent 
Clocks 


t2 
CLK High Time 
7 
ns 
13.1 
at2V(1) 


t3 
CLK Low Time 
7 
ns 
13.1 
at 0.8V(1) 


4 
CLK Fall Time 
2 
ns 
13.1 
• 
(2.0V -0.8V)(1) 


t5 
CLK Rise Time 
2 
ns 
13~1''''''- L.(0.8V -2.0V)(1) 


ts 
A2-A31, 
PWT, PCD, BEO-3#<, MilO#<, 
3 
12: 
ns 
~~~3#~' 
D/C#<, 
W/R#<, ADS#<, LOCK#<, 


If 
FERR#<, BREQ, HLDA Valid Delay 


t7 
A2-A31, 
PWT, PCD, BEO-3#<, MilO#<, 


~~ 


'~18 


~~ 


.." 
13.6 
(Note 1) 


D/C#<, 
W/R#<, ADS#<, LOCK#<, 


~ 
FERR#<, BREQ Float Delay 
-. 


t8 
PCHK #<Valid Delay 
~ 
,.~ 
3 
'-1't. ,.~ 
' 
13.4 


t8a 
BLAST#<, PLOCK#< Valid Dela~'S'-' 
r 
12.•~l.'W 
13.5 


t9 
BLAST #<,PLOCK #<Float ~~ 
I'" 
,') 
..••..19' 
ns 
13.6 
(Note 1) 


tlO 
DO-D31, 
DPO-3 Write, Qii&(,Yillid 
DeW; ~i 
ns 
13.5 


t11 
DO-D31, 
DPO-3 '1I~,~Ma 
FIOilt Detar' 
18 
ns 
13.6 
(Note 1) 


t12 
EADS#< Setup Tirile"': 
~V.~ 


~. 
ns 
13.2 


t13 
EADS#< Hold Time 
-,""'y"i~ 2 
ns 
13.2 


t14 
KEN#<, BS16#, 
BS8,'f'~p 
~ 
5 
ns 
13.2 


t15 
KEN#<, BS16#,J3S8~0Id 
~~- 
2 
ns 
13.2 


t16 
RDY #<,BRD;:::. 
S.uP 
Time 
5 
ns 
13.3 


t17 
RDY #<,B~' 
Hold Time 
2 
ns 
13.3 


t18 
HOLD, AHc;iD 
Setup Time 
5 
ns 
13.2 


t18a 
BOFF #<Setup Time 
5 
ns 
13.2 


t19 
HOLD, AHOLD, 
BOFF #<Hold Time 
2 
ns 
13.2 


t20 
RESET, FLUSH#<, A20M#<, NMI, INTR, 
5 
ns 
13.2 
IGNNE #<Setup Time 


t21 
RESET, FLUSH#<, A20M#<, NMI,INTR, 
2 
ns 
13.2 
IGNNE#< Hold Time 


t22 
DO-D31, 
DPO-3, 
A4-A31 
Read Data 
4 
ns 
13.2,13.3 
Setup Time 


t23 
DO-D31, 
DPO-3, A4-A31 
Read Data 
2 
ns 
13.2, 13.3 
Hold Time 


NOTES: 
1. Not 100% tested. Guaranteed by design characterization. 
2. Specifications assume CL = 0 pF. I/O Buffer model must be used to determine delays due to loading (trace and compo- 
nent). First Order I/O buffer models for the Intel486 CPU are available. Contact Intel for the latest release. 
3. All timings are referenced at 1.5V (as illustrated in the listed figures) unless otherwise noted. 


Table 
13.6.50 
MHz Intel486™ 
Microprocessor 
A.C. Characteristics 
for Boundary 
Scan Test Signals 


Vcc = 5V ± 5%; TCASE = O°C to + 85°C; CL = 50 pF. All Inputs and Outputs 
are TIL 
Level 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


t24 
TCK Frequency 
25 
MHz 
1x Clock 


t25 
TCK Period 
40 
ns 
(Note 2) 


t26 
TCK High Time 
10 
ns 
@2.0V 


t27 
TCKLowTime 
10 
ns 
@0.8V 


t28 
TCK Rise Time 
4 
ns 
(Note 1) 


t29 
TCK Fall Time 
4 
ns 
(Note 1) 


t30 
TDI, TMS Setup Time 
8 
ns 
13.7 
(Note 3) 


t31 
TDI, TMS Hold Time 
7 
ns 
13.7 
(Note 3) 


t32 
TOO Valid Delay 
3 
25 
ns 
13.7 
(Note 3) 


t33 
TOO Float Delay 
TBD 
ns 


t34 
All Outputs 
(Non-Test) 
Valid Delay 
3 
25 
ns 
13.7 
(Note 3) 


t35 
All Outputs 
(Non-Test) 
Float Delay 
36 
ns 
13.7 
(Notes 3,5) 


t36 
All Inputs (Non-Test) 
Setup Time 
8 
ns 
13.7 
(Note 3) 


t37 
All Inputs (Non-Test) 
Hold Time 
7 
ns 
13.7 
(Note 3) 


NOTES: 
1. Rise/Fall times are measured between 0.8V and 2.0V. Rise/Fall times can be relaxed by 1 ns per 10 ns increase in TCK 
period. 
2. TCK period :? CLK period. 
3. Parameter measured from TCK. 
4. Boundary Scan A.C. Specifications in the above table are target values. They have not been characterized. Therefore 
they are subject to change. 
5. Not 100% tested. Guaranteed by design characterization. 
• 


858'. 8516'. 
[ 
KEN' 


RESET. 
fLUSH'. 
[ 
A20104,. 
IGNNE,. 
INTR. 
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13.4.1 
TYPICAL 
OUTPUT 
VALID 
DELAY 
VERSUS 
LOAD CAPACITANCE 
UNDER 
WORST 
CASE' 


CONDITIONS 
FOR THE 25 MHz AND 33 MHz Intel486 
CPU 


NOTE: 
This graph 
will not be linear outside 
of the CL range 
shown. 
nom = nominal 
value 
given in A.C. Characteristics 
table. 
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13.4.2 TYPICAL LOADING DELAY VERSUS CAPACITIVE LOADING UNDER WORST·CASE 
CONDITIONS FOR A HIGH TO LOW TRANSITION ON THE 50 MHz Intel486 CPU 
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13.4.3 TYPICAL LOADING DELAY VERSUS CAPACITIVE LOADING UNDER WORST-CASE 
CONDITIONS FOR A LOW TO HIGH TRANSITION ON THE 50 MHz Intel486 CPU 
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13.4.4 
TYPICAL 
OUTPUT 
RISE TIME 
VERSUS 
LOAD 
CAPACITANCE 
UNDER 
WORST- 
CASE CONDITIONS 


NOTE: 
240440-76 
This graph will not be linear outside of the CL range 
shown. 


13.5 Designing for ICD·486 
(Advance Information) 


The ICD-486 
(In-Circuit 
Debugger) 
is a hardware 
as- 
sisted 
debugger 
for the 
Intel486 
CPU. To use the 
ICD-486, 
the Intel486 
CPU component 
must be re- 
moved 
from 
its socket 
replaced 
with 
the 
ICD-486 
module. 
Because 
of the high operating 
frequency 
of 
Intel486 
CPU systems, 
there 
is no buffering 
of sig- 
nals between 
the Intel486 
CPU in the ICD-486 
and 
the target system. 
A direct result of the non-buffered 
interconnect 
is that the ICD-486 
shares the address 
and data bus of the target 
system. 
In order for the 
ICD-486 
to function 
properly 
(without 
the 
Optional 
Isolation 
Board 
installed), 
the 
design 
of the target 
system 
must meet the following 
restrictions: 


1. The bus controller 
must only enable 
data trans- 
ceivers 
onto the data bus during valid read cycles 
of the Intel486 
CPU, other local devices, 
or other 
bus masters. 


2. Before 
another 
bus master 
drives the local proc- 
essor 
address 
bus, the 
other 
bus master 
must 
gain access 
to the address 
bus through 
the use 
of HOlD-HlDA, 
AHOlD, 
or BOFF#. 


In addition 
to the above restrictions, 
the ICD-486 has 
several 
electrical 
and 
mechanical 
characteristics 
that 
should 
be taken 
into 
consideration 
when 
de- 
signing 
the Intel486 
CPU system. 


Capacitive 
loading: 
ICD-486 adds up to 30 pF to the 
ClK 
signal, 
and up to 20 pF to each 
of the other 
Intel486 
CPU signals. 
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DC loading: 
ICD-486 
adds 
± 15 p.A loading 
to the 
ClK 
and data bus signals and ± 5 p.A loading to the 
address 
and control 
signals. 


Power 
Requirements: 
For 
noise 
immunity 
and 
CMOS 
latch-up 
protection 
the 
ICD-486 
is powered 
by the target 
system 
through 
the power and ground 
pins of the Intel486 
CPU socket. 
The circuitry 
on the 
ICD-486 
draws 
up to 
1.3A 
excluding 
the 
Intel486 
CPU Icc. 


No Connects: 
Pins specified 
as N.C. in the Intel486 
CPU pin description 
must be left unconnected. 
Con- 
nection of any of these pins to power, ground, or any 
other 
signal 
may cause 
the processor 
or the 
ICD- 
486 to malfunction. 


Intel486 CPU location 
and Orientation: 
The ICD-486 
may require lateral clearance. 
Figure 13.4 shows the 
clearance 
requirements 
of the ICD-486. 


Optional 
Isolation 
Board (OIB) 


Due to its unbuffered 
design, the ICD-486 
is suscep- 
tible to errors 
on the target 
system's 
bus. The OIB 
installs 
between 
the 
ICD-486 
and 
Intel486 
CPU 
socket 
in the target 
system 
and allows 
the ICD-486 
to function 
in systems 
with faults 
(i.e., shorted 
sig- 
nals). After electrical 
verification 
the OIB may be re- 
moved. The 018 has the following 
electrical 
and me- 
chanical 
characteristics: 


Buffer Characteristics: 
The 018 buffers 
the address 
and data busses as well as the byte enables, 
ADS # , 
W/R#, 
MIIO#, 
BLAST#, 
and 
HlDA. 
The 
buffers 
are advanced 
CMOS devices 
and have the following 
DC drive 
specifications: 
IOH = 
- 15 mA, 
IOL = 
64 mA. The propagation 
delay of each buffer is 5 ns 
max driving a 50 pF load. To guarantee 
proper oper- 
ation 
with the 
018, the clock 
period 
should 
be in- 
creased 
by the round trip buffer delay (10 ns) unless 
the target 
system 
design already 
has enough 
timing 
margin. 


Unbuffered 
Signals: Signals not listed above as buff- 
ered are passed through 
the 018 and will have addi- 
tional capacitive 
loading 
due to the connectors 
and 
circuit board of up to 10 pF. 


Power 
Requirements: 
The 018 
is also 
powered 
by 
the target 
system 
through 
the Intel486 
CPU socket 
and requires 
0.5A in addition 
to the ICD-486 
and In- 
tel486 
CPU requirements. 


OIB Clearance 
Requirements: 
The 018 requires 
an 
extra 0.55" 
of vertical 
clearance 
in the target system 
above the Intel486 
CPU socket. 
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2.29 RE~ 
1.52 
,. 


45° 
CHAlAfER 
(INDEX CORNER) 


SWAGGED 
PIN 
(4 PL) 


SEATING 
PLANE"1 


0B (ALL 
PINS) I 
}=~ 


SWAGGED 
PIN 
DETAIL 


Family: Ceramic 
Pin Grid Array Package 


Symbol 
Millimeters 
Inches 


Min 
Max 
Notes 
Mln 
Max 
Notes 


A 
3.56 
4.57 
0.140 
0.180 


A1 
0.64 
1.14 
SOLID LID 
0.025 
0.045 
SOLID LID 


A2 
2.8 
3.5 
SOLID LID 
0.110 
0.140 
SOLID LID 


Aa 
1.14 
1.40 
0.045 
0.055 


B 
0.43 
0.51 
0.017 
0.020 


D 
44.07 
44.83 
1.735 
1.765 


D1 
40.51 
40.77 
1.595 
1.605 


e1 
2.29 
2.79 
0.090 
0.110 


L 
2.54 
3.30 
0.100 
0.130 


N 
168 
168 


S1 
1.52 
2.54 
0.060 
0.100 


ISSUE 
IWS 
REVX 
7/15/88 
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Letter or 
Description of Dimensions 
Symbol 


A 
Distance 
from seating plane to highest point of body 


A1 
Distance 
between 
seating plane and base plane (lid) 


A2 
Distance 
from base plane to highest point of body 


A3 
Distance 
from seating plane to bottom 
of body 


B 
Diameter 
of terminal 
lead pin 


0 
Largest overall package 
dimension 
of length 


01 
A body length dimension, 
outer lead center to outer lead center 


e1 
Linear spacing 
between 
true lead position 
centerlines 


L 
Distance 
from seating plane to end of lead 


S1 
Other body dimension, 
outer lead center to edge of body 


NOTES: 
1. Controlling dimension: millimeter. 
2. Dimension "e1" ("e") is non-cumulative. 
3. Seating plane (standoff) is defined by P.C. board hole size: 0.0415-0.0430 
inch. 
4. Dimensions "B", "B1" and "C" are nominal. 
5. Details of Pin 1 identifier are optional. 


where 
TJ, TA, TC = Junction, 
Ambient 
and 
Case 
Temperature 
respectively. 
8JC, 8JA = Junction-to- 
Case and Junction-to-Ambient 
Thermal 
Resistance, 
respectively. 
The Intel486 
Microprocessor 
is specified 
for opera- 
tion when 
TC (the case 
temperature) 
is within 
the 
range of 0·C-85·C. 
TC may be measured 
in any en- 
vironment 
to determine 
whether 
the Intel486 
micro- 
processor 
is within 
specified 
operating 
range. 
The 
case temperature 
should be measured 
at the center 
of the top surface 
opposite 
the pins. 


The ambient 
temperature 
(TA,)is guaranteed 
as long 
as TC is not violated. 
The ambient 
temperature 
can 
be calculated 
from 
8JC and 8JA from the followihg 
equations. 


The values 
for 8JA and 8JC are given in Table 
14.2 
for the 1.75 sq. in., 168-pin, 
ceramic 
PGA. 


Table 14.3 shows the TA allowable 
(without 
exceed- 


ing Tc) at various airflows 
and operating 
frequencies 
(fClK)' 


Note that TA is greatly 
improved 
by attaching 
"fins" 


or a "heat 
sink" 
to the package. 
P (the maximum 
power consumption) 
is calculated 
by using the maxi- 


mum Icc at 5V as tabulated 
in the DC Characteris- 


tics of Section 
13. 


TJ = Tc + p. IIJC 


TA=TJ-P·IIJA 


Tc = TA + p. [IIJA - 
IIJcl 


8JA vs Alrflow-ft/mln 
(m/sec) 


8JC 
0 
200 
400 
600 
800 
1000 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 
(5.07) 


I 
Without 
Heat Sink 
1.5 
17 
14.5 
12.5 
11.0 
10.0 
9.5 
I 
With Heat Sink' 
2.0 
13 
8.0 
6.0 
5.0 
4.5 
4.25 


·0.350· 
high unidirectional heat sink (AI alloy 6063, 40 mil fin width, 155 mil 
center-to-center fin spacing). 


•• 


intel~ 


9JA vs Airflow-ft/mln 
(m/sec) 


9JC 
0 
200 
400 
600 
800 
1000 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 
(5.07) 


I 
Without Heat Sink 
1.5 
16.5 
14.0 
12.0 
10.5 
9.5 
9.0 


I 
With Heat Sink' 
2.0 
12.0 
7.0 
5.0 
4.0 
3.5 
3.25 


r- 0.290" --j*l 


$1 
---1.53,,---1 


Airflow-ft/min 
(m/sec) 


fClK 
0 
200 
400 
600 
800 
1000 
(MHz) 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 
(5.07) 


TAwith Heat Sink 
25.0 
47 
64 
71 
75 
76 
77 


33.3 
36 
58 
67 
72 
74 
75 


50 
35 
60 
70 
75 
77.5 
78.75 


TA without Heat Sink 
25.0 
31 
40 
47 
52 
55 
57 


33.3 
15 
27 
36 
42 
47 
49 


50 
10 
22.5 
32.5 
40 
45 
47.5 


15.0 
LOW POWER 
Intel486™ 
OX 
MICROPROCESSOR 
• Lower Power Dissipation 
- 
Dynamic Frequency Scalability 
- 
lec(max) Reduced to 150 mA at 2 MHz 
- 
Improved Vee Rating (± 10%) 
• 168-Lead Pin Grid Array Package for Intel486 OX 
Microprocessor 


• High Performance Design 
- 
25 MHz Operation for Intel486™ OX 
- 
64 MByte/Sec Burst Bus 
- 
CHMOS IV Process Technology 
- 
Dynamic Bus Sizing for 8-, 16- and 32-Bit Bus- 
es 


This section describes the Low Power Intel486 OX 
microprocessor. 


The Low Power Intel486 Microprocessor meets to- 
day's need for high performance portables. The 
combination of special features like dynamic fre- 
quency scaling, lower minimum frequency, improved 
Vee operation and high integration contribute signifi- 
cantly to lower power dissipation and meet the 
needs of portable computing. 


The Low Power capability is achieved by operating 
the Intel486 Microprocessor in the 2X mode. The 
frequency can be varied dynamically between maxi- 
mum to minimum as needed. The frequency change 
does not affect contents of the registers and data 
integrity is maintained. Power dissipation is reduced 
significantly at 2 MHz where Ice is only 150 mA 
compared to 600 mA at 20 MHz. Low power ver- 
sions are offered for both the Intel486 SX and the 
Intel486 OX Microprocessors. 


The Low Power Intel486 Microprocessor is 100-per- 
cent compatible with all versions of the Intel386™ 
Microprocessor family, assuring compatibility with 
the more than $50 billion software base of MS-DOS, 
Windows, OS/2 and UNIX/System operating system 
applications. The Low Power Intel486 Microproces- 
sor integrates the same RISC-technology, one clock 
per instruction integer core, on-chip cache, and 
memory management unit as the standard Intel486 
Microprocessor. 


Note that the Intel OverDrive™ Processor does not 
work in systems based on the Low Power Intel486 
CPU. 


The following section on the Low Power Intel 486 
OX Microprocessor contains information specific to 
the Low Power device only. All data not defined are 
located in the appropriate sections of this data sheet 
unless specified otherwise. 


The Low Power Intel486 Microprocessor brings In- 
tel486 technology and performance to the portable 
computer 
market. The 
low 
power 
capability 
is 
achieved by a frequency scalability feature during 
normal operation. The operating frequency can be 
brought down dynamically resulting in lower power 
supply current (led. This results in minimal power 
dissipation which ensures a longer battery life. 


The Low Power Intel486 Microprocessor integrates 
the same RISC-technology, one clock per instruc- 
tion integer core, on-chip cache, and memory man- 
agement unit as the standard Intel486 Microproces- 
sor. 


The Low Power Intel486 Microprocessor has the fol- 
lowing special features: 


• 
Frequency 
Scalability-This 
is achieved by op- 


erating the Intel486 Microprocessor in the 2X 
clock mode. The frequency can be varied dynam- 
ically from maximum back to minimum or vice 
versa. The frequency change does not affect the 
register content of the CPU, thus data integrity is 
maintained. 


• 
Lower 
Minimum 
Frequency-The 
Low Power 
Intel486 Microprocessor can be operated at a 
minimum frequency of 2 MHz, at which lec(max) 
is only 150 mA, compared to an lec(max) of 600 
mA at 20 MHz operation. The power dissipation is 
thus drastically reduced ensuring a longer battery 
life. 


• 
Improved 
Vcc Operation-The 
Low Power In- 
tel486 Microprocessor has an improved Vee rat- 
ing of ± 10%. Again this feature makes it ex- 
tremely attractive to portable battery powered ap- 
plications. 


The above three features ensure power savings for 
portable computer systems resulting in prolonged 
battery life. 


Besides the above special features, the Low Power 
Intel486 Microprocessor has an identical feature set 
to the standard Intel486 CPU. This includes: 


• 
Binary Compatibility-The 
Low Power Intel486 
CPU is binary compatible with the 8086, 8088, 
80186,80286, Intel386 SX, Intel386 OX, Intel486 
SX and Intel486 OX CPUs. 


• 


• Full 32-Blt Integer Processor-The 
Low Power 


Intel486 CPU performs a complete set of arith- 
metic and logical operations on 8-, 16-, and 32-bit 
data types using a full-width ALU and eight gener- 
al-purpose registers. 


• Separate 
32-Blt 
Address 
and Data Paths- 


Four gigabytes of physical memory can be ad- 
dressed directiy. 


• Single-Cycle 
Execution-Many 
instructions ex- 


ecute in a single clock cycle. 


• On-Chip Floating Point Unit-The 
32-, 64-, and 
80-bit formats specified in IEEE standard 754 are 
supported. The unit is binary compatible with the 
8087, 80287, Inte1387™, Intel387 SX, and In- 
tel487TM Math Coprocessors and the Intel486™ 
CPU. 


• On-Chip Memory Management Unlt-Address- 
management 
and 
memory-space 
protection 
mechanisms maintain the integrity of memory. 
This is necessary in multitasking and virtual-mem- 
ory environments, like those implemented by the 
UNIX and OS/2 operating systems. Both memory 
segmentation and paging are supported. 


• On-Chip Cache with Cache Consistency 
Sup- 
port-The 
internal write-through cache can hold 


8 KBytes of data or instructions. Cache hits are 
as fast as read accesses to a processor register. 
Bus activity is tracked to detect alterations in the 
memory which internal cache represents. The in- 
ternal cache can be invalidated or flushed, so 
that an external cache controller can maintain 
cache consistency in multi-processor environ- 
ments. 


• External Cache Control-Write-back 
and flush 
controls over an external cache are provided so 
that the processor can maintain cache consisten- 
cy in multi-processor environments. 
• Instruction 
Plpellnlng- 
The fetching, decoding, 


execution and address translation of instructions 
are overlaped within the Low Power Intel486 Mi- 
croprocessor. This results in a continuous execu- 
tion rate of one clock cycle per instruction, for 
most instructions. 


• Burst Cycles-Burst 
transfers allow a new dou- 


bleword to be read from memory each clock cy- 
cle. With this capability the internal cache and in- 
struction prefetch buffer can be filled very rapidly. 


• Write 
Buffers-The 
processor contains write 


buffers to enhance the performance of consecu- 
tive writes to memory. The Low Power Intel486 
CPU can continue operations internally after a 
write, without waiting for the write to be executed 
on the external bus. 


• Bus Backoff-If 
another bus master needs con- 


trol of the bus during a Low Power Intel486 Mi- 
croprocessor initiated bus cycle, the Low Power 
Intel486 Microprocessor will float its bus signals, 
then restart its cycle when the bus becomes 
available again. 


• Instruction 
Restart-Programs 
can continue ex- 


ecution following an exception generated by an 
unsuccessful attempt to access memory. This 
feature is important for supporting demand-paged 
virtual memory applications. 


• Dynamic Bus Sizing-External 
controllers can 


dynamically alter the effective width of the data 
bus. Bus widths of 8, 16 or 32 bits can be used. 


The Low Power Intel386 DX Microprocessor pinout 
follows the same definition as the Intel486 DX Micro- 
processor given in Section 1.0 except for those list- 
ed in Table 15.1. 


1486OX 
Low Power 1486OX 
Pin# 
Microprocessor 
Microprocessor 


CLK 
CLK2 
C3 


NC 
CLKSEL 
A3(1) 


NOTE: 
1. This pin is TCK on the 50 MHz lnte1486 OX Microproces- 
sor. 
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Intel486TM OX MICROPROCESSOR 


15.2 
Pinout 


s 
R 
Q 
P 
N 
M 
L 
K 
J 
H 
G 
F 
E 
0 
C 
B 
A 


0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


A27 
A31 
02 
Vss 
Vcc 
Vss 
Vss 
011 
020 
A28 
DO 
Vss 
Vss 
Vss 
OPl 
09 
019 
2 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
2 
A26 
Vss 
01 
06 
05 
Vcc 
Vcc 
018 
022 
A25 
A29 
Vcc 
Vcc 
03 
08 
013 
021 
3 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
3 
A23 
A17 
OPO 
07 
016 
012 
010 
CLK2 
CLKSEL 
Vcc 
A30 
04 
014 
OP2 
015 
017 
vss 
4 
0 
0 
0 
0 
0 
0 
4 
NC 
A19 
Vcc 
023 
Vss 
vss 


5 
0 
0 
0 
0 
0 
0 
5 


A14 
A21 
vcc 
OP3 
A18 
vss 
6 
0 
0 
0 
0 
0 
0 
6 


vss 
A24 
027 
024 
Vcc 
025 • 


7 
0 
0 
0 
0 
0 
0 
7 
A12 
A22 
026 
Vss 
A15 
vcc 
8 
0 
0 
0 
LOW POWER 
0 
0 
0 
8 


vss 
A20 
028 
029 
Vcc 
031 


9 
0 
0 
0 
16B-PIN 
PGA 
PINOUT 
0 
0 
0 
9 


vss 
A16 
030 
Vss 
Vcc 
Intel4B6TM 
OX CPU 
vcc 
10 
0 
0 
0 
0 
0 
0 
10 


vss 
A13 
TOP 
SIDE 
VIEW 
NC 
NC 
Vcc 
NC 
11 
0 
0 
0 
0 
0 
0 
11 
vss 
A9 
NC 
vss 
vcc 
vcc 
12 
0 
0 
0 
0 
0 
0 
12 
vss 
A5 
NC 
NC 
All 
NC 


13 
0 
0 
0 
0 
0 
0 
13 
Al0 
A7 
NC 
NC 
A8 
NC 
14 
0 
0 
0 
0 
0 
0 
14 


vss 
A2 
FERR# 
NC 
vcc 
NC 
15 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
15 
A6 
BREa 
HLOA 
LOCK# 
o/c# 
PWT 
BE2# 
NC 
HOLD 
FLUSH# 
IGNNE# 
A3 
BEO# 
BROY# 
KEN# 
A20M# 
NMI 


16 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
16 


A4 
PLOCK# 
M/IO# 
vcc 
BE1# 
vcc 
Vcc 
RESET 
INTR 
BLAST# 
Vcc 
Vcc 
Vcc 
Vcc 
ROY# 
BS8# 
NC 


17 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
17 
AOS# 
PCHK# 
W/R# 
vss 
pco 
vss 
vss 
BS16# 
AHOLO 
NC 
vss 
vss 
vss 
vss 
BE3# 
BOFF# 
EAOS# 


S 
R 
Q 
P 
N 
M 
L 
K 
J 
H 
G 
F 
E 
0 
C 
B 
A 
240440-A5 


Figure 15.1. Low Power Intel486™ 
OX CPU Pinout (Top Side View) 


Intel486TM OX MICROPROCESSOR 
inte!® 


A 
B 
C 
0 
E 
F 
G 
H 
J 
K 
L 
M 
N 
P 
Q 
R 
S 


0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


020 
011 
VSS 
VSS 
VCC 
VSS 
02 
A31 
A27 


019 
09 
OPI 
VSS 
VSS 
VSS 
DO 
A28 
2 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
2 
022 
018 
VCC 
VCC 
05 
06 
01 
VSS 
A26 
021 
013 
08 
03 
VCC 
VCC 
A29 
A25 


3 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
3 
CLKSEL 
CLK2 
010 
012 
016 
07 
OPO 
A17 
A23 


VSS 
017 
015 
OP2 
014 
04 
A30 
vcc 
4 
0 
0 
0 
0 
0 
0 
4 
023 
VCC 
A19 
NC 


VSS 
vss 
5 
0 
0 
0 
0 
0 
0 
5 
OP3 
vcc 
A21 
AU 


VSS 
A18 


6 
0 
0 
0 
0 
0 
0 
6 
024 
027 
AH 
VSS 
025 
vcc 
7 
0 
0 
0 
0 
0 
0 
7 
vss 
026 
A22 
Al2 
VCC 
A15 


8 
0 
0 
0 
LOW POWER 


0 
0 
0 
8 
029 
028 
A20 
VSS 
031 
Vcc 
9 
0 
0 
0 
16B-PIN 
PGA PINOUT 
0 
0 
0 
9 
vss 
030 
A 16 
VSS 
VCC 
Intel4B6TM 
DX CPU 
vcc 
10 
0 
0 
0 
0 
0 
0 
10 
NC 
NC 
PIN SIDE VIEW 
A13 
Vss 
NC 
vcc 
11 
0 
0 
0 
0 
0 
0 
11 
vss 
NC 
A9 
vss 
vcc 
vcc 
12 
0 
0 
0 
0 
0 
0 
12 
NC 
NC 
AS 
VSS 
NC 
All 


13 
0 
0 
0 
0 
0 
0 
13 
NC 
NC 
A7 
Al0 
NC 
A8 
14 
0 
0 
0 
0 
0 
0 
14 
NC 
FERR# 
A2 
vss 
NC 
vcc 
15 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
15 
IGNNE# 
FLUSH# 
HOLD 
NC 
BE2# 
PWT 
O/C# 
LOCK# 
BREQ 
A6 
NfoIl 
A20fol# 
KEN# 
BROY# 
BEO# 
HLOA 
A3 
16 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
16 
INTR 
RESET 
VCC 
VCC 
BE1# 
VCC 
foI/IO# 
PLOCK# 
M 


NC 
BS8# 
ROY# 
VCC 
VCC 
VCC 
VCC 
BLAST# 


17 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


W?R# 
0 
0 
0 
0 
17 
AHOLO 
BS16# 
vss 
vss 
pco 
vss 
PCHK# 
AOS# 


EAOS# 
BOFF# 
BE3# 
vss 
vss 
vss 
vss 
NC 


A 
B 
C 
0 
E 
F 
G 
H 
J 
K 
L 
M 
N 
P 
Q 
R 
S 


240440-A6 


Figure 15.2. Low Power Intel486TM OX CPU Pinout (Pin Side View) 


Address 
Data 
Control 
NIC 
Vec 
Vss 


A2 
Q14 
Do 
P1 
A20M# 
015 
A10 
87 
A7 


A3 
R15 
01 
N2 
A08# 
817 
A12 
89 
A9 


A4 
816 
02 
N1 
AHOLO 
A17 
A13 
811 
A11 


A5 
Q12 
03 
H2 
8EO# 
K15 
A14 
C4 
83 


A6 
815 
04 
M3 
8E1# 
J16 
810 
C5 
84 


A7 
Q13 
05 
J2 
8E2# 
J15 
812 
E2 
85 
A8 
R13 
06 
L2 
8E3# 
F17 
813 
E16 
E1 


A9 
Q11 
07 
L3 
8LA8T# 
R16 
814 
G2 
E17 


AlO 
813 
08 
F2 
80FF# 
017 
816 
G16 
G1 


An 
R12 
09 
01 
8ROY# 
H15 
C10 
H16 
G17 


A12 
87 
010 
E3 
8REQ 
Q15 
C11 
J1 
H1 


A13 
Q10 
011 
C1 
888# 
016 
C12 
K2 
H17 


A14 
85 
012 
G3 
8816# 
C17 
C13 
K16 
K1 


A15 
R7 
013 
02 
CLK2 
C3 
G15 
L16 
K17 


A16 
Q9 
014 
K3 
CLK8EL 
A3 
R17 
M2 
L1 


A17 
Q3 
015 
F3 
O/C# 
M15 
84 
M16 
L17 


A18 
R5 
016 
J3 
OPO 
N3 
P16 
M1 


A19 
Q4 
017 
03 
OP1 
F1 
R3 
M17 


A20 
Q8 
018 
C2 
OP2 
H3 
R6 
P17 


A21 
Q5 
019 
81 
OP3 
A5 
R8 
Q2 


A22 
Q7 
020 
A1 
EA08# 
817 
R9 
R4 


A23 
83 
021 
82 
FERR# 
C14 
R10 
86 


A24 
Q6 
022 
A2 
FLU8H# 
C15 
R11 
88 


A25 
R2 
023 
A4 
HLOA 
P15 
R14 
89 


A26 
82 
024 
A6 
HOLD 
E15 
810 


A27 
81 
025 
86 
IGNNE# 
A15 
811 


A28 
R1 
026 
C7 
INTR 
A16 
812 


A29 
P2 
027 
C6 
KEN# 
F15 
814 


A30 
P3 
028 
C8 
LOCK # 
N15 


A31 
Q1 
029 
A8 
M/IO# 
N16 
030 
C9 
NMI 
815 
031 
88 
PCO 
J17 
PCHK# 
Q17 
PWT 
L15 
PLOCK# 
Q16 
ROY# 
F16 
RE8ET 
C16 
W/R# 
N17 


All pin descriptions 
for the Low Power Intel486 
OX Microprocessor 
follow the same definition 
as the Intel486 
OX Microprocessor 
with the exception 
of those 
listed in Table 
15.2. 


Symbol 
Type 
Name and Function 


CLK2 
I 
CLK2 provides the fundamental 
timing for the Low Power Intel486 
OX Microprocessor. 
This is twice the internal frequency 
of the CPU. 


CLK8EL 
I 
Clock Select pin selects the 2X mode required for the Low Power Intel486 
CPU. A well 
defined 
pulse on this pin establishes 
the phase relationship 
of the 2X clock. With the 
exception 
of a pulse during cold reset, this pin should be driven low at all times and must 
be free of spikes or glitches. 


• 


Table 
15.3. lists all the output 
pins, indicating 
their 


active 
level, and when they are floated. 


Name 
Active 
When Floated 
Level 


BREQ 
HIGH 


HLDA 
HIGH 
BEO#-BE3# 
LOW 
Bus Hold 


PWT, PCD 
HIGH 
Bus Hold 
W/R#, 
D/C#, 
HIGH/LOW 
Bus Hold 
M/IO# 
LOCK# 
LOW 
Bus Hold 
PLOCK # 
LOW 
Bus Hold 
ADS# 
LOW 
Bus Hold 
BLAST # 
LOW 
Bus Hold 
PCHK# 
LOW 
FERR# 
LOW 
A2-A3 
HIGH 
Bus, 


Address 
Hold 


Table 
15.4 lists all input pins, indicating 
their active 


level, 
and whether 
they 
are synchronous 
or asyn- 


chronous 
inputs. 


Name 
Active 
Synchronousl 
Level 
Asynchronous 


CLK2 
CLKSEL 
RESET 
HIGH 
Asynchronous 


HOLD 
HIGH 
Synchronous 


AHOLD 
HIGH 
Synchronous 


EADS# 
LOW 
Synchronous 


BOFF# 
LOW 
Synchronous 


FLUSH # 
LOW 
Asynchronous 


A20M# 
LOW 
Asynchronous 


BS16#, 
BS8# 
LOW 
Synchronous 


KEN# 
LOW 
Synchronous 


RDY# 
LOW 
Synchronous 


BRDY# 
LOW 
Synchronous 


INTR 
HIGH 
Asynchronous 


NMI 
HIGH 
Asynchronous 


IGNNE# 
LOW 
Asynchronous 


Table 
15.5 lists all the input/output 
pins, indicating 


their active 
level and when they are floated. 


intel~ 


Name 
Active 
When Floated 
Level 


DO-D31 
HIGH 
Bus Hold 
DPO-DP3 
HIGH 
Bus Hold 
A4-A31 
HIGH 
Bus, Address 
Hold 


Name 
Input or 
Sampledl 
Output 
OrlvenOn 


TCK 
Input 
N/A 


TOI 
Input 
Rising Edge of TCK 


TDO 
Output 
Falling Edge of TCK 


TMS 
Input 
Rising Edge of TCK 


1486 SX Microprocessor 
Component 
Revision 
Low Power 
10 
10 
Stepping 
Name 


DO 
04 
04 


NOTE: 
Table 15.7 shows the Component 10 number and Revision 
10 number for the 0-0 stepping of the Intel486 OX Micro- 
processor. When an Intel OverDrive Processor is installed 
in the system, the Component 10 and Revision 10 is provid- 
ed by the OverDrive Processor and not the Intel486 OX Mi- 
croprocessor. The Component 10 and Revision 10 read by 
the BIOS/software may change when a Performance Up- 
grade Component, such as the Intel OverDrive Processor, 
is installed in an Intel486 OX Microprocessor based sys- 
tem. 


15.5 Signal Description 


With the exception 
of CLK2 and CLKSEL, 
all signals 


follow the same definition 
as the Intel486 
Microproc- 


essor. 
The A.C. timing 
parameters 
for all of these 


signals are given in Table 
15.11. 


CLK2 
provides 
the fundamental 
timing 
for the Low 


Power 
Intel486 
Microprocessor. 
It is divided 
by two 


internally 
to generate 
the 
internal 
processor 
clock 


used for instruction 
execution. 
The internal 
clock 
is 


comprised 
of two phases, 
"phase 
one" 
and "phase 


two". 
Each CLK2 
period 
is a phase 
of the internal 


clock. 
Figure 
15.3 illustrates 
the relationship. 
If de- 


sired, the phase of the internal 
processor 
clock 
can 


be synchronized 
to a known 
phase 
by ensuring 
the 


pulse on the CLKSEL 
pin meets the applicable 
tim- 


ings during cold boot (power-up 
reset). 


INTERNAL 
Intel486TWCPU 
ClK 
(holf 
the 
freq. 


of ClK2) 


All set-up, hold, float-delay and valid delay timings 
are referenced to the phase one of the clock. 


The internal processor clock (CLK) is similar to the 
clock signal of the standard Intel486 Microproces- 
sor. All I/O signals get sampled on the rising edge of 
this signal, i.e. the rising edge of phase one. Thus it 
is important to synchronize the external circuitry with 
the phase one of CLK2. 


Clock Select pin selects the 2X mode required for 
the Low Power Intel486 OXCPU. This pin should be 
driven low after power-up and during the entire oper- 
ation of the CPU. However, a well defined pulse is 
required on CLKSEL pin during cold boot (power-up 


reset) to establish the phase relationship of the 2X 
clock. The reset pulse width during cold reset should 
be at least 1 ms. As shown in Figure 15.4, the pulse 
on CLKSEL should be asserted by the end of reset 
(approximately 0.9 ms after driving reset active) and 
at least 30 CLK2 periods before the falling edge of 
reset. 


Figure 15.5 shows the detailed timing definition of 
this pulse. The pulse on CLKSEL pin is only required 
during power-up reset. During all other times includ- 
ing warm resets the CLKSEL pin should be driven 
low and must be free of spikes or glitches. After the 
power-up reset, the system must track the phase of 
CLK2 at all times including during warm resets so 
that the input! output signals can be sampled at the 
appropriate clock edge. The phase relationship is 
described in the next section. 
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The Low Power Intel486 OX Microprocessor is archi- 
tecturally similar to the Intel486 CPU. Thus all bus 
cycles follow the same definition The difference lies 
in the fact that the Low Power Intel486 CPU works 
with an external 2X clock input (CLK2). As shown in 
Figure 16-3, each of the internal processor clock 
(CLK) cycle is two CLK2 cycles wide. Thus a 25 MHz 
Low Power Intel486 OX Microprocessor needs a 
50 MHz clock input. 


CLK2 provides the fundamental timing for the Low 
Power Intel486 CPU. It is divided by two internally to 
generate the internal processor clock (CLK) used for 
instruction execution. The internal clock is com- 
prised of two phases, "phase one" and "phase 
two". Each CLK2 period is a phase of the internal 
clock. All Low Power Intel486 Microprocessor inputs 
are sampled at the rising edge of phase 1. Each bus 
cycle is comprised of at least two bus states, T1 and 
T2. Each bus state in turn consists of two CLK2 cy- 
cles phase 1 and phase 2 of the bus state. The bus 
state diagram in Section 7.2.13 is valid for the Low 
Power Intel486 Microprocessor. 


NOTE: 
The timing diagrams given in the Intel486 data 
sheet can be used for the Low Power Intel486 Mi- 
croprocessor. Read "CLK" 
signal as the internal 
clock of the CPU, with "CLK2" (the input clock of 
the Low Power Intel486 CPU) being twice the fre- 
quency of the internal processor clock as shown in 
Figure 15.3. 


The following describes how the input signals are 
sampled and output signals are referenced with re- 
spect to the input clock (CLK2): 


INPUT SIGNALS: 


The Low Power Intel486 CPU samples all its syn- 
chronous 
input 
signals 
(Le. 
ROY#, 
BRDY#, 


BS8#, 
BS16#, 
KEN#, 
EADS#, 
BOFF#, 
HOLD 
and AHOLD) at the rising edge of phase 1, as long 
as proper setup and hold times relative to that clock 
edge are met. 


The Low Power Intel486 CPU samples all its asyn- 
chronous 
input signals (Le. RESET, INTR, NMI, 


A20M# 
FLUSH#, 
IGNNE#) at every other rising 
edge of the system clock (Phase 1), as long as prop- 
er setup and hold times relative to that clock edge 
are met. 


The A.C. timing specifications for output signals (Le. 
valid and float delay timings) are specified with re- 
spect to the rising edge of the Phase 1 of the system 2 
clock. This holds true for all output signals including 
ADS# and PCHK#. 


15.7 
Variable CPU Frequency 


The Low Power Intel486 Microprocessor allows the 
CPU frequency to change dynamically. As shown in 
Figures 15.6 and 15.7, the relationship between fre- 
quency and power consumption is approximately lin- 
ear. Thus lowering the CPU frequency, reduces the 
power supply current (Icel consumed by the CPU. 


The following must be satisified to change the CPU 
frequency: 
1. Frequency can be changed at least 8 clocks after 
satisfying t4 (see Figure 15.5). The system can be 
started at a lower frequency and after satisfying 
the CLKSEL pulse specifications, it can be oper- 
ated at the required speed. 
2. The change in frequency should satisfy the mini- 
mum specification of 
"CLK2 
high time" 
and 
"CLK2 low time". That is, at no time should the 
clock period go below the specified clock high 
and clock low times (see A.C. specifications for 
exact values). 
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600 


500 
~ 
400 
.5 
~ 
300 


0 
200 
_0 


100 


0 
0 


700 


600 


~ 
500 


-5. 
400 
Q.~ 
'"' 
300 
-'"' 
200 


100 


0 
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Table 
15.8 provides 
the absolute 
maximum 
ratings. 
It is a stress 
rating only and functional 
operation 
at the 
maximums 
is not guaranteed. 
Functional 
operating 
conditions 
are given in Section 
15.8.1 D.C. Specifications 
and Section 
15.8.3 A.C. Specifications. 


Case Temperature 
under Bias 
- 65°C to + 110°C 


Storage Temperature 
-65°C 
to + 150°C 


Voltage on Any Pin with Respect 
to Ground 
-0.5V 
to (Vcc 
+ 0.5V) 


Supply Voltage with Respect to Vss 
-0.5V 
to +6.5V 


Symbol 
Parameter 
Mln 
Max 
Unit 
Notes 


VIL 
Input Low Voltage 
-0.3 
+0.8 
V 


VIH 
Input High Voltage 
2.0 
Vcc 
+ 0.3 
V 


VOL 
Output Low Voltage 
0.45 
V 
(Note 1) 


VOH 
Output High Voltage 
2.4 
V 
(Note 2) 


Icc 
Power Supply Current 
700 
mA 
(Note 3) 
CLK2 = 50 MHz 


III 
Input Leakage Current 
±15 
/LA 
(Note 4) 


IIH 
Input Leakage Current 
200 
/LA 
(Note 5) 


IlL 
Input Leakage Current 
-400 
/LA 
(Note 6) 


ILO 
Output 
Leakage Current 
±15 
/LA 


CIN 
Input Capacitance 
20 
pF 
Fe = 1 MHz(7) 


Co 
1/0 or Output Capacitance 
20 
pF 
Fe = 1 MHz(7) 


CCLK 
CLK Capacitance 
20 
pF 
Fe = 1 MHz(7) 


NOTES: 
1. This parameter is measured at: 
Address, Data BEn 4.0 mA 
Definition, Control 5.0 mA 
2. This parameter is measured at: 
Address, Data BEn -1.0 
mA 
Definition, Control - 0.9 mA 
3. Typical supply current 
Ice = 550 mA @CLK2 = 50 MHz 
4. This parameter is for inputs without pullups or pulldowns and 0 ,;; VIN ,;; Vcc. 
5. This parameter is for inputs with pulldowns and VIH = 2.4V. 
6. This parameter is for inputs with pullups and VIL = 0.45V. 
7. Not 100% tested. 
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Following 
is the power consumption 
of the Low Power Intel486 
Microprocessor 
installed 
in a low power system 
for different 
frequencies. 


CLK2 
Frequency 
Operating 
Frequency 
ICC(max) 
ICC(typ) 
(MHz) 
(MHz) 
(mA) 
(mA) 


4 
2 
150 
100 


16 
8 
325 
235 


32 
16 
525 
400 


40 
20 
600 
475 


50 
25 
700 
550 


The following 
table provides 
the A.C. specifications 
for the Low Power Intel486 
OX Microprocessor. 
It consists 
of output 
delays, 
input setup requirements 
and input hold requirements. 
All A.C. specifications 
are relative 
to 
the rising edge of the phase 
1 of the input system 
clock 
(CLK2), unless otherwise 
specified. 


Table 
15.11. Low 
Power 
Intel486 
DX-25 
MHz Microprocessor 
A.C. Characteristics 


VCC = 5V ± 10 %; TCASE = O°C to + 85°C; CL = 50 pF(2) unless otherwise 
specified 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


Frequency 
2 
25 
MHz 
Half of CLK2 Frequency 


t1 
CLK2 Period 
20 
250 
ns 
15.8 


t2 
CLK2 High Time 
7 
ns 
15.8 
At2V 


t3 
CLK2 Low Time 
7 
ns 
15.8 
AtO.8V 
~ 
CLK2 Fall Time 
2 
ns 
15.8 
2V to 0.8V 


t5 
CLK2 Rise Time 
2 
ns 
15.8 
0.8Vt02V 


Is 
A2-A31, 
PWT, PCD, BEO-3#, 
3 
22 
ns 
15.9 
M/lO#, 
D/C#, 
W/R#, 
ADS#, 
LOCK#, 
FERR #, BREQ, HLDA Valid Delay 


t7 
A2-A31, 
PWT, PCD, BEO-3#, 
30 
ns 
15.9 
After Clock Edge(1) 
M/IO#, 
D/C#, 
W/R#, 
ADS#, 
LOCK # Float Delay 


te 
PCHK# 
Valid Delay 
3 
27 
ns 
15.9 


tea 
BLAST #, PLOCK # Valid Delay 
3 
27 
ns 
15.10 


Table 
15.11. Low Power 
Intel486 
DX-25 
MHz Microprocessor 
A.C. Characteristics 
(Continued) 
VCC = 5V ± 10%; TCASE = O°C to + 85°C; CL = 50 pF(2) unless otherwise 
specified 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


t9 
BLAST #, PLOCK # 
30 
ns 
15.9 
After Clock Edge(l) 


Float Delay 


t10 
00-031, 
DPO-3 Write Data 
3 
22 
15.9 


o. 


ns 
Valid Delay 


tl1 
00-031, 
DPO-3 
Write Data 
30 
ns 
15.9 
After Clock Edge(l) 


Float Delay 


t12 
EADS# 
Setup Time 
9 
ns 
15.10 


t13 
EADS# 
Hold Time 
4 
ns 
15.10 


t14 
KEN#, 
BS16#, 
BS8# 
9 
ns 
15.10 
Setup Time 


t15 
KEN#, 
BS16#, 
BS8# 
4 
ns 
15.10 
Hold Time 


t16 
ROY #, BRDY # Setup Time 
9 
ns 
15.10 


t17 
ROY #, BRDY # Hold Time 
4 
ns 
15.10 


t18 
HOLD, AHOLD, 
BOFF # 
11 
ns 
15.10 
Setup Time 


t19 
HOLD, AHOLD, 
BOFF # 
4 
ns 
15.10 
Hold Time 


t20 
RESET, FLUSH #, A20M #, NMI, 
11 
ns 
15.10 
INTR, IGNNE# 
Setup Time 


t21 
RESET, FLUSH#, 
A20M#, 
NMI, 
4 
ns 
15.10 
INTR, IGNNE# 
Hold Time 


t22 
DO-D31,DPO-3,A4-A31 
6 
ns 
15.10 
Read Setup Time 


t23 
DO-D31, 
DPO-3, 
A4-A31 
4 
ns 
15.10 
Read Hold Time 


CLKSEL 
See Figures 15.4 and 15.5 for details on this signal. 
Figure 15.5 shows minimum timings required for the 
proper operation 
of the CPU. The pulse on CLKSEL can 
be of any length as long as the minimums 
are satisfied 
and the transitions 
from low to high occurs at the clock 


, 
edge shown. 
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NOTES: 
1. Not 100% tested, guaranteed by design characterization. 
2. All timing specifications assume CL = 50 pF. 
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16.0 
SUGGESTED 
SOURCES 
FOR 
Intel486™ 
ACCESSORIES 


Following are some suggested sources of accesso- 
ries for the Inte1486.They are not an endorsement 
of any kind, nor a warranty of the performance of 
any of the listed products and/or companies. 


Sockets 


1. McKenzie Technology 
44370 Old Palmspring Blvd. 
Fremont, CA 
94538 
Tel: (415) 651-2700 
2. E-CAM Technology, Inc. 
14455 North Hayden Rd. 
Suite 208 
Scottsdale, AZ 
85260 
Tel: (602) 443·1949 
3. Augat Inc. (for sockets with decaps) 
Interconnection Products Group 
33 Perry Ave. 
P.O. Box 779 
Attleboro, MA 
02703 
Tel: (508) 222·2202 


Heat Sinks/Fins 


1. Thermalloy Inc. 
2021 West Valley View Lane 
Dallas, TX 
75381-0839 
Tel: (214) 243-4321 
2. E G & G Division 
60 Audubon Road 
Wakefield, MA 
01880 
Tel: (617) 245·5900 


TTL Crystals/Oscillators 


1. NFL Frequency Controls, Inc. 
357 Beloit Street 
Burlington, WI 
53105 
Tel: (414) 763-3591 
2. M-Tron 
P.O. Box 630 
Yankton, SD 
57078 
Tel: (605) 665-9321 


Debugging 
Tower 


1. Emulation Technology 
2344 Walsh Ave., Building F 
Santa Clara, CA 
95051 
Tel: (408) 982-0664 


Revision -006 of the Intel486 DX Microprocessor 
Data Book contains many updates and improve- 
ments to the original version. A revision summary of 
major changes is listed below: 


The sections significantly revised since version -001 
are: 
Section 2.1.2 
The polarity and names of the two 
cache control bits in Control Regis- 
ter 0 (CRO) have been modified. 
The Cache Enable (CE) and Writes 
Transparent (WR) have been re- 
named Cache Disable (CD) and Not 
Write Through (NW). The value of 
CRO 
after 
RESET 
has 
been 
changed 
to 
reflect 
the 
polarity 
change. 
Section 6.2.15 The discussion of A20M# has been 
clarifed. During the falling edge of 
RESET, A20M # should be high, for 
proper operation of the CPU. 
section 6.5 
The value of CROafter RESET has 
been modified. 


section 6.5.1 
Figure 6.3, "Pin State during RE- 
SET" is added. This Figure is a gen- 
eral reference for Reset issues. Pre- 
vious Figures 8.1, 8.2, and 8.8 have 
been deleted, since Figure 6.3 now 
contains Reset information. 
section 7.2.10 A discussion of addresses and byte 
enables driven during INTA cycles 
has been added. 
section 10.1 
Clock counts and opcodes 
have 
been clarifiec,land corrected. 


Section 10.1 
The opcode slot for CMPXCHG in- 
struction 
has 
been 
moved 
from 
OFA6/A7 to OFBO/B1. 
section 12.2 
Table 12.1 has been enhanced. The 
"Case 
Temperature 
under 
Bias" 
spec was improved. The "Supply 
Voltage with Respect to Vss" spec 
was added. 
section 12.3 
Maximum Icc values have been im- 
proved to 700 mA at 25 MHz and 
900 mA at 33 MHz. 
Section 12.3 
Typical Ice values have been modi- 
fied to 550 mA at 25 MHz and 
700 mA at 33 MHz. 
Section 12.3 
CIN, Co, and CCLK values have 
been changed to 20 pF. Testing pa- 
rameters and Note 7 were added. 


section 12.5 
Section 13.1 


The A.C. Specifications have been 
improved. Float delays were im- 
proved at both 25 MHz and 33 
MHz. Note 1 was added to the float 
delays. Maximum valid delays were 
reduced at 33 MHz. 
The ICD section was enhanced. 
Thermal resistance 
(JCA values of 
the 168-pin ceramic package have 
been corrected. 
Maximum 
ambient 
temperatures 
have been corrected to use the 
max Icc values. 


Section 2.1.2.1 
Table 2.16 


The sections significantly revised since version -002 
are: 


Section 6.3.1 
section 6.3.2 


Spec change for PC and PWT bits. 
Value of intel Reserved Interrupt 
Vector 
assignment 
corrected 
to 
'18-31'. 
Added CMPCHG, XADD instruc- 
tions in the table. 
Added explanation about NMI not 
able to bring out the processor from 
shutdown under certain conditions. 
Value of task switching time cor- 
rected to 10 ms. 
Specification change for PCD and 
PWT bits. 
Specification change for PCD and 
PWT bits. 
Cache 
flushing 
procedure 
ex- 
plained, when 
FLUSH# 
applied 
synchronously or asynchronously. 
Specification 
change for 
PLOCK 
cycle. 


Added explanation for warm boot- 
up. 


Specification change for PCD and 
PWT bits. 
Explanation added for FERR# be- 
havior. 
Explanation added of IGNNE# be- 
havior. 
Explanation added for A20M # be- 
havior in protected mode and dur- 
ing RESET. 


Simplified example for read reor- 
dering in write buffers. 


Corrected REP OUTS instruction. 
Added explanation about cache up- 
date on read-modify-write cycle. 
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Added RESET pulse length require- 
ment with or without BIST 
Added table for Intel486 revision 
10. 
Corrected CROvalue after Reset. 


Corrected pin state diagram during 
RESET. 
RESET 
pulse 
length 
changed to 15 CLKs. 
Section 7.2.2.3 Added 
explanation 
to 
terminate 
burst cycle. 
Section 7.2.3.4 Clarified tex1 on changing KEN# 
during cache line fill. 


Corrected timing diagram to show 
A4-A31 , M/IO#, 
D/C#, W/R# 
do 
not change during burst. 


Corrected timing diagram to show 
A4-A31 , M/IO#, 
D/C#, W/R# 
do 
not change during burst. 


Corrected timing diagram to show 
A4-A31, M/IO#, 
D/C#, W/R# 
do 
not change during burst. 


Section 7.2.4.2 Added cases that follow burst or· 
der. 
Section 7.2.6 
Added explanation for read·modify· 
write for un·aligned transfers. 
Section 7.2.7 
HOLD latency decreased by provid- 
ing window in PLOCK cycle (speci- 
fication change). 


Section 7.2.8 
Added explanation about EADS# 
timing. 
Section 7.2.8 
Added the case of invalidation with 
BOFF or HOLD. 
Change 
in 
Timing 
Diagram 
for 
BREQ. 
Change 
in 
Timing 
Diagram 
for 
BREQ. 
Change 
in 
Timing 
Diagram 
for 
RDY#lBRDY#. 
Section 7.2.9 
Added 
explanation 
about 
HOLD 
getting recognized during unaligned 
writes. 
Section 7.2.11 Added status of address and data 
busses during special bus cycles. 
Section 7.2.11 
Added sections on Halt and Shut- 
down cycles: 
Corrected state diagram by ANDing 
BRDY# and BLAST# for the last 
transfer of the burst cycle. 


Section 7.2.14 
Difference 
in 
FERR# 
and 
ER- 
ROR# explained. 
Changed Reset width to 15 CLKs. 


Table 6.2 
Figure 6.3 


Table 10.1 
Section 11.0 


Section 12.3 
Section 12.3 


Figure 12.2 & 
Figure 12.3 


Added 
explanation 
on 
tri-state 
status. 
Corrected value in format. 


Added 
Note 6 on 
FERR# 
and 
ERROR# difference. 
Added TLB replacement algorithm 
for 386 OX. 
Corrected values in Note 2. 
Added "internal" 
for 
pullup and 
pulldown resistors 
Waveforms for input and output sig- 
nals have been re-drawn to show 
details about set-up, hold and float 
times. 
Added details about TA calculation 
from 9JCand 9JA· 
Added new section on suggested 
sources 
of 
Intel486 
accessories 
like sockets, debugging tower, heat 
sinks, etc. 


The sections significantly revised since revision -003 
are: 
Cover Page 


Added Test Access Port pin de- 
scriptions. 
Added Test Access Port pin sam- 
ple/driven data. 
Added DO,cAx, and cBx revision 10 
information. 
Section 6.2.9 
Added description of HOLD recog- 
nition during BOFF# . 
Section 6.2.12 Added PCD and PWT description 
when paging disabled. 
Section 6.2.16 
Added signal description for Test 
Access Port signals. 
Added DO,cAx, and cBx revision 10 
information. 
Added additional details on signal 
samping during RESET. 
Added HOLD to state transition be- 
tween Tb and T1b. 


Section 8.0, 8.5 Added Boundary Scan to test fea- 
ture description. 
Added 50 MHz D.C. specifications. 
Added 50 MHz A.C. specifications. 


Figure 1.3 
and Figure 1.4 
Pin Cross 
Reference 
Table 
Quick Pin 
Reference 
Table 1.4 


Table 12.2 
Table 12.3 


Add 50 MHz information to tex1and 
block diagram. 
Added 50 MHz pinout diagrams. 


Added column for Test Access Port 
pins. 
• 


Figure 12.7 
Added Test signal timing reference 
diagram. 
Figure 12.4.2 
Added 50 MHz capacitive load de- 
rating curves. 
Table 13.2 
Added 50 MHz thermal resistance 
values. 
Table 13.3 
Added 50 MHz ambient tempera- 
ture data. 


The sections significantly revised since revision -004 
are: 


Intel OverDrive Processor information/specifications 
have been added throughout the document. Section 
13.0 contains OverDrive Processor specific informa- 
tion. 


Low Power Intel486 OX CPU information/specifica- 
tions have been added. Section 15.0 contains Low 
Power specific information. 
Quick Pin Reference 
Clarified the description for 
KEN#. 
Table 1·5 
Updated component and re- 
vision 1.0. information. 


More clearly defined A20M # 
bit by defining functionality 
during I/O writes, prefetch- 
ing, etc. 
Designated 
that 
FLUSH# 
must be inactive during SIST. 


Added 
section 
for 
floating 
point 
error 
handling 
in 
AT 
compatible systems. 
Clarified 
A20M# , 
FLUSH# 
and AHOLD functionality dur- 
ing SIST. 
SSDL is now available through 
Intel. 


Clarified CSW and CWO. Cor- 
rected 
REP 
LODS, 
REP 
MOVS and REP STOS. 
Corrected REP INS and REP 
OUTS. 
Corrected 
FSTP, 
FUCOMP, 
FSUSR, FDIV, FDIVR. 


Added appendix for CPU Iden- 
tification Code. 


Figure 1.5 
PQFP pin tables 


Added PQFP package pinout. 
Added pin cross reference by 
signal type and complete pin 
reference tables. 
Added clarification for the built 
in self test (SIST) during reset. 
Added explanation of bus hold 
and hold acknowledge proto- 
col. 


Added 
figure 
to 
illustrate 
HOLD request acknowledge 
during SOFF# . 


APPENDIX 
A 
INTEL RECOMMENDED 
CPU IDENTIFICATION 
CODE 


The CPU identification 
assembly 
code will determine 
for the user which Intel microprocessor 
and if a Intel Math 


CoProcessor 
is installed 
in the system. 
If a 486 microprocessor 
has been installed, 
the program 
will determine 
if the CPU is with/without 
a floating 
point unit. This code should be executed 
so the system 
can be configured 


for a particular 
application, 
which 
may depend 
on the microprocessor 
and Math CoProcessor 
installed 
in the 


system. 


TITLE 
CPUID 
DOSSEG 
.model 
small 


•data 
fp_status 
dw 


id_mess 
db 


fp_8087 
db 


fp_80287 
db 
fp_80387 
db 


c8086 
db 
c286 
db 
c386 
db 


c486 
db 


c486nfp 
db 
period 
db 
present_86 
dw 
present_286 
dw 
present_386 
dw 


present_486 
dw 


? 
"This 
system 
has as" 
"and an 8087 
Math 
CoProcessorS" 
"and an 287 Math 
CoProcessorS" 
"and an 387 Math 
CoProcessorS" 
"n8086/8088 
microprocessorS" 
"n80286 
microprocessorS" 
"386 microprocessorS" 
"486 DX microprocesser/487 
SX Math 
"CoProcessorS" 
"486 
SX MicroprocesserS" 
".S".13,lO 
o 
o 
o 
o 


The purpose 
of this 
code 
is to allow 
the user 
the ability 
to 
identify 


the processor 
and 
coprocesor 
that 
is currently 
in the 
system. 
The 
algorithm 
of the program 
is to 
first 
determine 
the processor 
id. 
When 
that 
is accomplished, 
the program 
continues 
to then 
identify 
whether 
a coprocessor 
exists 
in the 
system. 
If a coprocessor 
or 


integrated 
coprocessor 
exists, 
the program 
will 
identify 
the 


coprocessor 
id. If one does 
not 
exist, 
the program 
then 
terminates. 


mov 
mov 
int 


ax,@data 
dS,ax 


dX,offset 
id_mess 
ah,9h 
21h 


• 


8086 
check 
Bits 
12-15 
are 
always 
set 
on the 
8086 
processor. 


pushf 
pop 
mov 
and 
push 
popf 
pushf 
pop 
and 
cmp 
mov 
mov 
je 


bx 
ax,Offfh 
ax,bx 
ax 


ax 
ax,OfOOOh 
ax,OfOOOh 
dx,offset 
c8086 
present_86,l 
check_fpu 


save 
EFLAGS 
store 
EFLAGS 
in BX 
clear 
bits 
12-15 
in EFLAGS 
store 
new 
EFLAGS 
value 
on 
stack 
replace 
current 
EFLAGS 
value 
set 
new 
EFLAGS 
store 
new 
EFLAGS 
in AX 
if bits 
12-15 
are 
set, 
then 
CPU 
is an 8086/8088 
store 
8086/8088 
message 
turn 
on 8086/8088 
flag 
if CPU 
is 8086/8088, 
check 
for 
8087 


80286 
CPU 
Check 
Bits 
12-15 
are 
always 
clear 
on the 
80286 
processor. 


or 
push 
popf 
pushf 
pop 
and 
mov 
mov 
mov 
jz 


bx,OfOOOh 
bx 


ax 
ax,OfOOOh 
dx,offset 
c286 
present_86,O 
present_286,l 
check_fpu 


386 
CPU 
check 
The 
AC 
bit, 
bit 
#18, 
is a new 
on the 
486 
DX 
CPU 
to generate 
on the 
486 
DX 
CPU, 
but 
not 
on 


if bits 
12-15 
are 
cleared, 
then 
CPU 
is an 
80286 
turn 
off 
8086/8088 
flag 
turn 
on 80286 
flag 
if CPU 
is 80286, 
check 
for 
80287 


bit 
introduced 
in the 
EFLAGS 
register 
alignment 
faults. 
This 
bit 
can 
be 
set 
the 
386 
CPU. 


mov 
bx,sp 
save 
current 
stack 
pointer 
to 
align 
it 
and 
sp,not 
3 
align 
stack 
to avoid 
AC 
fault 
db 
66h 
pushf 
push 
original 
EFLAGS 
db 
66h 
pop 
ax 
get 
original 
EFLAGS 
db 
66h 
mov 
cx,ax 
save 
original 
EFLAGS 
db 
66h 
xor 
EAX,40000h 
xor 
ax,O 
flip 
AC 
bit 
in EFLAGS 
dw 
4 
upper 
16-bits 
of xor 
constant 
db 
66h 
push 
ax 
save 
for 
EFLAGS 
db 
66h 
popf 
copy 
to EFLAGS 


db 
66h 
pushf 
push 
EFLAGS 
db 
66h 
pop 
ax 
get 
new 
EFLAGS 
value 
db 
66h 
xor 
ax,cx 
if AC 
bit 
cannot 
be 
changed, 
CPU 
is 
mov 
dx, offset 
c386 
store 
386 message 
mov 
present_286,O 
turn 
off 
80286 
flag 
mov 
present_386,l 
turn 
on 386 
flag 
je 
check_fpu 
if CPU 
is 386, 
now 
check 
for 
80287/80387 


mov 
mov 
mov 


dx,offset 
c486nfp 
present_386,O 
present_486,l 


store 
486NFP 
message 
turn 
off 
386 
flag 
turn 
on 486 
flag 


Co-processor 
checking 
begins 
here 
for 
the 
8086/80286/386 
CPUs, 
The 
algorithm 
is to determine 
whether 
or not 
the 
floating-point 
status 
and 
control 
words 
can be written 
to, 
the 
correct 
coprocessor 
is then 
determined 
depending 
on the 
processer 
id. 
Coprocessor 
checks 
are 
first 
performed 
for 
an 8086, 
80286 
and 
a 486 
DX 
CPU. 
If the 
coprocessor 
id 
is still 
undetermined, 
the 
system 
must 
contain 
a 386 
CPU. 
The 
386 
CPU may 
work 
with 
either 
an 80287 
or an 80387. 
The 
infinity 
of the 
coprocessor 
must 
be 
checked 
to 
determine 
the 
correct 
coprocessor 
id. 


check 
for 8087/80287/80387 
reset 
FP 
status 
word 
initialize 
temp 
word 
to non-zero 
value 
save 
FP 
status 
word 
check 
FP 
status 
word 
see 
if correct 
status 
with 
written 
jump 
if not 
Valid, 
no NPX 
installed 


fninit 
mov 
fp_status,5a5ah 


fnstsw 
fp_status 
mov 
ax,fp_status 
cmp 
al,O 


fnstcw 
mov 
and 
cmp 


save 
FP 
control 
word 
check 
FP 
control 
word 
see 
if selected 
parts 
looks 
OK 
check 
that 
ones 
and 
zeroes 
correctly 
read 
jump 
if not 
Valid, 
no NPX 
installed 


fp_status 
ax,fp_status 
ax,l03fh 
ax,3fh 


cmp 
je 
jmp 


present_486,l 
is_486 
not_486 


check 
if 486 
flag 
is on 
if so, 
jump 
to print 
486 
message 
else 
continue 
with 
386 
checking 


mov 
jmp 
dx,offset 
c486 
print_one 


• 


cmp 
jne 
presenL386,l 
print_87_287 


ah,9h 
2lh 


fldz 
fdiv 
fld 
st 
fchs 
fcompp 


fstsw 
fp_status 
mov 
aX,fp_status 
mov 
dX,offset 
fp_80287 
sahf 
jz 
restore_EFLAGS 
mov 
dX,offset 
fp_80387 


restore_EFLAGS: 
finit 
mov 
ah,9h 
int 
2lh 
db 
66h 
push 
cx 
db 
66h 
popf 
mov 
sp,bx 
jmp 
exit 


print_one: 
mov 
ah,9h 
int 
2lh 
jmp 
exit 


prinL87_287: 
mov 
ah,9h 
int 
2lh 
cmp 
present_86,l 
mov 
dX,offset 
fp_8087 
je 
print_fpu 
mov 
dX,offset 
fp_80287 


print_fpu: 
mov 
ah,9h 
int 
2lh 
jmp 
exit 


exit: 
mov 
dX,offset 
period 
mov 
ah,9h 
int 
2lh 


mov 
aX,4cOOh 
int 
2lh 


end 
start 


2-444 


check 
if 386 
flag 
is on 
if 386 
flag 
not 
on, 
check 
NPX 
for 
8086/8088/80286 
print 
out 
386 
CPU 
ID first 


must 
use 
default 
control 
from 
FNINIT 
form 
infinity 
8087/80287 
says 
+inf = inf 
form 
negative 
infinity 
80387 
says + inf < > -inf 
see 
if they 
are 
the 
same 
and 
remove 
them 
look 
at status 
from 
FCOMPP 


store 
80287 
message 
see 
if infinities 
matched 
jump 
if 8087/80287 
is present 
store 
80387 
message 


clear 
any pending 
fp exception 
print 
NPX 
message 


restore 
original 
EFLAGS 
register 
restore 
original 
stack 
pointer 


if 8086/8088 
flag 
is on 
store 
8087 
message 


else 
CPU = 80286, 
store 
80287 
message 
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This data sheet describes the Intel486 SX microprocessor, the Intel OverDrive™ Processor, ana the 
Intel487™ SX Math CoProcessor. All nQrmaltext describes the functionality for the Intel486 SX microproces- 
sor, the Intel OverDrive Processor, and the Intel487 SX Math CoProcessor unless explicitly stated otherwise. 
All sections of the data sheet which describe the Intel487 SX Math CoProcessor and Intel OverDrive Proces- 
sor functionality only are highlighted as shown in the following example: 


This is an example of what the highlighted sections look like. The highlighted sections describe func- 
tionality which apply to the Intel487 SX Math CoProcessor and Intel OverDrive Processor only. 


All references to the Intel OverDrive Processor are also applicable to the Intel487 SX Math CoProcessor 
unless otherwise stated. 


• 
Binary Compatible with Large 
Software Base 
-MS-DOS·, 
OS/2··, Windows 
-UNIX··· 
System V/386 
-IRMX@, IRMK Kernels 


• 
High Integration Enables On-Chip 
- 
8 Kbyte Code and Data Cache 
- 
Paged, Virtual Memory Management 


• 
Easy To Use 
- 
Built-In Self Test 
- 
Hardware Debugging Support 
- 
Intel Software Support 
- 
Extensive Third Party Software 
Support 


• 
196-Lead PQFPand 168-PlnGrid Array 
Package for Intel486™ SX 
Microprocessor 


169-Pln Grid Array Package for Intel 
OverDriveTU Processor and Intel487TU 
SX Math CoProcessor 


• 
High Performance Design 
-lntel486 
One Clock Instruction Core 
-33,25,20 
and 16 MHz Clock 
Frequencies at 5V 
- 
25, 20 and 16 MHz Clock Frequencies 
at 3.3V 
-106 
Mbyte/sec Burst Bus at 33 MHz 
- 
CHMOSIV and CHMOSV Process 
Technology 
- 
Dynamic Bus Sizing for 8-, 16- and 
32-Blt Busses 


• 
Complete 32-Blt Architecture 
- 
Address and Data Busses 
-Registers 
- 
8-, 16- and 32-Blt Data Types 


• 
Multiprocessor Support 
- 
Multiprocessor Instructions 
- 
Cache Consistency Protocols 
- 
Support for Second Level Cache 


• 
Optional Intel OverDrlve™ Processor/ 
Intel487™ SX Math CoProcessor 
(16-25 MHz only) 


• 
IEEE 1149.1Boundary Scan 
Compatibility 
- 
Available on PQFPIntel486 SX CPU 
Only 


Intel386™, 
Inte1387™, 
i386™, 
i387™, 
Intel486™, 
Intel487™, 
i486™, 
OverDrive™ 
and i487TM are trademarks 
of Intel Corporation. 


·MS-DOS, 
WINDOWS 
are registered 
trademarks 
of Microsoft 
Corporation. 
··OS/2 
is a trademark 
of Microsoft 
Corporation. 
···UNIX 
is a trademark 
of Unix Systems 
Labs. 


• 


Intel486TM 
SX CPU based 
systems 
are the entry-level 
business 
standard, 
delivering 
affordable 
performance 
and upgradability 
with Intel's 
OverDrive™ 
Processor. 
The Intel486 
SX CPU is the industry 
standard 
for entry 
level business 
computing. 
It has replaced 
the 80386 
as the minimum 
performance 
requirement 
for the de- 
mands of today's 
software. 
Intel486 
SX CPU based systems 
are now available 
at comparable 
prices to 80386 
systems. 
Additionally, 
they offer 
investment 
protection 
through 
built-in 
single 
chip 
upgradability 
with 
Intel's 
OverDrive 
Processor. 
The Intel486 
SX microprocessor 
is a binary compatible 
member 
of the Intel486 
micro- 
processor 
family, giving it access to the $50 Billion installed 
software 
base of over 50,000 MS-DOS, Windows, 
OS/2, 
and UNIX System V/386 
applications. 
The Intel486 
SX microprocessor 
has the same integrated 
RISC 
integer core, 8 KByte cache memory, 
and memory 
management 
unit as the Intel486 
DX microprocessor. 
The 
Intel OverDrive 
Processor 
provides 
optional 
overall 
performance 
upgrade 
capability 
for users who 
want 
to 
increase 
their system 
performance 
up to 70% on DOS, UNIX, WINDOWS, 
OS/2, 
and UNIX applications. 


The high-performance 
RISC integer core of the Intel486 
SX microprocessor 
executes 
frequently 
used instruc- 
tions in one clock cycle. An 8 KByte unified code and data cache allow this performance 
level to be sustained. 


A 106 Mbyte/sec 
burst bus at 33 MHz ensures 
high system 
throughput 
even with inexpensive 
DRAMs. 
The 
33 MHz Intel486 
SX microprocessor 
has a Norton SI V5.0 rating of 72.1, a Dhrystone 
MIPS rating of 27, and a 
9.5 SPECint92 
rating. It provides 
up to twice the performance 
of a 33 MHz Intel386 
DX microprocessor 
with an 
external 
cache 
(depending 
on the application). 


NOTE: 
The Intel486 
SX CPU and Intel OverDrive 
Processor 
are available 
at 33 MHz external 
clock frequency. 
How- 
ever, the Intel487 
SX Math CoProcessor 
is not available 
at 33 MHz. 
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K2 
H17 
A14 
55 
012 
G3 
B516# 
C17 
C11 
K16 
K1 
A15 
R7 
013 
02 
CLK 
C3 
C12 
L16 
K17 
A16 
09 
014 
K3 
O/C# 
M15 
C13 
M2 
L1 
An 
03 
015 
F3 
OPO 
N3 
C14 
M16 
L17 
A18 
R5 
016 
J3 
OP1 
F1 
G15 
P16 
M1 
A19 
04 
017 
03 
OP2 
H3 
R17 
R3 
M17 
A20 
08 
018 
C2 
OP3 
A5 
54 
R6 
P17 
A21 
05 
019 
B1 
EA05# 
B17 
R8 
02 
A22 
07 
020 
A1 
FLU5H# 
C15 
R9 
R4 
A23 
53 
021 
B2 
HLOA 
P15 
R10 
56 
A24 
06 
022 
A2 
HOLO 
E15 
R11 
58 
A25 
R2 
023 
A4 
INTR 
A16 
R14 
59 
A26 
52 
024 
A6 
KEN# 
F15 
510 
A27 
51 
025 
B6 
LOCK # 
N15 
511 
A28 
R1 
026 
C7 
M/IO# 
N16 
512 
A29 
P2 
027 
C6 
NMI 
A15 
514 
A30 
P3 
028 
C8 
PCO 
J17 
A31 
01 
029 
A8 
PCHK# 
017 
030 
C9 
PWT 
L15 
031 
B8 
PLOCKI' 
016 
ROYI' 
F16 
RE5ET 
C16 
W/R# 
N17 


Address 
Data 
Control 
H/C 
Vcc 
Vss 


A2 
Q14 
00 
P1 
A20M# 
015 
A3 
B7 
A7 
A3 
R15 
01 
N2 
AOS# 
S17 
A10 
B9 
A9 
A4 
S16 
02 
N1 
AHOLO 
A17 
A12 
B11 
A11 
A5 
Q12 
03 
H2 
BEO# 
K15 
A14 
C4 
B3 
A6 
S15 
04 
M3 
BE1# 
J16 
B10 
C5 
B4 
A7 
Q13 
Os 
J2 
BE2# 
J15 
B12 
E2 
B5 
A8 
R13 
06 
L2 
BE3# 
F17 
B13 
E16 
E1 


A9 
Q11 
07 
L3 
BLAST # 
R16 
B16 
G2 
E17 
AlO 
S13 
08 
F2 
BOFF# 
017 
C10 
G16 
G1 
A11 
R12 
09 
01 
BROY# 
H15 
C11 
H16 
G17 
A12 
S7 
010 
E3 
BREQ 
Q15 
C12 
J1 
H1 


A13 
Q10 
011 
C1 
BS8# 
016 
C13 
K2 
H17 
A14 
S5 
012 
G3 
BS16# 
C17 
C14 
K16 
K1 
A15 
R7 
013 
02 
CLK 
C3 
G15 
L16 
K17 
A16 
Q9 
014 
K3 
O/C# 
M15 
R17 
M2 
L1 
A17 
Q3 
015 
F3 
OPO 
N3 
S4 
M16 
L17 
A18 
R5 
016 
J3 
OP1 
F1 
04 
P16 
M1 
A19 
Q4 
017 
03 
OP2 
H3 
R3 
M17 
A20 
Q8 
018 
C2 
OP3 
A5 
R6 
P17 
A21 
Q5 
019 
B1 
EAOS# 
B17 
R8 
Q2 
A22 
Q7 
020 
A1 
FERR# 
A13 
R9 
R4 
A23 
S3 
021 
B2 
FLUSH # 
C15 
R10 
S6 
A24 
Q6 
022 
A2 
HLOA 
P15 
R11 
S8 
A25 
R2 
023 
A4 
HOLO 
E15 
R14 
S9 
A26 
S2 
D24 
A6 
IGNNE# 
A15 
S10 
A27 
S1 
025 
B6 
INTR 
A16 
S11 


A28 
R1 
026 
C7 
KEN# 
F15 
S12 
A29 
P2 
027 
C6 
LOCK# 
N15 
S14 
A30 
P3 
028 
C8 
M/IO# 
N16 
A31 
Q1 
029 
A8 
UP# 
B14 
D30 
C9 
NMI 
B15 
031 
B8 
PCO 
J17 
PCHK# 
Q17 
PWT 
L15 
PLOCK# 
Q16 
ROY# 
F16 
RESET 
C16 
W/R# 
N17 
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Pin 
Signal 
Pin 
Signal 
Pin 
Signal 
Pin 
Signal 


1 
VSS 
50 
VSS 
99 
VSS 
148 
VSS 
2 
A21 
51 
021 
100 
NMI 
149 
NC 
3 
A22 
52 
NC 
101 
INTR 
150 
A3 
4 
A23 
53 
022 
102 
FLUSH# 
151 
NC 
5 
A24 
54 
VCC 
103 
RESET 
152 
A4 
6 
VCC 
55 
023 
104 
A20M# 
153 
NC 
7 
A25 
56 
NC 
105 
EAOS# 
154 
A5 
8 
A26 
57 
OP3 
106 
PCO 
155 
NC 
9 
A27 
58 
VSS 
107 
VCC 
156 
UP# 
10 
A28 
59 
024 
108 
PWT 
157 
NC 
11 
VSS 
60 
NC 
109 
VSS 
158 
A6 
12 
A29 
61 
025 
110 
O/C# 
159 
A7 
13 
A30 
62 
VCC 
111 
M/IO# 
160 
NC 
14 
A31 
63 
026 
112 
VCC 
161 
A8 
15 
NC 
64 
NC 
113 
BE3# 
162 
NC 
16 
OPO 
65 
027 
114 
VSS 
163 
A9 
17 
00 
66 
VSS 
115 
BE2# 
164 
VCC 
18 
01 
67 
028 
116 
BE1# 
165 
AlO 
19 
VCC 
68 
NC 
117 
BEO# 
166 
NC 
20 
02 
69 
029 
118 
BREQ 
167 
VSS 
21 
VSS 
70 
VCC 
119 
VCC 
168 
VSS 
22 
VSS 
71 
030 
120 
W/R# 
169 
NC 
23 
03 
72 
NC 
121 
VSS 
170 
VCC 
24 
VCC 
73 
NC 
122 
HLOA 
171 
NC 
25 
04 
74 
031 
123 
CLK 
172 
A11 
26 
05 
75 
NC 
124 
NC 
173 
NC 
27 
06 
76 
NC 
125 
VCC 
174 
A12 
28 
VCC 
77 
NC 
126 
VSS 
175 
VCC 
29 
07 
78 
NC 
127 
NC 
176 
A13 
30 
OP1 
79 
NC 
128 
TCK 
177 
VSS 
31 
08 
80 
TOO 
129 
AHOLD 
178 
A14 
32 
09 
81 
NC 
130 
HOLO 
179 
VCC 
33 
VSS 
82 
NC 
131 
VCC 
180 
A15 
34 
NC 
83 
NC 
132 
KEN# 
181 
A16 
35 
010 
84 
VCC 
133 
ROY# 
182 
VSS 
36 
VCC 
85 
NC 
134 
NC 
183 
A17 
37 
.011 
86 
VSS 
135 
BS8# 
184 
VCC 
38 
012 
87 
NC 
136 
BS16# 
185 
TOI 
39 
013 
88 
NC 
137 
BOFF# 
186 
NC 
40 
VSS 
89 
NC 
138 
BROY# 
187 
TMS 
41 
014 
90 
NC 
139 
PCHK# 
188 
NC 
42 
015 
91 
NC 
140 
NC 
189 
A18 
43 
OP2 
92 
NC 
141 
VSS 
190 
NC 
44 
016 
93 
VCC 
142 
LOCK# 
191 
A19 
45 
017 
94 
NC 
143 
PLOCK# 
192 
NC 
46 
018 
95 
VSS 
144 
BLAST # 
193 
A20 
47 
019 
96 
VSS 
145 
AOS# 
194 
VSS 
48 
020 
97 
NC 
146 
A2 
195 
NC 
49 
VCC 
98 
VCC 
147 
VCC 
196 
VCC 
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Address 
Data 
Control 
NC 
Vcc 
Vss 


A2 
146 
00 
17 
A20M# 
104 
15 
6 
1 
A3 
150 
01 
18 
AOS# 
145 
34 
19 
11 
A4 
152 
02 
20 
AHOLO 
129 
52 
24 
21 
A5 
154 
03 
23 
BEO# 
117 
56 
28 
22 
A6 
158 
04 
25 
BEH 
116 
60 
36 
33 
A7 
159 
05 
26 
BE2# 
115 
64 
49 
40 
A8 
161 
06 
27 
BE3# 
113 
68 
54 
50 
A9 
163 
07 
29 
BLAST # 
144 
72 
62 
58 
A10 
165 
08 
31 
BOFF# 
137 
73 
70 
66 
An 
172 
09 
32 
BROY# 
138 
75 
84 
86 
A12 
174 
010 
35 
BREQ 
118 
76 
93 
95 
A13 
176 
On 
37 
BS8# 
135 
77 
98 
96 
A14 
178 
012 
38 
BS16# 
136 
78 
107 
99 
A15 
180 
013 
39 
CLK 
123 
79 
112 
109 
A16 
181 
014 
41 
O/C# 
110 
81 
119 
114 
A17 
183 
015 
42 
OPO 
16 
82 
125 
121 
A18 
189 
016 
44 
OP1 
30 
83 
131 
126 
A19 
191 
017 
45 
OP2 
43 
85 
147 
141 
A20 
193 
018 
46 
OP3 
57 
87 
164 
148 
A21 
2 
019 
47 
EAOS# 
105 
88 
170 
167 
A22 
3 
020 
48 
FLUSH # 
102 
89 
175 
168 
A23 
4 
021 
51 
HLOA 
122 
90 
179 
177 
A24 
5 
022 
53 
HOLO 
130 
91 
184 
182 
A25 
7 
023 
55 
INTR 
101 
92 
196 
194 
A26 
8 
024 
59 
KEN# 
132 
94 
A27 
9 
025 
61 
LOCKI' 
142 
97 
A28 
10 
026 
63 
M/IO# 
111 
124 
A29 
12 
027 
65 
NMI 
100 
127 
A30 
13 
028 
67 
PCO 
106 
134 
A31 
14 
029 
69 
PCHK# 
139 
140 
030 
71 
PWT 
108 
149 
031 
74 
PLOCK# 
143 
151 
ROYI' 
133 
153 
RESET 
103 
155 
TOI 
185 
157 
TOO 
80 
160 
TMS 
187 
162 
W/R# 
120 
166 
169 
171 
173 
186 
188 
190 
192 
195 


•• 


intel~ 


Symbol 
Type 
Name and Function 


ClK 
I 
Clock provides 
the fundamental 
timing and the internal operating 
frequency 
for the 
Intel486 SX microprocessor 
lintel 
OverDrive 
Processor. 
All external 
timing parameters 
are specified 
with respect 
to the rising edge of ClK. 


ADDRESS 
BUS 


A31-A4 
I/O 
A31-A2 
are the address lines of the microprocessor. 
A31-A2, 
together 
with the byte 
A2-A3 
0 
enables 
BEO# -BE3#, 
define the physical 
area of memory or input/output 
space 
accessed. 
Address 
lines A31-A4 
are used to drive addresses 
into the microprocessor 
to 
perform 
cache line invalidations. 
Input signals must meet setup and hold times t22 and 
t23. A31-A2 
are not driven during bus or address 
hold. 


BEO-3# 
0 
The byte enable signals indicate active bytes during read and write cycles. 
During the 
first cycle of a cache fill, the external 
system should assume that all byte enables 
are 
active. BE3# 
applies to D24-D31, 
BE2# 
applies to D16-D23, 
BE1 # applies to 
D8-D15 
and BEO# applies to DO-D7. 
BEO#-BE3# 
are active lOW 
and are not driven 
during bus hold. 


DATA BUS 


D31-DO 
I/O 
These are the data lines for the Intel486 SX microprocessor/Intel 
OverDrive 
Processor. 


Lines DO-D7 
define the least significant 
byte of the data bus while lines D24-D31 
define 
the most significant 
byte of the data bus. These signals must meet setup and hold times 
t22 and t23 for proper operation 
on reads. These pins are driven during the second and 
subsequent 
clocks of write cycles. 


DATA 
PARITY 


DPO-DP3 
I/O 
There is one data parity pin for each byte of the data bus. Data parity is generated 
on all 
write data cycles with the same timing as the data driven by the Intel486 SX 
microprocessor/lntel 
OverDrive 
Processor. 
Even parity information 
must be driven back 
into the microprocessor 
on the data parity pins with the same timing as read information 
to insure that the correct 
parity check status is indicated 
by the Intel486 SX 
microprocessor/Intel 
OverDrive 
Processor. 
The signals read on these pins do not affect 
program 
execution. 
Input signals must meet setup and hold times t22 and t23. DPO-DP3 
should be 
connected 
to Vcc through a pullup resistor 
in systems which do not use parity. DPO-DP3 
are active HIGH and are driven during the second and subsequent 
clocks of write cycles. 


PCHK# 
0 
Parity Status is driven on the PCHK # pin the clock after ready for read operations. 
The 
parity status is for data sampled 
at the end of the previous 
clock. A parity error is 
indicated 
by PCHK # being lOW. 
Parity status is only checked 
for enabled 
bytes as 
indicated 
by the byte enable and bus size signals. PCHK # is valid only in the clock 
immediately 
after read data is returned 
to the microprocessor. 
At all other times PCHK # 
is inactive (HIGH). PCHK # is never floated. 


Symbol 
Type 
Name and Function 


BUS CYCLE 
DEFINITION 


M/IO# 
0 
The memory/input-output, 
data/control 
and write/read 
lines are the primary bus 
D/C# 
0 
definition 
signals. These signals are driven valid as the ADS # signal is asserted. 


W/R# 
0 
M/IO# 
D/C# 
W/R# 
Bus Cycle 
Initiated 


0 
0 
0 
Interrupt 
Acknowledge 
0 
0 
1 
Halt/Special 
Cycle 
0 
1 
0 
I/O Read 
0 
1 
1 
I/O Write 
1 
0 
0 
Code Read 
1 
0 
1 
Reserved 
1 
1 
0 
Memory 
Read 
1 
1 
1 
Memory Write 


The bus definition 
signals are not driven during bus hold and follow the timing of the 
address 
bus. Refer to Section 
7.2.11 for a description 
of the special bus cycles. 


LOCK # 
0 
The bus lock pin indicates 
that the current bus cycle is locked. The Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
will not allow a bus hold when LOCK# 
is 
asserted 
(but address 
holds are allowed). 
LOCK # goes active in the first clock of the first 
locked bus cycle and goes inactive after the last clock of the last locked bus cycle. The 
last locked cycle ends when ready is returned. 
LOCK # is active LOW and is not driven 
during bus hold. Locked read cycles will not be transformed 
into cache fill cycles if KEN # 
is returned 
active. 


PLOCK # 
0 
The pseudo-lock 
pin indicates 
that the current bus transaction 
requires 
more than one 
bus cycle to complete. 
For the Intel486 SX microprocessor/Intel 
OverDrive 
Processor, 
examples 
of such operations 
are segment 
table descriptor 
reads (64 bits), in addition to 
cache line fills (128 bits). 


For the Intel486 
SX microprocessor/Intel 
OverDrive 
Processor, 
examples 
of such 
operations 
are floating 
point long reads and writes (64 bits), segment 
table descriptor 
reads (64 bits), in addition to cache line fills (128 bits). 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor 
will drive PLOCK # active 
until the addresses 
for the last bus cycle of the transaction 
have been driven regardless 
of whether 
ROY # or BRDY # have been returned. 


Normally 
PLOCK # and BLAST # are inverse of each other. However 
during the first bus 
cycle of a 64-bit floating 
point write, both PLOCK # and BLAST # will be asserted. 


PLOCK # is a function 
of the BS8 #, BS 16 # and KEN # inputs. PLOCK # should be 
sampled 
only in the clock ready is returned. 
PLOCK # is active LOW and is not driven 
during bus hold. 


BUS CONTROL 


ADS# 
0 
The address status output indicates 
that a valid bus cycle definition 
and address are 
available 
on the cycle definition 
lines and address 
bus. ADS# 
is driven active in the same 
clock as the addresses 
are driven. ADS# 
is active LOW and is not driven during bus hold. 


RDY# 
I 
The non-burst 
ready input indicates 
that the current bus cycle is complete. 
ROY # 


indicates 
that the external 
system has presented 
valid data on the data pins in response 
to a read or that the external 
system has accepted 
data from the Intel486 SX 
microprocessor/Intel 
OverDrive 
Processor 
in response 
to a write. ROY # is ignored when 
the bus is idle and at the end of the first clock of the bus cycle. 
ROY # is active during address hold. Data can be returned 
to the processor 
while AHOLD 
is active. 
ROY # is active LOW, and is not provided 
with an internal pullup resistor. 
ROY # must 
satisfy setup and hold times t16 and t17 for proper chip operation. 
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Symbol 
Type 
Name and Function 


BURST CONTROL 


BRDYII 
I 
The burst ready input performs 
the same function 
during a burst cycle that ROY II 
performs 
during a non-burst 
cycle. BRDY II indicates 
that the external 
system 
has 
presented 
valid data in response 
to a read or that the external 
system has accepted 
data 
in response 
to a write. BRDY II is ignored when the bus is idle and at the end of the first 
clock in a bus cycle. 
BRDY II is sampled 
in the second and subsequent 
clocks of a burst cycle. The data 
presented 
on the data bus will be strobed 
into the microprocessor 
when BRDY II is 
sampled 
active. If ROY II is returned 
simultaneously 
with BRDY II , BRDY II is ignored and 
the burst cycle is prematurely 
aborted. 
BRDY II is active LOW and is provided 
with a small pullup resistor. 
BRDY II must satisfy 
the setup and hold times t16 and t17' 


BLAST II 
0 
The burst last signal indicates 
that the next time BRDY II is returned 
the burst bus cycle is 
complete. 
BLAST II is active for both burst and non-burst 
bus cycles. BLAST II is active 
LOW and is not driven during bus hold. 


INTERRUPTS 


RESET 
I 
The reset 
input forces the Intel486 SX microprocessor/Intel 
OverDrive 
Processor 
to 
begin execution 
at a known state. The microprocessor 
cannot 
begin execution 
of 
instructions 
until at least 1 ms after Vcc and CLK have reached 
their proper DC and AC 
specifications. 
The RESET pin should remain active during this time to insure proper 
microprocessor 
operation. 
RESET is active HIGH. RESET is asynchronous 
but must 
meet setup and hold times t20 and t21 for recognition 
in any specific 
clock. 


INTR 
I 
The maskable 
interrupt 
indicates 
that an external 
interrupt 
has been generated. 
If the 
internal 
interrupt flag is set in EFLAGS, 
active interrupt 
processing 
will be initiated. 
The 
Intel486 
SX microprocessor/Intel 
OverDrive 
Processor 
will generate 
two locked interrupt 
acknowledge 
bus cycles in response 
to the INTR pin going active. INTR must remain 
active until the interrupt 
acknowledges 
have been performed 
to assure that the interrupt 
is recognized. 
INTR is active HIGH and is not provided 
with an internal pulldown 
resistor. 
INTR is 
asynchronous, 
but must meet setup and hold times t20 and t21 for recognition 
in any 
specific 
clock. 


NMI 
I 
The non-maskable 
interrupt 
request signal indicates 
that an external 
non-maskable 
interrupt 
has been generated. 
NMI is rising edge sensitive. 
NMI must be held LOW for at 
least four CLK periods before this rising edge. NMI is not provided 
with an internal 
pulldown 
resistor. 
NMI is asynchronous, 
but must meet setup and hold times t20 and t21 
for recognition 
in any specific 
clock. 


BUS ARBITRATION 


BREa 
0 
The bus request 
signal indicates 
that the Intel486 SX microprocessor/Intel 
OverDrive 
Processor 
has internally 
generated 
a bus request. 
BREa 
is generated 
whether 
or not the 
Intel486 SX microprocessor/Intel 
OverDrive 
Processor 
is driVing the bus. BREa 
is active 
HIGH and is never floated. 


HOLD 
I 
The bus hold request 
allows another 
bus master complete 
control 
of the processor 
bus. 


In response 
to HOLD going active the Intel486 SX microprocessor/Intel 
OverDrive 
Processor 
will float most of its output and input/output 
pins. HLDA will be asserted 
after 
completing 
the current bus cycle, burst cycle or sequence 
of locked cycles. The Intel486 
SX microprocessor/Intel 
OverDrive 
Processor 
will remain in this state until HOLD is 
deasserted. 
HOLD is active high and is not provided with an internal 
pulldown 
resistor. 
HOLD must satisfy setup and hold times t18 and t19 for proper operation. 


Symbol 
Type 
Name and Function 


BUS ARBITRATION 
(Continued) 


HLDA 
0 
Hold acknowledge 
goes active in response 
to a hold request presented 
on the HOLD pin. 
HLDA indicates 
that the Intel486 SX microprocessor/Intel 
OverDrive 
Processor 
has given 
the bus to another 
local bus master. HLDA is driven active in the same clock that the 
Intel486 SX microprocessor/Intel 
OverDrive 
Processor 
floats its bus. HLDA is driven 
inactive when leaving bus hold. HLDA is active HIGH and remains driven during bus hold. 


BOFF# 
I 
The backoffinput 
forces the Intel486 SX microprocessor/Intel 
OverDrive 
Processor 
to 
float its bus in the next clock. The microprocessor 
will float all pins normally floated 
during 
bus hold but HLDA will not be asserted 
in response 
to BOFF #. BOFF # has higher 
priority than ROY # or BRDY #; if both are returned 
in the same clock, BOFF # takes 
effect. The microprocessor 
remains in bus hold until BOFF # is negated. 
If a bus cycle 
was in progress 
when BOFF # was asserted 
the cycle will be restarted. 
BOFF # is active 
LOW and must meet setup and hold times t18 and t19 for proper operation. 


CACHE INVALIDATION 


AHOLD 
I 
The address hold request allows another 
bus master access to the processor's 
address 
bus for a cache invalidation 
cycle. The Intel486 SX microprocessor/Intel 
OverDrive 
Processor 
will stop driving its address bus in the clock following 
AHOLD going active. 
Only the address 
bus will be floated 
during address 
hold, the remainder 
of the bus will 
remain active. AHOLD is active HIGH and is provided 
with a small internal 
pulldown 
resistor. For proper operation 
AHOLD must meet setup and hold times t18 and t19. 


EADS# 
I 
This signal indicates 
that a valid external address 
has been driven onto the Intel486 SX 
microprocessor 
lintel 
OverDrive 
Processor 
address 
pins. This address will be used to 
perform 
an internal cache invalidation 
cycle. EADS# 
is active LOW and is provided 
with 
an internal pullup resistor. 
EADS# 
must satisfy setup and hold times t12 and t13 for 
proper operation. 


CACHE CONTROL 


KEN# 
I 
The cache enable pin is used to determine 
whether 
the current cycle is cacheable. 
When 
the Intel486 SX microprocessor/Intel 
OverDrive 
Processor 
generates 
a cycle that can be 
cached and KEN # is active one clock before ROY # or BRDY # during the first transfer 
of the cycle, the cycle will become 
a cache line fill cycle. Returning 
KEN # active one 
clock before ROY # during the last read in the cache line fill will cause the line to be 
placed in the on-chip cache. KEN # is active LOW and is provided 
with a small internal 
pullup resistor. 
KEN # must satisfy setup and hold times t14 and t15 for proper operation. 


FLUSH# 
I 
The cache flush input forces the Intel486 SX microprocessor 
lintel 
OverDrive 
Processor 
to flush its entire internal cache. FLUSH # is active low and need only be asserted 
for one 
clock. FLUSH # is asynchronous 
but setup and hold times t20 and t21 must be met for 
recognition 
in any specific 
clock. FLUSH# 
causes the Intel486 SX microprocessorllntel 
OverDrive 
Processor 
to enter the tri-state 
test mode. 


PAGE CACHEABILITY 


PWT 
0 
The page write-through 
and page cache disable pins reflect the state of the page 
PCD 
0 
attribute 
bits, PWT and PCD, in the page table entry, page directory 
entry or control 
register 3 (CR3) when paging is enabled. 
If paging is disabled, 
the CPU ignores the PCD 
and PWT bits and assumes 
they are zero for the purpose of caching 
and driving PCD and 
PWT pins. PWT and PCD have the same timing as the cycle definition 
pins (MIIO#, 
D/C#, 
and W/R#). 
PWT and PCD are active HIGH and are not driven during bus hold. 
PCD is masked by the cache disable bit (CD) in Control 
Register O. 


• 


Symbol 
Type 
Name and Function 


NUMERIC 
ERROR REPORTING 


FERR# 
0 
The floating point error pin is driven active when a floating 
point error occurs. 
FERR # is 
similar to the ERROR # pin on the Intel387TM Math CoProcessor. 
FERR,* 
is included 
for 
compatibility 
with systems 
using DOS type floating 
point error reporting. 
FERR,* 
wilt not 
go active if FP errors are masked in FPU register. 
FERR # is active LOW, and is not 
floated 
during bus hold. 


IGNNE# 
I 
When the ignore numeric error pin is asserted 
the Intel OverDrive 
Processor 
will ignore a 
numeric error and continue 
executing 
non-control 
floating 
point instructions, 
but FERR# 
will still be activated 
by the Intel OverDrive 
Processor 
When IGNNE # is deasserted 
the 
Intel OverDrive 
Processor 
wilt freeze on a non-control 
floating 
point instruction, 
if a 
previous 
floating 
point instruction 
caused an error. IGNNE # has no effect when the NE 
bit in control 
register 0 is set. IGNNE# 
is active lOW 
and is provided 
with a smaH intemaJ 
pullup resistor. IGNNE,* 
is aaynchronous 
but setup and hold times t20 and t21 must be 
met to insure recognition 
on any specific clock. 


INTEL 
OverDrive™ 
PROCESSOR 
INTERFACE 


UP# 
0 
The upgrade present 
pin is used to signal the Intel486 microprocessor 
to float its outputs 
and get-off the bus. This pin can also be used to check the presence 
of the OverDrive 
processor 
in the two socket upgrade circuit. It is active low and is never floated. 
UP,* is 
driven low at power-up 
and remain active for the entire duration 
of the Intel OverDrive 
Processor 
operation. 


KEY PIN 


KEY 
The KEY pin is an electrically 
non.functlonal 
pin which Is used to insure correct 
Intel 
OverDrive 
Processor 
orientation 
in a 169 pin socket. 


BUS SIZE CONTROL 


8S16# 
I 
The bus size 16 and bus size 8 pins (bus sizing pins) cause the Intel486 SX 
8S8# 
I 
microprocessor/Intel 
OverDrive 
Processor 
to run multiple 
bus cycles to complete 
a 
request from devices that cannot provide or accept 32 bits of data in a single cycle. The 
bus sizing pins are sampled 
every clock. The state of these pins in the clock before ready 
is used by the Intel486 SX microprocessor/Intel 
OverDrive 
Processor 
to determine 
the 
bus size. These signals are active LOW and are provided 
with internal pullup resistors. 
These inputs must satisfy setup and hold times t14 and t15 for proper operation. 


ADDRESS 
MASK 


A20M# 
I 
When the address bit 20 mask pin is asserted, 
the Intel486 SX microprocessor/Intel 
OverDrive 
Processer 
masks physical address 
bit 20 (A20) before per10rming a lookup to 
the internal cache or driving a memory cycle on the bus. A20M # emulates 
the address 
wraparound 
at one Mbyte which occurs on the 8086. A20M # is active LOW and should 
be asserted 
only when the processor 
is in real mode. This pin is asynchrous 
but should 
meet setup and hold times t20 and t21 for recognition 
in any specific 
clock. For proper 
operation, 
A20M # should be sampled 
high at the falling edge of RESET. 


Symbol 
Type 
Name and Function 


PERFORMANCE 
UPGRADE 
SUPPORT 
(Inte1486 SX CPU PQFP Version 
Only) 


UP# 
I 
Upgrade Present 
forces the Intel486 SX CPU to tri-state 
all its outputs and enter the 
power down mode. UP*, is active low and is sampled 
at all times, including 
after power-up 
and during reset. UP# 
is provided 
with an internal pull-up resistor. 


TEST ACCESS 
PORT (Inte1486 SX CPU PQFP Version 
Only) 


TCK 
I 
Test Clock is an input to the Intel486™ 
CPU and provides 
the clocking 
function 
required 
by the JT AG Boundary 
scan feature. TCK is used to clock state information 
and data into 
component 
on the rising edge of TCK on TMS and TOI, respectively. 
Oata is clocked 
out 
of the part on the falling edge of TCK and TOO. TCK is provided 
with an internal 
pull-up 
resistor. 


TOI 
I 
Test Data Input 
is the serial input used to shift JT AG instructions 
and data into 
component. 
TOI is sampled 
on the rising edge of TCK, during the SHIFT-IR 
and SHIFT-OR 
TAP controller 
states. Ouring all other tap controller 
states, TOI is a "don't 
care". 
TOI is 
provided with an internal pull-up resistor. 


TOO 
0 
Test Data Output 
is the serial output used to shift JT AG instructions 
and data out of the 
component. 
TOO is driven on the falling edge of TCK during the SHIFT-IR 
and SHIFT-OR 
TAP controller 
states. At all other times TOO is driven to the high impedance 
state. 


TMS 
I 
Test Mode Select 
is decoded 
by the JT AG TAP (Tap Access 
Port) to select the operation 
of the test logic. TMS is sampled 
on the rising edge of TCK. To guarantee 
deterministic 
behavior 
of the TAP controller 
TMS is provided with an internal pull-up resistor. • 


Name 
Active 
When 
Level 
Floated 


BREQ 
HIGH 
HLDA 
HIGH 
BEO#-BE3# 
LOW 
Bus Hold 
PWT, PCD 
HIGH 
Bus Hold 
W/R#, 
D/C#, 
M/IO# 
HIGH 
Bus Hold 
LOCK# 
LOW 
Bus Hold 
PLOCK# 
LOW 
Bus Hold 
ADS# 
LOW 
Bus Hold 
BLAST # 
LOW 
Bus Hold 
PCHK# 
LOW 


FERR# 
LOW 


A2-A3 
HIGH 
Bus, Address 
Hold 


MP# 
LOW 


intel~ 


Name 
Active 
Synchronousl 
Level 
Asynchronous 


CLK 
RESET 
HIGH 
Asynchronous 
HOLD 
HIGH 
Synchronous 
AHOLD 
HIGH 
Synchronous 
EADS# 
LOW 
Synchronous 
BOFF# 
LOW 
Synchronous 
FLUSH# 
LOW 
Asynchronous 
A20M# 
LOW 
Asynchronous 
BS16#, 
BS8# 
LOW 
Synchronous 
KEN# 
LOW 
Synchronous 
RDY# 
LOW 
Synchronous 
BRDY# 
LOW 
Synchronous 
INTR 
HIGH 
Asynchronous 
NMI 
HIGH 
Asynchronous 


IGNNE# 
LOW 
Asynchronous 
UP#(1) 
LOW 
Asynchronous 


NOTE: 
1. The UP", 
pin is present 
on the Intel486 
SX CPU PQFP 
package 
only. 


Name 
Active 
When 
Level 
Floated 


00-031 
HIGH 
Bus Hold 
DPO-DP3 
HIGH 
Bus Hold 
A4-A31 
HIGH 
Bus, Address 
Hold 


Name 
Input 
or 
Sampled I 
Output 
Driven On 


TCK 
Input 
N/A 


TDI 
Input 
Rising Edge of TCK 


TOO 
Output 
Falling Edge of TCK 


TMS 
Input 
Rising Edge of TCK 


Product 
Stepping 
Component 
ID 
Revision 
ID 


i486 SX Microprocessor 
AO 
04 
20 
BO 
04 
22 
cAO 
04 
27 
cBO 
04 
28 


i487 SX Math CoProcessor 
AO 
04 
20 
BO 
04 
21 


OverDrive 
Processor 
A2 
04 
32 
B1 
04 
33 


NOTE: 
Table 
1.6 shows 
the Component 
10 number 
and Revision 
10 number 
for the B-O stepping 
of the 
Intel486 
SX Microprocessor 
and Intel OverDrive 
Processor. 
When 
an Intel OverDrive 
Processor 
is installed 
in the system, 
the Component 
10 and Revision 
10 is provided 
by the Intel OverDrive 
Processor 
and not the Intel486 
SX Microprocessor. 
The Component 
10 and Revision 
10 read by 
the 
BIOS/software 
may change 
when 
an OverDrive 
Processor 
is installed 
in an Intel486 
SX 
Microprocessor 
based 
system. 


The Intel486 SX microprocessor/lntel 
OverDrive 
Processor is a 32-bit architecture with on-chip mem- 
ory management, and cache memory units. 


The Intel OverDrive Processor is the upgrade for 
the Intel486 SX microprocessor. It Is a 32-bit ar- 
chitecture with on-chip memory management, 
floating point and cache memory units. 


The Intel486 SX microprocessor/lntel 
OverDrive 
Processor contains all the features of the Intel386 
microprocessor with enhancements to increase per- 
formance. The instruction set includes the complete 
Intel386 microprocessor instruction set along with 
extensions to serve new applications. The on-chip 
memory management unit (MMU) is completely 
compatible with the Intel386 microprocessor MMU. 
All software written for the Intel386 microprocessor 
and previous members of the Inte1386TM/lnte1486TM 
architectural family will run on the Intel486 SX micro- 
processor/Intel 
OverDrive Processor without any 
modifications. 


The Intel OverDrive Processor brings the 387 
Math CoProcessor on-chlp. All software written 
for the Intel386 microprocessor, Intel387 Math 
CoProcessor or Intel486 DX microprocessor and 
previous members of the 86/87 
architectural 
family will run on the Intel OverDrive Processor 
without any modifications. 


Several enhancements have been added to in- 
crease performance. On-chip cache memory allows 
frequently used data and code to be stored on-chip 
reducing accesses to the external bus. RiSe design 
techniques have been used to reduce instruction cy- 
cle times. A burst bus feature enables fast cache 
fills. All of these features, combined, lead to per- 
formance greater than twice that of an Intel386 mi- 
croprocessor. 


The memory management unit (MMU) consists of a 
segmentation unit and a paging unit. Segmentation 
allows management of the logical address space by 
providing easy data and code relocatibility and effi- 
cient sharing of global resources. The paging mech- 
anism operates beneath segmentation and is trans- 
parent to the segmentation process. Paging is op- 
tional and can be disabled by system software. Each 
segment can be divided into one or more 4 Kbyte 
segments. To implement a virtual memory system, 
full restartability for all page and segment faults is 
supported. 


Memory is organized into one or more variable 
length 
segments, 
each 
up 
to 
four 
gigabytes 


(232 bytes) in size. A segment can have attributes 
associated with it which include its location, size, 
type (I.e., stack, code or data), and protection char- 
acteristics. Each task on an Intel486 SX microproc- 
essor/Intel OverDrive Processor can have a maxi- 
mum of 16,381 segments, each up to four gigabytes 
in size. Thus each task has a maximum of 64 tera- 
bytes (trillion bytes) of virtual memory. 


The segmentation unit provides four-levels of pro- 
tection for isolating and protecting applications and 
the operating system from each other. The hardware 
enforced protection allows the design of systems 
with a high degree of integrity. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor has two modes of operation: Real Ad- 2 
dress Mode (Real Mode) and Protected Mode Virtu- 
al Address Mode (Protected Mode). In Real Mode 
the 
Intel486 SX microprocessor/Intel 
OverDrive 
Processor operates as a very fast 8086. Real Mode 
is required primarily to set up the Intel486 SX micro- 
processor/Intel 
OverDrive Processor for Protected 
Mode operation. Protected Mode provides access to 
the sophisticated memory management paging and 
privilege capabilities of the processor. 


Within Protected Mode, software can perform a task 
switch to enter into tasks designated as Virtual 8086 
Mode tasks. Each virtual 8086 task behaves with 
8086 semantics, allowing 8086 software (an applica- 
tion program or an entire operating system) to exe- 
cute. 


The on-chip floating point unit operates in paral- 
lel with the arithmetic and logic unit and provides 
arithmetic instructions for a variety of numeric 
data types. It executes numerous built-in tran- 
scendental functions (e.g., tangent, sine, cosine, 
and log functions). The floating point unit fully 
conforms to the ANSI/IEEE standard 754·1985 
for floating point arithmetic. 


The on-chip cache is 8 Kbytes in size. It is 4-way set 
associative and follows a write-through policy. The 
on-chip cache includes features to provide flexibility 
in external memory system design. Individual pages 
can be designated as cacheable or non-cacheable 
by software or hardware. The cache can also be en- 
abled and disabled by software or hardware. 


Finally the Intel486 SX microprocessor/Intel Over- 
Drive Processor have features to facilitate high per- 
formance hardware designs. The 1X clock input eas- 
es high frequency board level designs. The burst 
bus feature enables fast cache fills. These features 
are described beginning in Section 6. 


The Intel486 SX microprocessor is also available in 
PQFP package. The PQFP version is compatible to 
the one in PGA. Additionally, it offers the following 
new and/or improved features: 


• Boundary Scan (JTAG) 
• Reduced Power Dissipation 
• Power Down Mode 
• Lower Input Capacitance 
• New Pin to Support the performance upgrade 
components 


Boundary Scan (JTAG) 


Boundary Scan is discussed in detail in Section 8.5. 


Lower Power Supply Current (Ice) 


See DC specifications. 


New Pin to Support the OverDrive Processor 


The PQFP version of the Intel486 SX CPU provides 
a direct interface with the OverDrive Processor. 
Thus, the interface logic between the Intel486 SX 
CPU and the Intel487 SX MCP, as shown in Figure 
6.6, is not required. Instead, the UP# pin of the 
Intel487 MCP is connected directly to the UP# pin 
of the Intel486 SX CPU. All other signals are con- 
nected together as shown in Figure 6.7. 


Power Down Mode 


Power Down Mode is a new mode in the Intel486 SX 
CPU in PQFP. It is initiated by the Intel OverDrive 
Processor, via the Upgrade Circuit. Upon sensing 
the presence of the Intel OverDrive Processor, the 
Intel486 SX microprocessor not only tri-states its 
output but also enters the "Power Down Mode", 
whereby it remains tri-stated and lowers its power 
consumption (see D.C. specifications). 


The Intel486 SX CPU samples the UP# pin to enter 
the Power Down Mode (PDM). The UP# pin of the 
Intel486 SX microprocessor is driven active by the 
UP# pin of Intel OverDrive Processor in the Per- 
formance Upgrade Circuit, as shown in Figure 6.7. 
More details on power down mode are given in Sec- 
tion 6.6.2.1. 


Lower Input Capacitance 


See DC specifications. 


The Intel486 SX microprocessor register set in- 
cludes all the registers contained in the Intel386 mi- 
croprocessor. 


The Intel OverDrive Processor register set In- 
cludes all the registers contained in the Inte1386 
microprocessor and the Intel387 Math CoProc- 
essor. 


The register set can be split into the following cate- 
gories: 


Base Architecture Registers 
General Purpose Registers 
Instruction Pointer 
Flags Register 
Segment Registers 


Systems Level Registers 
Control Registers 
System Address Registers 


Floating Point Registers 
Data Registers 


Tag Word 
Status Word 
Instruction and Data Pointers 


Control Word 


The base architecture and floating point registers 
are accessible by the applications program. 


I 


The floating point registers are accessible in the I 


. Intel OverDrive Processor. 
. 


The system level registers are only accessible at 
privilege level 0 and are used by the systems level 
program. The debug and test registers are also only 
accessible at privilege level O. 


Figure 2.1 shows the Intel486 SX microprocessor/ 
Intel OverDrive Processor base architecture regis- 
ters. The contents of these registers are task-specif- 
ic and are automatically loaded with a new context 
upon a task switch operation. 


General 
Purpose 
Registers 


31 
24123 
16 
15 
817 
0 


AH 
Ax 
AL 
EAX 


BH 
~x 
BL 
EBX 


CH 
ex 
CL 
ECX 


OH 
ox 
OL 
EOX 


Sl 
ESI 


01 
EOI 


BP 
EBP 


SP 
ESP 


Segment 
Registers 


15 
0 


CS 
Code segment 


55 
Stack 
Segment 


00) 
ES 
Data segments 
FS 


GS 


Instruction 
Pointer 


31 
16 
15 
0 


I 
I 
IP 
I EIP 


Flags Register 


I 
I 
FLAGS 
I EFLAGS 


The base architecture includes six directly accessi- 
ble descriptors, each specifying a segment up to 4 
Gbytes in size. The descriptors are indicated by the 
selector values placed in the Intel486 SX microproc- 
essor/Intel OverDrive Processor segment registers. 
Various selector values can be loaded as a program 
executes. 


The selectors are also task-specific, so the segment 
registers are automatically loaded with new context 
upon a task switch operation. 


2.1.1.1 General 
Purpose 
Registers 


The eight 32-bit general 
purpose registers are 


shown in Figure 2.1. These registers hold data or 
address quantities. The general purpose registers 
can support data operands of 1, 8, 16 and 32 bits, 
and bit fields of 1 to 32 bits. Address operands of 16 
and 32 bits are supported. The 32-bit registers are 
named EAX, EBX, ECX, EDX, ESI, EDI, EBP and 
ESP. 


The least significant 16 bits of the general purpose 
registers can be accessed separately by using the 2 
16-bit names of the registers AX, BX, CX, OX,SI, 01, 
BP and SP. The upper 16 bits of the register are not 
changed when the lower 16 bits are accessed sepa- 
rately. 


Finally 8-bit operations can individually access the 
lowest byte (bits 0-7) and the higher byte (bits 8- 
15) of the general purpose registers AX, BX, CX and 
OX.The lowest bytes are named AL, BL, CL and DL 
respectively. The higher bytes are named AH, BH, 
CH and DH respectively. The individual byte acces- 
sibility offers additional flexibility for data operations 
but is not used for effective address calculation. 


The instruction pointer, shown in Figure 2.1, is a 
32-bit register named EIP. EIP holds the offset of the 
next instruction to be executed. The offset is always 
relative to the base of the code segment (CS). The 
lower 16 bits (bits 0-15) of the EIP contain the 16-bit 
instruction pointer named IP, which is used for 16-bit 
addressing. 


2.1.1.3 Flags Register 


The 
flags 
register 
is 
a 
32-bit 
register 
named 


EFLAGS. The defined bits and bit fields within 
EFLAGS control certain operations and indicate 
status of the Intel486 SX microprocessor/Intel Over- 
Drive Processor. The lower 16 bits (bit 0-15) 
of 


EFLAGS contain the 16-bit register named FLAGS, 
which is most useful when executing 8086 and 
80286 code. EFLAGS is shown in Figure 2.2. 


ALIGNMENT CHECK 
VIRTUAL MODE 
RESUME FLAG 
NESTED TASK FLAG 
I/O 
PRIVILEGE LEVEL 
OVERFLOW 
DIRECnON 
FLAG 


INTERRUPT ENABLE 


NOTE: 
o indicates 
Intel Reserved: 
do not define; 
see Section 
2.1 .6. 


CARRY FLAG 
PARITY FLAG 
AUXILIARY 
CARRY 


ZERO FLAG 
SIGN FLAG 
TRAP FLAG 


EFLAGS bits 1, 3, 5, 15 and 19-31 are "undefined". 
When these bits are stored during interrupt process- 
ing or with a PUSHF instruction (push flags onto 
stack), a one is stored in bit 1 and zeros in bits 3, 5, 
15 and 19-31. 


The EFLAGS register in the Intel486 SX microproc- 
essor/Intel OverDrive Processor contain a new bit 
not previously defined. The new bit, AC, is defined in 
the upper 16 bits of the register and it enables faults 
on accesses to misaligned data. 
AC 
(Alignment Check, bit 18) 
The AC bit enables the generation of faults if a 
memory reference is to a misaligned address. 
Alignment faults are enabled when AC is set 
to 1. A misaligned address is a word access 


to an odd address, a dword access to an ad- 
dress that is not on a dword boundary, or an 
8-byte reference to an address that is not on a 
64-bit word boundary. See Section 7.1.6 for 
more information on operand alignment. 
Alignment faults are only generated by pro- 
grams running at privilege level 3. The AC bit 
setting is ignored at privilege levels 0, 1 and 2. 
Note that references to the descriptor tables 
(for selector loads), or the task state segment 
(TSS), are implicitly level 0 references even if 
the instructions causing the references are 
executed at level 3. Alignment faults are re- 
ported through interrupt 17, with an error code 
of O. Table 2.1 gives the alignment required 
for 
the 
Intel486 
SX 
microprocessor/Intel 


OverDrive Processor data types. 


Memory Access 
Alignment (Byte Boundary) 


Word 
2 
Dword 
4 
Single Precision Real 
4 
Double Precision Real 
8 
Extended Precision Real 
8 
Selector 
2 
48-Bit Segmented Pointer 
4 
32-Bit Flat Pointer 
4 
32-Bit Segmented Pointer 
2 
48-Bit "Pseudo-Descriptor" 
4 
FSTENV/FLDENV Save Area 
4/2 (On Operand Size) 
FSAVE/FRSTOR Save Area 
4/2 (On Operand Size) 
Bit String 
4 


IMPLEMENTATION 
NOTE: 
Several instructions on the Intel486 SX microproc- 
essor/Intel 
OverDrive Processor generate 
misa- 
ligned references, even if their memory address is 
aligned. For example, on the Intel486 SX micro- 
processor/Intel 
OverDrive Processor, the SGDT/ 
SIDT (store, global/interrupt 
descriptor table) in- 
struction reads/writes two bytes, and then reads/ 
writes four bytes from a "pseudo-descriptor" at the 
given address. The Intel486 SX microprocessor/In- 
tel OverDrive Processor will generate misaligned 
references unless the address is on a 2 mod 4 
boundary. The FSAVE and FRSTOR instructions 
(floating point save and restore state) will generate 
misaligned references for one-half of the register 
save/restore cycles. The Intel486 SX microproces- 
sor/Intel 
OverDrive Processor will not cause any 
AC faults if the effective address given in the in- 
struction has the proper alignment. 


VM 
(Virtual 8086 Mode, bit 17) 
The VM bit provides Virtual 8086 Mode within 
Protected Mode. If set while the Intel486 SX 
microprocessor/Intel OverDrive Processor is 
in Protected Mode, the Intel486 SX micro- 
processor/Intel 
OverDrive 
Processor 
will 
switch to Virtual 8086 operation, handling seg- 
ment loads as the 8086 does, but generating 
exception 13 faults on privileged opcodes. 
The VM bit can be set only in Protected Mode, 
by the IRET instruction (if current privilege lev- 
el = 0) and by task switches at any privilege 
level. The VM bit is unaffected by POPF. 
PUSHF always pushes a 0 in this bit, even if 
executing in Virtual 8086 Mode. The EFLAGS 
image pushed during interrupt processing or 
saved during task switches will contain a 1 in 
this bit if the interrupted code was executing 
as a Virtual 8086 Task. 
RF 
(Resume Flag, bit 16) 
The RF flag is used in conjunction with the 
debug register breakpoints. It is checked at 
instruction boundaries before breakpoint pro- 
cessing. When RF is set, it causes any debug 
fault to be ignored on the next instruction. RF 
is then automatically reset at the successful 
completion of every instruction (no faults are 
signalled) except the IRET instruction, the 
POPF instruction, (and JMP, CALL, and INT 
instructions causing a task switch). These in- 
structions set RF to the value specified by the 
memory image. For example, at the end of the 
breakpoint service routine, the IRET instruc- 
tion can pop an EFLAG image having the RF 
bit set and resume the program's execution at 
the breakpoint address without generating an- 
other breakpoint fault on the same location. 
NT 
(Nested Task, bit 14) 
This flag applies to Protected Mode. NT is set 
to indicate that the execution of this task is 


nested within another task. If set, it indicates 
that the current nested task's Task State Seg- 
ment (TSS) has a valid back link to the previ- 
ous task's TSS. This bit is set or reset by con- 
trol transfers to other tasks. The value of NT 
in EFLAGS is tested by the IRET instruction to 
determine whether to do an inter-task return 
or an intra-task return. A POPF or an IRET 
instruction will affect the setting of this bit ac- 
cording to the image popped, at any privilege 
level. 


10PL (Input/Output Privilege level, bits 12-13) 
This two-bit field applies to Protected Mode. 
10Pl indicates the numerically maximum CPl 
(current privilege level) value permitted to ex- 
ecute I/O instructions without generating an 
exception 13 fault or consulting the I/O Per- • 
mission Bitmap. It also indicates the maximum 
CPL value allowing alteration of the IF (INTR 
Enable Flag) bit when new values are popped 
into the EFLAG register. POPF and IRET in- 
struction can alter the 10PLfield when execut- 
ed at CPL = o. Task switches can always al- 
ter the 10PL field, when the new flag image is 
loaded from the incoming task's TSS. 
OF 
(Overflow Flag, bit 11) 
OF is set if the operation resulted in a signed 
overflow. Signed overflow occurs when the 
operation resulted in carry/borrow 
Into the 
sign bit (high-order bit) of the result but did not 
result in a carry/borrow out of the high-order 
bit, or vice-versa. For 8-, 16-, 32-bit opera- 
tions, OF is set according to overflow at bit 7, 
15, 31, respectively. 
DF 
(Direction Flag, bit 10) 
DF defines whether ESI and/or EDI registers 
postdecrement or postincrement during the 
string instructions. Postincrement occurs if DF 
is reset. Postdecrement occurs if DF is set. 


IF 
(INTR Enable Flag, bit 9) 
The IF flag, when set, allows recognition of 
external interrupts signalled on the INTR pin. 
When IF is reset, external interrupts signalled 
on the INTR are not recognized. 10PL indi- 
cates the maximum CPL value allowing altera- 
tion of the IF bit when new values are popped 
into EFLAGS or FLAGS. 
TF 
(Trap Enable Flag, bit 8) 
TF controls the generation of exception 1 trap 
when single-stepping through code. When TF 
is set, the Intel486 SX microprocessor/Intel 
OverDrive Processor generates an exception 
1 trap after the next instruction is executed. 
When TF is reset, exception 1 traps occur 
only as a function of the breakpoint addresses 
loaded into debug registers DRO-DR3. 


SF 
(Sign Flag, bit 7) 


SF is set if the high-order 
bit of the result 
is 
set, 
it is reset 
otherwise. 
For 8-, 16-, 32-bit 
operations, 
SF reflects 
the state of bit 7, 15, 
31 respectively. 


ZF 
(Zero Flag, bit 6) 


ZF is set if all bits of the result are O. Other- 
wise it is reset. 


AF 
(Auxiliary 
Carry Flag, bit 4) 


The Auxiliary 
Flag is used to simplify the addi- 
tion 
and 
subtraction 
of packed 
BCD quanti- 
ties. 
AF is set if the 
operation 
resulted 
in a 
carry out of bit 3 (addition) 
or a borrow 
into bit 
3 (subtraction). 
Otherwise 
AF is reset. 
AF is 
affected 
by carry out of, or borrow 
into bit 3 
only, regardless 
of overall 
operand 
length: 
8, 
16 or 32 bits. 


PF 
(Parity Flags, bit 2) 


PF is set if the low-order 
eight bits of the oper- 
ation contains 
an even number of "1's" 
(even 
parity). 
PF is reset 
if the low-order 
eight 
bits 
have odd parity. 
PF is a function 
of only the 
low-order 
eight 
bits, 
regardless 
of 
operand 
size. 


CF 
(Carry Flag, bit 0) 


CF is set if the operation 
resulted 
in a carry 
out of (addition), 
or a borrow into (subtraction) 
the high-order 
bit. Otherwise 
CF is reset. 
For 
8-, 16- or 32-bit operations, 
CF is set accord- 
ing to carry/borrow 
at bit 7,15 
or 31, respec- 
tively. 


SEGMENT 
REGISTERS 
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NOTE: 
In these 
descriptions, 
"set" 
means 
"set 
to 1," and 
"reset" 
means 
"reset 
to 0." 


2.1.1.4 
Segment 
Registers 


Six 16-bit segment 
registers 
hold segment 
selector 
values identifying 
the currently 
addressable 
memory 
segments. 
In protected 
mode, 
each 
segment 
may 
range 
in size from 
one by1e up to the entire 
linear 
and 
physical 
address 
space 
of 
the 
machine, 
4 
Gby1es (232 by1es). In real address 
mode, the maxi- 
mum segment 
size is fixed at 64 Kby1es (216 by1es). 


The 
six addressable 
segments 
are defined 
by the 
segment 
registers 
CS, SS, OS, ES, FS and GS. The 
selector 
in CS indicates 
the current 
code 
segment; 
the selector 
in SS indicates 
the current 
stack 
seg- 
ment; the selectors 
in OS, ES, FS and GS indicate 
the current 
data segments. 


2.1.1.5 Segment 
Descriptor 
Cache 
Registers 


The segment 
descriptor 
cache registers 
are not pro- 
grammer 
visible, 
yet it is very useful to understand 
their 
content. 
A 
programmer 
invisible 
descriptor 
cache register 
is associated 
with each programmer- 
visible 
segment 
register, 
as shown 
by Figure 
2.3. 


Each descriptor 
cache 
register 
holds 
a 32-bit 
base 
address, a 32-bit segment 
limit, and the other neces- 
sary segment 
attributes. 
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Figure 
2.3.lnteI486TM 
SX Microprocessor/Intel 
OverDrive 
Processor 
Segment 
Registers 
and Associated 
Descriptor 
Cache 
Registers 


When a selector value is loaded into a segment reg- 
ister, the associated descriptor cache register is au- 
tomatically updated with the correct information. In 
Real Address Mode, only the base address is updat- 
ed directly (by shifting the selector value four bits to 
the left), since the segment maximum limit and attri- 
butes are fixed in Real Mode. In Protected Mode, 
the base address, the limit, and the attributes are all 
updated per the contents of the segment descriptor 
indexed by the selector. 


Whenever a memory reference occurs, the segment 
descriptor cache register associated with the seg- 
ment being used is automatically involved with the 
memory reference. The 32-bit segment base ad- 
dress becomes a component of the linear address 
calculation, the 32-bit limit is used for the limit-check 
operation, and the attributes are checked against 
the type of memory reference requested. 


The system level registers, Figure 2.4, control opera- 
tion of the on-chip cache, the on-chip floating point 
unit (FPU) and the segmentation and paging mecha- 
nisms. 


I Only the Intel OverDrIve Processor 
contains 
the I 
on-chip floating point unit (FPU). 


The Intel486 SX microprocessor contains all the 
system level registers except the floating point unit. 
These registers are only accessible to programs run- 
ning at privilege level 0, the highest privilege level. 


The system level registers include three control reg- 
isters and four segmentation base registers. The 
three control registers are CRD,CR2 and CR3. CR1 
is reserved for future Intel processors. The four seg- 
mentation base registers are the Global Descriptor 
Table Register (GDTR), the Interrupt Descriptor Ta- 
ble Register (IDTR),the local Descriptor Table Reg- 
ister (lDTR) and the Task State Segment Register 
(TR). 
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2.1.2.1 Control 
Registers 


Control 
Register 
0 (CRO) 


CRO, shown 
in Figure 2.5, contains 
10 bits for con- 
trol and status 
purposes. 
Five of the bits defined 
in 
the 
Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
CRO are newly defined. 
The new bits are 
CD, NW, AM, WP and NE. The function 
of the bits in 
CRO can be categorized 
as follows: 


Intel486 
SX Microprocessor/Intel 
OverDrive 
Proces- 
sor Operating 
Modes: 
PG, PE (Table 2.2) 


On-Chip 
Cache Control 
Modes: 
CD, NW (Table 2.3) 


On·F1oating Point Unit Control: TS, EM, MP, NE 
(Table 2.4) 


Alignment 
Check 
Control: 
AM 


Supervisor 
Write Protect: 
WP 


Table 2.2. Intel486™ 
SX Microprocessor! 
Intel OverDrive™ 
Processor 
Operating 
Modes 


PG 
PE 
Mode 


0 
0 
REAL Mode. Exact 8086 semantics, 
with 32-bit extensions 
available 
with 
prefixes. 


0 
1 
Protected 
Mode. Exact 80286 
semantics, 
plus 32-bit extensions 
through 
both prefixes and "default" 
prefix setting associated 
with code 
segment 
descriptors. 
Also, a sub- 
mode is defined to support 
a virtual 
8086 within the context 
of the 
extended 
80286 protection 
model. 


1 
0 
UNDEFINED. 
Loading CRO with this 
combination 
of PG and PE bits will 
raise a GP fault with error code O. 


1 
1 
Paged Protected 
Mode. All the 
facilities 
of Protected 
mode, with 
paging enabled 
underneath 
segmentation. 


CD 
NW 
Operating 
Mode 


1 
1 
Cache fills disabled, 
write-through 
and 
invalidates 
disabled. 


1 
0 
Cache fills disabled, 
write-through 
and 
invalidates 
enabled. 


0 
1 
INVALID. 
If CRO is loaded with this 
configuration 
of bits, a GP fault with 
error code is raised. 


0 
0 
Cache fills enabled, 
write-through 
and 
invalidates 
enabled. 


Table 2.4. Intel OverDriveTli 
Processor 
Floating Point Ill8tructlon 
Control 


CROBIT 
Instruction 
Type 


EM 
TS 
MP 
FloatIng-Point 
Walt 


0 
0 
0 
Execute 
Execute 
0 
0 
1 
Execute 
Execute 
0 
1 
0 
Trap 7 
Execute 
0 
1 
1 
Trap 7 
Trap 7 
1 
0 
0 
Trap 7 
Execute 
1 
0 
1 
Trap 7 
Execute 
1 
1 
0 
Trap 7 
Execute 
1 
1 
1 
Trap 7 
Trap 7 


The low-order 
16 bits of CRO are also known 
as the 
Machine 
Status 
Word 
(MSW), 
for compatibility 
with 
the 80286 protected 
mode. LMSW and SMSW (load 
and 
store 
MSW) 
instructions 
are taken 
as special 
aliases of the load and store CRO operations, 
where 
only the low-order 
16 bits of CRO are involved. 
The 
LMSW 
and 
SMSW 
instructions 
in the 
Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
work 
in 
an identical 
fashion 
to the 
LMSW 
and 
SMSW 
in- 
structions 
in the 80286 (i.e., they only operate 
on the 
low-order 
16 bits of CRO and ignores 
the new bits). 
New 
Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
operating 
systems 
should 
use the 
MOV 
CRO, Reg instruction. 


The defined 
CRO bits are described 
below. 


PG 
(Paging 
Enable, 
bit 31) 


The PG bit is used to indicate 
whether 
paging is 
enabled 
(PG = 1) or disabled 
(PG = 0). See Ta- 
ble 2.2. 


CD 
(Cache 
Disable, 
bit 30) 


The CD bit is used to enable the on-chip 
cache. 
When 
CD = 1, the 
cache 
will 
not 
be filled 
on 
cache misses. When CD = 0, cache fills may be 
performed 
on misses. 
See Table 2.3. 


The state of the CD bit, the cache 
enable 
input 
pin (KEN #), and the relevant 
page cache 
dis- 
able 
(PCD) 
bit determine 
if a line read 
in re- 
sponse 
to a cache 
miss will be installed 
in the 
cache. 
A line is installed 
in the 
cache 
only 
if 
CD =0 and KEN # and PCD are both zero. The 
relevant 
PCD bit comes 
from 
either 
the 
page 
table entry, page directory 
entry or control 
reg- 
ister 3. Refer to Section 
5.6 for more details 
on 
page cacheability. 


CD is set to one after RESET. 


NW (Not Write-Through, 
bit 29) 


The 
NW 
bit 
enables 
on-chip 
cache 
write- 
throughs 
and write-invalidate 
cycles 
(NW = 0). 


When NW= 0, all writes, including cache hits, 
are sent out to the pins. Invalidate cycles are 
enabled when NW= O.During an invalidate cy- 
cle a line will be removed from the cache if the 
invalidate address hits in the cache. See Table 
2.3. 
When NW= 1, write-throughs and write-invali- 
date cycles are disabled. A write will not be sent 
to the pins if the write hits in the cache. With 
NW= 1 the only write cycles that reach the ex- 
ternal bus are cache misses. Write hits with 
NW= 1 will never update main memory. Invali- 
date cycles are ignored when NW= 1. 
AM (Alignment Mask, bit 18) 
The AM bit controls whether the alignment 
check (AC) bit in the flag register (EFLAGS) can 
allow an alignment fault. AM = 0 disables the 
AC bit. AM = 1 enables the AC bit. AM = 0 is the 
Intel386 microprocessor compatible mode. 
Intel386 microprocessor software may load in- 
correct data into the AC bit in the EFLAGS reg- 
ister. Setting AM= 0 will prevent AC faults from 
occurring before the Intel486 SX microproces- 
sor/Intel OverDrive Processor has created the 
AC interrupt service routine. 
WP (Write Protect, bit 16) 
WP protects read-only pages from supervisor 
write access. The Intel386 microprocessor al- 
lows a read-only page to be written from privi- 
lege levels 0-2. The Intel486 SX microproces- 
sor/Intel OverDrive Processor are compatible 
with the Intel386 microprocessor when WP=O. 
WP= 1 forces a fault on a write to a read-only 
page from any privilege level. Operating sys- 
tems with Copy-on-Write features can be sup- 
ported with the WP bit. Refer to Section 4.5.3 
for further details on use of the WP bit. 


NE (Numerics Exception, bit 5) 
For the Intel486 SX microprocessor, interrupt 7 
will be generated upon encountering any float- 
ing point instruction regardless of the value of 
the NE bit. It is recommended that NE= 1 for 
normal operation of the Intel486 SX microproc- 
essor. 


For the Intel OverDrive Proce~sor, the NE bit 
controls whether unmasked floating point ex- 
cepti()ns (UFPE) are handled through inter- 
rupt vector 16 (NE= 1) or through an external 
interrupt (NE=O). 
NE=O 
(default at reset) 
supports the DOS operating system error re- 
porting scheme from the 8087, 80287 and In- 
tel387 Math COProcessor. In DOS systems, 
math coprocessor errors are reported via ex- 
ternal interrupt vector 13. DOS uses interrupt 
vector 16 for an operating system call. Refer 
to Sections 6.2.13 and 7.2.14 for more infor- 
mation on floating point error reporting. 
For any UFPE the fk)ating point error output 
pin (FERR#) will be driven active. 
For NE= 0, the Intel OverDrive Processor 
works in conjunction with the ignore numeric 
error input (IGNNE#) and the FERR# output 
pins. When a UFPE occurs and the IGNNE'" 
input is inactive, the Intel OverDrive Proces- 
sor freezes immediately before executing the 
next floating point instruction. An external in- 
terrupt controller will supply an interrupt vec- 
tor when FERR'" is driven active. The UFPE 
is ignored if IGNNE# is active and floating 
point execution continues. 


NOTE: 
The freeze does not take place if the next 
instruction is one of the control instructions 
FNCLEX, 
FNINIT, 
FNSAVE, 
FNSTENV. 


FNSTCW, FNSTSW, FNSTSW AX, FNENI, 
FNDISI and FNSETPM. The freeze does oc- 
cur if the next instruction is WAIT. 
For NE= 1, any UFPE will result in a software 
interrupt 16, immediately before executing the 
next non-control floating point or WAIT in- 
struction. The ignore numeric error input 
(IGNNE#) signal will be ignored. 


• 


PE (Protection Enable, bit 0) 
The PE bit en~bles the segment based protec- 
tion mechanism if PE= 1 protection is enabled. 
When PE=O the Intel486 SX microprocessor/ 
Intel OverDrive Processor operates in REAL 
mode, with segment based protection disabled, 
and addresses formed as in an 8086. Refer to 
Table 2.2. 


For the Intel486 SX microprocessor, the following 
descriptions for EM, MP and TS apply. 
EM (Emulate Coprocessor, bit 2) 


EM bit should be set to one for the Intel486 SX 
microprocessor. This will cause the Intel486 SX 
microprocessor to trap via interrupt vector 7 
(Device Not Available) to a software exception 
handler whenever it encounters a floating point 
instruction. If EM bit is 0, the system will hang. 


MP (Monitor Coprocessor, bit 1) 
For normal operation of the Intel486 SX micro- 
processor it is required to set this bit as zero 
(MP= 0). The MP bit is used in conjunction with 
the TS bit to determine if WAIT instructions 
should trap. For MP=O, the value of TS is a 
don't care for these type of instructions (see 
Table 2.5a). 


TS 
(Task Switch, bit 3) 
The TS bit is set whenever a task switch opera- 
tion is performed. Execution of floating point in- 
structions with TS = 1 will cause a Device Not 
Available (DNA) fault 
(trap vector 
7). With 


MP=O, the value of TS bit is a don't care for 
the WAIT instructions Le.,these instructions will 
not generate trap 7 (see Table 2.5a). 
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Table 2.5a. Recommended Values of EM and MP Bits 
In CRORegister for Intel486TM SX Microprocessor 


CROBlt 
Instruction 
Type 


NE 
EM 
TS 
MP 
FP 
WAIT 
1 
1 
0 
0 
Trap 7 
Execute 


1 
1 
1 
0 
Trap 7 
Execute 


Table 2.5c shows the interpretation of different com- 
binations of the EM, T5 and MP bits for system with 
an 
Intel486 
5X 
microprocessor/Intel 
OverDrive 
Processor. 


For the Intel OverDrive Processor, Table 2.5b 
shows the recommended values for the EM, MP 
and NE bits. 


Table 2.5b. Recommended Values of the Floating Point 
Related Bits for a System with an Intel OverDrlve™ Processor 


CROBit 
Intel486 SX Microprocessor 
Intel OverDrive ProceSllOf' 


EM 
1 
0 


MP 
0 
1 
. 


NE 
1 
0, for DOS Systems 


1, for User-Defined Exception Handler 


Table 2.5c. Interpretation 
of Different Combinations of the EM, TS and MP Bits In an 
Intel486TM SX Microprocessor/Intel 
OverDrive™ Processor Based System 


CROBit 
Instruction Type 


EM 
TS 
MP 
Floating Point 
Walt 


0 
0 
0 
Execute 
Execute 


0 
0 
1 
Execute 
Execute 


0 
1 
0 
Exception 7 
Execute 


0 
1 
1 
Exception 7 
Exception 7 


1 
0 
0 
Exception 7 
Execute 


1 
0 
1 
Exception 7 
Execute 


1 
1 
0 
Exception 7 
Execute 


1 
1 
1 
Exception 7 
Exception 7 


NOTE: 
If MP'" 1 and TS= 1, the Intel OverDrive 
Processor will generate a trap 7 so that the 
system software can save the floating point 
status of the old task. 


CR1 is reserved for use in future Intel microproces- 
sors. 


CR2, shown in Figure 2.6, holds the 32-bit linear ad- 
dress that caused the last page fault detected. The 
error code pushed onto the page fault handler's 
stack when it is invoked provides additional status 
information on this page fault. 


All new CRObits added to the Intel386 and Intel486 
SX microprocessors/Intel OverDrive Processor, ex- 
cept for ET and NE, are upward compatible with the 
80286 because they are in register bits not defined 
in the 80286. For strict compatibility with the 80286, 
the load machine status word (LMSW) instruction is 
defined to not change the ET or NE bits. 
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Intel reserved: 
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See Section 
2.1.6. 


CR3, shown in Figure 2.6, contains the physical 
base address of the page directory table. The page 
directory is always page aligned (4 Kbyte-aligned). 
This alignment is enforced by only storing bits 
20-31 in CR3. 


In the Intel486 SX microprocessor/Intel OverDrive 
Processor, CR3 contains two new bits, page write- 
through (PWT) (bit 3) and page cache disable (PCD) 
(bit 4). The page table entry (PTE) and page directo- 
ry entry (PDE) also contain PWT and PCD bits. PWT 
and PCD control page cacheability. When a page is 
accessed in external memory, the state of PWT and 
PCD are driven out on the PWT and PCD pins. The 
source of PWT and PCDcan be CR3, the PTE or the 
PDE. PWT and PCDare sourced from CR3 when the 
PDE is being updated. When paging is disabled (PG 
= 0 in CRO),PCD and PWT are assumed to be 0, 
regardless of their state in CR3. 


A task switch through a task state segment (TSS) 
which changes the values in CR3, or an explicit load 
into CR3 with any value, will invalidate all cached 
page table entries in the translation lookaside buffer 
(TLB). 


The page directory base address in CR3 is a physi- 
cal address. The page directory can be paged out 
while its associated task is suspended, but the oper- 
ating system must ensure that the pa~ 
directory is 
resident in physical memory before the task is dis- 
patched. The entry in the TSS for CR3 has a physi- 
cal address, with no provision for a present bit. This 
means that the page directory for a task must be 
resident in physical memory. The CR3 image in a 
TSS must point to this area, before the task can be 
dispatched through its TSS. 


2.1.2.2 System 
Address 
Registers 


Four special registers are defined to reference the 
tables or segments supported by the 80286, In- 
tel386, Intel486 SX microprocessors/lntel OverDrive 
Processor protection model. These tables or seg- 
ments are: 


GDT (Global Descriptor Table) 
IDT (Interrupt Descriptor Table) 
LDT (Local Descriptor Table) 
TSS (Task State Segment) 


The addresses of these tables and segments are 
stored in special registers, the System Address and 
System Segment Registers, illustrated in Figure 2.4. 
These registers are named GDTR, IDTR, LDTR and 
TR respectively. Section 4, Protected Mode Archi- 
tecture, describes the use of these registers. 


System 
Address 
Registers: 
GOTR and IOTR 


The GDTR and IDTR hold the 32·bit linear base ad- 
dress and 16-bit limit of the GDT and IDT, respec- 
tively. 


Since the GDT and IDT segments are global to all 
tasks in the system, the GDT and IDT are defined by 
32-bit linear addresses (subject to page translation if 
paging is enabled) and 16-bit limit values. 


System 
Segment 
Registers: 
LOTR and TR 


The LDTR and TR hold the 16-bit selector for the 
LDT descriptor and the TSS descriptor, respectively. 


Since the LDT and TSS segments are task specific 
segments, the LDT and TSS are defined by selector 
values stored in the system segment registers. 


NOTE: 
A programmer-invisible segment descriptor register 
is a~sociated with each system segment register. 
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2.1.3 FLOAnNG 
POINT REGISTERS 


FIgUre 2.7 shows the floating point register set 
The on-chIp FPU contains eight data registers. a 
tag word, a control register, a status register, an 
lnstructIon pointer and a data pointer. 
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figure 2.7. floating 
Point Registers 


The operation of the Intel OverDrive Processor on- 
chip floating point unit is exactly the same as the 
Intel387 Math CoProcessor and Intel486 OX mi- 
croprocessor. Software written for the Intel387 
Math CoProcessor will run on the on-chip floating 
point unit (FPU) without any modifications. 


2.1.3.1 Data Registers 


Floating point computations use the Intel Over- 
Drive Processor FPU data registers. These eight 
80-bit registers provide the equivalent capacity of 
twenty 32-bit registers. Each of the eight data 


registers is divided into "fields" corresponding to 
the FPU's extended-precision data type. 


The FPU's register set can be accessed either as 
a stack. with Instructions operating on the top one 
or two stack elements, or as a fixed register set, 
with instructions operating on explicitly designated 
registers. The TOP field in the status word identi· 
fIes the current top-of·stack register. A "push" 0p- 
eration decrements TOP by one and loads a value 
into the new top register. A "pop" operation stores 
the value from the current top register and then 
increments TOP by one. Uke other Intel OverDrive 
Processor stacks in memory, the FPU register 
stack grows "down" toward lower-addressed reg- 
isters. 


Instructions may address the data registers either 
Implicitly or expIlcItly. Many instructions operate on 
the register at the TOP of the stack. These instruc- 
tions impIicltIy address the register at which TOP 
points. Other Instructions allow the programmer to 
explicitly spectty which register to use. This explicit 
register addressing is also relative to TOP. 


2.1.3.2 Teg Word 


The tag word marks the content of each numeric 
data register, as shown in Figure 2.8. Each two-bit 
tag represents one of the eight data registers. The 
principal function of the tag word is to optimize the 
FPUs performance and stack handling by making it 
possible to distinguish between empty and non- 
empty register locations. It also enables exception 
handlers to check the contents of a stack location 
without the need to perform complex decoding of 
the actual data. 


2.1.3.3 Statu. Word 


The 16-bit status word reflects the overall state of 
the FPU. The status word is shown in Figure 2.9 
and is located in the status register. 


15 
0 


TAG(7) 
I 
TAG (6) I 
TAG (5) I 
TAG (4) I 
TAG (3) I 
TAG (2) I 
TAG (1) I 
TAG (0) 


NOTE: 
The index i of tag(l) 
Is not 
top-relative. 
A program 
typically 
uses the "top" 
field of Status 
Word 
to determine 
which 
tag(1) field 
refers 
to logical 
top of stack. 
TAG VALUES: 
00 = Valid 
01 = Zero 
10 = QNaN, 
SNaN, 
Infinity, 
DenormaI 
and Unsupported 
Formats 
11 = Empty 


• 


intel~ 


tun 


TOP OF STACK POINTER 


CONOmON 
CODE 


ERROR SUMMARY 
STATUS 


STACK FLAG 


EXCEPTION 
FLAGS: 


PRECISION 


UNDERFLOW 


OVERFLOW 


ZERO DIVIDE 


DENORlolAlllED 
OPERAND - 


INVAllO 
OPERATION 


ES is set if any unmasked exception bit is set; cleared otherwise. 
See Table 2.6 for Interpretation of condition code. 
TOP values: 
000 = Register 0 is Top Of Stack 
001 - 
Register 1 is Top of Stsck 
••• 
111 = Register 7 Is Top of Stsck 
For definitions of exceptions, re1erto the section entitled 
"Exception Handling". 


The B bit (Busy, bit 15) is included for 8087 com· 
patibility. The B bit reflects the contents of the ES 
bit (bit 7 of the status word). 


Bits 13-11 (TOP) point to the FPU register that is 
the current top-of-stack. 


The tour numeric condition code bits, CO-C3, are 
similar to the flags in EFLAGS. Instructions that 
perform arithmetic operations uPdateCO-C3 to re- 
flect the outcome. The effects of these instructions 
on the condition codes are summarized in Tables 
2.6 through 2.9. 


Instruction 
COtS) 
I 
C3(Z) 
C1 (A) 
C2(C) 


FPREM. FPREM1 
Three least signiflcant 
bits 
Reduction 
(see Table 2.3) 
of quotient 
0= 
complete 
Q2 
QO 
01 
orO/U# 
1 = incomplete 


FCOM, FCOMP, 
FCOMPP. 
FTST, 
Result of comparison 
Zero 
Operand 
is not 
FUCOM, FUCOMP, 
(see Table 2.8) 
orO/U# 
comparable 
FUCOMPP, 
FICOM, 
(Table 2.8) 
FICOMP 


FXAM 
Operand 
class 
Sign 
Operand 
class 
(see Table 2.9) 
orO/U# 
(Table 2.9) 


FCHS, FABS, FXCH, 
FINCTOP, 
FDECTOP, 
Constant 
loads, 
UNDEFINED 
Zero 
UNDEFINED 
FXTRACT, 
FLD, 
orO/U# 
FILD,FBLD, 
FSTP (ext real) 


FIST, FBSTP, 
FRNDINT, 
FST, 
FSTP, FADD, FMUL, 
FDIV, FDIVR, 
UNDEFINED 
Roundup 
UNDEFINED 
FSUB, FSUBR, 
orO/U# 
FSCALE, 
FSORT, 
FPATAN, 
F2XM1, 
FYL2)(, FYL2XP1 


FPTAN,FSIN 
Roundup 
Reduction 
FCOS, FSINCOS 
UNDEFINED 
orO/U#, 
0= 
complete 
undefined 
1 = incomplete 
ifC2 
= 1 


FLDENV, 
FRSTOR 
Each bit loaded from memory 


FINIT 
Clears these bits 


FLOCW, FSTENV, 
FSTCW, FSTSW, 
UNDEFINED 
FCLE)(, FSAVE 


When 
both 
IE and SF bits of status 
word 
are set, indicating 
a stack 
exception, 
this 
bit 
distinguishes 
between 
stack overflow 
(C1 = 1) and underflow 
(C1 = 0). 


If FPREM 
or FPREM1 
produces 
a remainder 
that 
is less than the modulus, 
reduction 
is 
complete. 
When 
reduction 
is incomplete 
the 
value 
at the 
top 
of the 
stack 
is a partial 
remainder, 
which can be used as input to further 
reduction. 
For FPTAN, 
FSIN, FCOS, and 
FSINCOS, 
the reduction 
bit is set if the operand 
at the top of the stack 
IS too large. In this 
case the original 
operand 
remains 
at the top of the stack. 


When the PE bit of the status word is set, this bit indicates 
whether 
the last rounding 
in the 
instruction 
was upward. 


Do not rely on finding any specific 
value in these 
bits. 


• 


Condition 
Code 
Interpretation 
after FPREM and FPREM1 
C2 
C3 
C1 
CO 


Incomplete 
Reduction: 
1 
X 
X 
X 
further interaction 
required 
for complete 
reduction 


01 
QO 
02 
OMOD8 


0 
0 
0 
0 
0 
1 
0 
1 
Complete 
Reduction: 
1 
0 
0 
2 
0 
1 
1 
0 
3 
CO. C3, C1 contain three least 


0 
0 
1 
4 
significant 
bits of quotient 


0 
1 
1 
5 
1 
0 
1 
6 
1 
1 
1 
7 


Order 
C3 
C2 
CO 


TOP> 
Operand 
0 
0 
0 
TOP < Operand 
0 
0 
1 
TOP = Operand 
1 
0 
0 
Unordered 
1 
1 
1 


C3 
C2 
C1 
CO 
Value at TOP 


0 
0 
0 
0 
+ Unsupported 
0 
0 
0 
1 
+ NaN 
0 
0 
1 
0 
- 
Unsupported 
0 
0 
1 
1 
- 
NaN 
0 
1 
0 
0 
+ Normal 
0 
1 
0 
1 
+ Infinity 
0 
1 
1 
0 
- 
Normal 
0 
1 
1 
1 
- 
Infinity 
1 
0 
0 
0 
+0 
1 
0 
0 
1 
+ Empty 
1 
0 
1 
0 
-0 
1 
0 
1 
1 
- 
Empty 
1 
1 
0 
0 
+ Denormal 
1 
1 
1 
0 
- 
Denormal 


Bit 7 is the error summary (ES) status bit. The ES 
bit is set if any unmasked exception bit (bits 0-5 in 
the status word) is set; ES is clear otherwise. The 
FERR/I 
(floating point error) signal is asserted 
when ES is set. 


Bit 6 is the stack flag (SF). This bit is used to distin· 
guish invalid operations due to stack overflow or 
underflow. When SF is set. bit 9 (C1) distinguishes 
between stack overflow (C1=1) and underflow 
(C1 =0). 


Table 2.10 shows the six exception flags in bits 0- 
5 of the status word. Bits 0-5 are set to indicate 
that the FPU has detected an exception while exe· 
cuting an instruction. 


The six exception flags in the status word can be 
individually masked by mask bits in the FPU con· 
trol word. Table 2.10 lists the exception conditions. 
and their causes in order of precedence. Table 
2.10 also shows the action taken by the FPU if the 
corresponding exception flag is masked. 


An exception that is not masked by the control 
word will cause three things to happen: the corre· 
sponding exception flag in the status word will be 
set. the ES bit in the status word will be set and the 
FERR/I output signal will be asserted. When the 
Intel OverDrive Processor attempts to execute an- 
other floating point or WAIT instruction, exception 
16 occurs or an external interrupt happens if the 
NE=1 
in 
control 
register 
O. 
The 
exception 


condition must be resolved via an interrupt service 
routine. The FPU saves the address of the floating 
point instruction that caused the exception and the 
address of any memory operand required by that 
instruction in the instruction and data pointers (see 
section 2.1.3.4). 


Note that when a new value is loaded into the 
status word by the FLDENV (load environment) or 
FRSTOR (restore state) instruction. the value of 
ES (bit 7) and its reflection in the B bit (bit 15) are 
not derived from the values loaded from memory. 
The values of ES and B are dependent upon the 
values of the exception flags in the status word 
and their corresponding masks in the control word. 
If ES is set in such a case. the FERR/I output of 
the Intel OverDrive Processor is activated immedi· 
• 
ately. 


2.1.3.4 Instruction 
and Data Pointers 


Because the FPU operates in parallel with the ALU 
(in the Intel OverDrive Processor the arithmetic 
and logic unit (ALU) consists of the base architec- 
ture registers). any errors detected by the FPU 
may be reported after the ALU has executed the 
floating point instruction that caused it. To allow 
identification of the failing numeric instruction, the 
Intel OverDrive Processor contains two pointer 
registers that supply the address of the failing nu· 
meric instruction and the address of its numeric 
memory operand (if appropriate). 


Exception 
Cause 
Default Action 
(If exception Is masked) 


Invalid 
Operation on a signaling NaN. unsupported format, 
Result is a quiet NaN. integer 
Operation 
indeterminate form (0' 00, 0/0. (+ 00l + (- 
00l.etc.), or 
indefinite, or BCD indefinite 
stack overflow/underflow (SF is also set). 


Denormalized 
At least one of the operands is denormalized, Le., it has 
Normal processing 
Operand 
the smallest exponent but a nonzero significand. 
continues 


Zero Divisor 
The divisor is zero while the dividend is a noninfinite, 
Result is 00 
nonzero number. 


Overflow 
The result is too large in magnitude to fit in the specified 
Result is largest finite value 
format. 
or 00 


Underflow 
The true result is nonzero but too small to be 
Result is denormalized or 
represented in the specified format, and. if underflow 
zero 
exception is masked, denormalization causes loss of 
accuracy. 


Inexact 
The true result is not exactly representable in the 
Normal processing 
Result 
specified format (e.g., 1/3); the result is rounded 
continues 
(Precision) 
according to the rounding mode. 


The instruetiorl and data pointers are provided for 
user-written error handlers. These registers are ac- 
cessed 
by 
the 
FlDENV 
(load 
environment), 
FSTENV (store environment), FSAVE (save state) 
and FRSTOR (restore state) instructions. Whenev- 
er the Intel OverDrive Processor decodes a new 
floating point instruction, it saves the instruction 
(including any prefixes that may be present), the 
address of the operand (if present) and the op- 
code. 


The instruction and data pointers appear in one of 
four formats depending on the operating moda of 
the Intel OverDrive Processor (protected mode or 
real-address 
mode) 
and 
depending 
on 
the 


operand-size attribute in effect (32-bit operand or 
16-bit operand). When the Intel OverDrive Proces- 
sor is in the virtual-86 moda. the real address 
mode formats are used. The four formats are 
shown in Figures 2.10-2.13. The floating point in- 
structions 
FLDENV, 
FSTENV. 
FSAVE 
and 
FRSTOR are used to transfer these values to and 
from memory. Note that the value of the data 
pointer is undafined if the prior floating point in- 
struction did not have a memory operand. 


NOTE: 
The operand size attribute is the D bit in a seg- 
ment descriptor. 


32·BIT PROTECTED 
MODE FORMAT 
15 


RESERVED 
CONTROL 
WORD 


RESERVED 
STATUS 
WORD 


RESERVED 
TAG WORD 


IPOFFSET 


0ססoo I 
OPCODE 
10..0 
CSSELECTOR 


DATA OPERAND 
OFFSET 
. 


RESERVED 
OPERAND 
SELECTOR 
. 


32·BIT REAL-ADDRESS 
MODE FORMAT 
15 


RESERVED 
CONTROL 
WORD 


RESERVED 
STATUS 
WORD 


RESERVED 
TAG WORD 


RESERVED 
INSTRUCTION 
POINTER 
15..0 


0000 I 
INSTRUCTION 
POINTER 
31 ..16 
I 
0 I 
OPCODE 10.. 
0 


RESERVED 
OPERAND 
POINTER 
15 ..0 


0000 I 
OPERAND 
POINTER 
31..16 
I 
0000 
00000000 


16-BIT PROTECTEDMODE FORMAT 
15 
7 
0 
. 


CONTROL WORD 


STATUS WORD 


TAG WORD 


IPOFFSET 


CSSELECTOR 


OPERAND OFFSET 


OPERAND SELECTOR 


Figure 
2.12. Protected 
Mode 
FPU 
Instruction 
and Data Polnter 
Image 
In Memory, 
16-81t Format 


16-BIT REAL·ADDRESS MODE AND 
VIRTUAL-8086 MODE FORMAT 
15 
7 
0 


0 
CONTROL WORD 
0 


2 
STATUS WORD 
2 


4 
TAG WORD 
4 


6 
INSTRUCTION POINTER 15••0 
6 


8 
IP19.16 
0 
OPCODE10 •.0 
8 


A 
OPERAND POINTER 15•.0 
A 


DP 19.16 
0 0 0 0 0 0 0 0 0 0 
0 
0 
C 
C 


Figure 
2.13. Real Mode 
FPU 
Instruction 
and Data Pointer 
Image 
In Memory, 
16-81t Format 


The FPU provides 
several processing 
options 
that are selected 
by loading a control 
word from memory 
into 
the control 
register. 
Figure 2.14 shows 
the format 
and encoding 
of fields in the control 
word. 


RESERVED 


RESERVED" 
ROUNDING CONTROL 


PRECISION CONTROL 


EXCEPTION MASKS: 


PRECISION 


UNDERfLOW 
OVERfLOW 


ZERO DIVIDE 
DENORMALIZED OPERAND 


INVALID OPERATION 


Precl8lon Control 
00-24 
bits (single precision) 
01-(reserved) 
1()-53 bits (double precision) 
11-64 
bits (extended precision) 


• "0" AF'TER RESET OR F'INIT; 
CHANGEABLE UPON LOADING THE 
CONTROL WORD (CW). PROGRAMS 
MUST IGNORE THIS BIT. 


Rounding Control 
oo-Round 
to nearest or even 
01-Aound 
down (toward - 00) 
1O-Aound up (toward + 00) 
11-ehop 
(truncate toward zero) 


Flgur. 
2.14. FPU Control 
Word 


The low-order byte of the FPU control word con- 
figures the FPU error and exception masking. 
Bits 0-5 
of the control word contain individual 
masks for each of the six exceptions that the 
FPU recognizes. 


The high-order byte of the control word config- 
ures the FPU operating mode, including precision 
and rounding. 
RC (Rounding Control, bits 10-11) 


The RC bits provide for directed rounding 
and true chop, as well as the unbiased round 
to nearest even mode specified in the IEEE 
standard. 
Rounding 
control 
affects 
only 
those instructions that perform rounding at 
the end of the operation (and thus can gen- 
erate a precision exception); namely, FST, 
FSTP, FIST, all arithmetic instructions (ex- 
cept FPREM, FPREM1, FXTRACT, FABS 
and FCHS), and all transcendental instruc- 
tions. 


PC (Precision Control, bits 8-9) 
The PC bits can be used to set the FPU in- 
ternal operating precision of the significand 
at less than the default of 64 bits (extended 
precision). This can be useful in providing 
compatibility with early generation arithmetic 
processors of smaller precision. PC affects 
only the instructions ADD, SUB, DIV, MUL, 
and SORT. For all other instructions, either 
the precision is determined by the opcode or 
extended precision is used. 


2.1.4.1 Debug Registers 


The six programmer accessible debug registers, Fig- 
ure 2.15, provide on-chip support for debugging. De- 
bug registers DRO-3 specify the four linear break- 
points. The Debug control register DR?, is used to 
set the breakpoints and the Debug Status Register, 
DR6, displays the current state of the breakpoints. 
The use of the Debug registers is described in Sec- 
tion 9. 


LINEAR 
BREAKPOINT 
ADDRESS 
0 


LINEAR 
BREAKPOINT 
ADDRESS 
1 


LINEAR 
BREAKPOINT 
ADDRESS 
2 


LINEAR 
BREAKPOINT 
ADDRESS 
3 


Intel Reserved 
Do Not Define 


Intel Reserved 
Do Not Define 


BREAKPOINT 
STATUS 


BREAKPOINT 
CONTROL 


Test Registers 


CACHE TEST DATA 


CACHE TEST STATUS 


CACHE TEST CONTROL 


TLB TEST CONTROL 


TLB TEST STATUS 


TLB = Translation 
Lookaside 
Buffer 


DRO 


DR1 


DR2 


DR3 


DR4 


DR5 


DR6 


DR7 


TR3 


TR4 


TR5 


TR6 


TR7 


2.1.4.2 Test Registers 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor contain five test registers. The test regis- 
ters are shown in Figure 2.15. TR6 and TR? are 
used to control the testing of the translation look- 
aside buffer. TR3, TR4 and TR5 are used for testing 
the on-chip cache: The use of the test registers is 
discussed in Section 8. 


There are a few differences regarding the accessibil- 
ity of the registers in Real and Protected Mode. Ta- 
ble 2.11 summarizes these differences. See Section 
4, Protected Mode Architecture, for further details. 


Use in 
Use in 
Use In 


Register 
Real Mode 
Protected Mode 
Virtual 8086 Mode 


Load 
Store 
Load 
Store 
Load 
Store 


General Registers 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


Segment Register 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


Flag Register 
Yes 
Yes 
Yes 
Yes 
IOPL 
IOPL" 


Control Registers 
Yes 
Yes 
PL = 0 
PL = 0 
No 
Yes 


GDTR 
Yes 
Yes 
PL = 0 
Yes 
No 
Yes 


IDTR 
Yes 
Yes 
PL = 0 
Yes 
No 
Yes 


LDTR 
No 
No 
PL = 0 
Yes 
No 
No 


TR 
No 
No 
PL = 0 
Yes 
No 
No 


FPU Data Registers 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


FPU Control Registers 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


FPU Status Registers 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


FPU Instruction Pointer 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


FPU Data Pointer 
Yes 
Yes 
Yes 
Yes 
Yes 
Yes 


Debug Registers 
Yes 
Yes 
PL = 0 
PL = 0 
No 
No 


Test Registers 
Yes 
Yes 
PL = 0 
PL = 0 
No 
No 


NOTES: 
PL = 0: The registers 
can be accessed 
only when 
the current 
privilege 
level is zero. 
"IOPL: 
The PUSHF 
and POPF instructions 
are made 
I/O 
Privilege 
Level sensitive 
in Virtual 
86 Mode. 


2.1.6 COMPATIBILITY 


VERY IMPORTANT NOTE: 
COMPATIBILITY WITH FUTURE PROCESSORS 


3. Do not depend on the ability to retain infor- 
mation written into any undefined bits. 
4. When loading registers always load the unde- 
fined bits as zeros. 
5. However, 
registers 
which 
have been previ- 
ously stored may be reloaded without 
mask- 
Ing. 


In the preceding register descriptions, 
note cer- 
tain Intel486 SX microprocessor/Intel 
OverDrl- 
veTU 
Processor 
register 
bits are Intel reserved. 
When reserved bits are called out, treat them as 
fully undefined. 
This Is essential for your soft- 
ware compatibility 
with future 
processorsl 
Fol- 
low the guidelines below: 
1. Do not depend on the states of any undefined 
bits when testing the values of defined regis- 
ter bits. Mask them out when testing. 
2. Do not depend on the states of any undefined 
bits when storing them to memory or another 
register. 


Depending upon the values of undefined' regis- 
ter bits will make your software dependent upon 
the unspecified 
Intel486 SX microprocessor/In- 
tel OverDrive Processor 
handling of these bits. 
Depending 
on undefined 
values 
risks 
making 
your software 
incompatible 
with future 
proces- 
sors that define usages for the Intel486 SX mi- 
croprocessor-undefined 
bits and Intel OverDrive 
Processor bits. AVOID ANY SOFTWARE DEPEN- 
DENCE UPON THE STATE OF UNDEFINED In- 
tel486 SX MICROPROCESSORllntel OverDrlve™ 
PROCESSOR REGISTER BITS. 


• 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor instruction set can be divided into 11 cat- 
egories of operations: 


Data Transfer 
Arithmetic 
Shift/Rotate 
String Manipulation 
Bit Manipulation 
Control Transfer 
High Level Language Support 
Operating System Support 
Processor Control 


Floating Point 
Floating Point Control 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor instructions are listed in Section 10. 


Only the Intel OverDrive Processor has floating 
point instructions. The Intel486 SX microproces- 
sor does not have floating point Instructions. 
Note that all floating point unit Instruction mne- 
monics begin with an F. 


All 
Intel486 
SX 
microprocessor/Intel 
OverDrive 


Processor instructions operate on either 0, 1, 2 or 3 
operands; where an operand resides in a register, in 
the instruction itself or in memory. Most zero oper- 
and instructions (e.g., CLI, STI) take only one byte. 
One operand instructions generally are two bytes 
long. The average instruction is 3.2 bytes long. 
Since the Intel486 SX microprocessor/Intel 
Over- 
Drive Processor have a 32-byte instruction queue, 
an average of 10 instructions will be prefetched. The 
use of two operands permits the following types of 
common instructions: 


Register to Register 
Memory to Register 
Memory to Memory 
Immediate to Register 
Register to Memory 
Immediate to Memory 


The operands can be either 8, 16, or 32 bits long. As 
a general rule, when executing 32-bit code, oper- 
ands are 8 or 32 bits; when executing existing 80286 
or 8086 code (16-bit code), operands are 8 or 16 
bits. Prefixes can be added to all instructions which 
override the default length of the operands (I.e., use 
32-bit operands for 16-bit code, or 16-bit operands 
for 32-bit code). 


Introduction 


Memory on the Intel486 SX Microprocessor/lntel 
OverDrive Processor is divided up into 8-bit quanti- 
ties (bytes), 16-bit quantities (words), and 32-bit 
quantities (dwords). Words are stored in two consec- 
utive bytes in memory with the low-order byte at the 
lowest address, the high order byte at the high ad- 
dress. Dwords are stored in four consecutive bytes 
in memory with the low-order byte at the lowest ad- 
dress, the high-order byte at the highest address. 
The address of a word or dword is the byte address 
of the low-order byte. 


In addition to these basic data types, the Intel486 SX 
microprocessor/I ntel OverDrive Processor support 
two larger units of memory: pages and segments. 
Memory can be divided up into one or more variable 
length segments, which can be swapped to disk or 
shared between programs. Memory can also be or- 
ganized into one or more 4 Kbyte pages. Finally, 
both segmentation and paging can be combined, 
gaining the advantages of both systems. The In- 
tel486 SX microprocessor/Intel 
OverDrive Proces- 


sor support both pages and segments in order to 
provide maximum flexibility to the system designer. 
Segmentation and paging are complementary. Seg- 
mentation is useful for organizing memory in logical 
modules, and as such is a tool for the application 
programmer, while pages are useful for the system 
programmer for managing the physical memory of a 
system. 


The Intel486 SX microprocessor/Intel 
OverDrive 


Processor have three distinct address spaces: logi- 
cal, linear, and physical. 
A logical 
address (also 


known as a virtual address) consists of a selector 
and an offset. A selector is the contents of a seg- 
ment register. An offset is formed by summing all of 
the addressing components (BASE, INDEX, DIS- 
PLACEMENT) discussed in Section 2.5.3 Memory 
Addressing 
Modes into an effective address. Since 


each task on the Intel486 SX microprocessor/Intel 
OverDrive Processor have a maximum of 16K (214 
- 1) selectors, and offsets can be 4 gigabytes, (232 
bits) this gives a total of 246 bits or 64 terabytes of 
logical 
address space per task. The programmer 


sees this virtual address space. 


The segmentation unit translates the logical 
ad- 


dress space into a 32-bit linear address space. If the 
paging unit is not enabled then the 32-bit linear ad- 
dress corresponds to the physical 
address. The 


paging unit translates the linear address space into 
the physical address space. The physical 
address 


is what appears on the address pins. 


32 
EFFECTIVE 


ADDRESS 
LOGICAL 
OR 
SEGWENTATION 


VIRTUAL 
ADDRESS 
UNIT 
13 


32 


LINEAR 


ADDRESS 


32 


PHYSICAL 


ADORESS 


The primary difference between Real Mode and Pro- 
tected Mode is how the segmentation unit performs 
the translation of the logical address into the linear 
address. In Real Mode, the segmentation unit shifts 
the selector left four bits and adds the result 0 the 
offset to form the linear address. While in Protected 
Mode every selector has a linear base address as- 
sociated with it. The linear base address is stored in 
one of two operating system tables (Le., the Local 
Descriptor Table or Global Descriptor Table). The 
selector's linear base address is added to the offset 
to form the final linear address. 


Figure 2.16 shows the relationship between the vari- 
ous address spaces. 


The main data structure used to organize memory is 
the segment. On the Intel486 SX microprocessor/ 
Intel OverDrive Processor, segments are variable 
sized blocks of linear addresses which have certain 
attributes associated with them. There are two main 
types of segments: code and data, the segments are 
of variable size and can be as small as 1 byte or as 
large as 4 gigabytes (232 bytes). 


In order to provide compact instruction encoding, 
and 
increase 
Intel486 
SX 
microprocessor/Intel 
OverDrive Processor performance, instructions do 
not need to explicitly specify which segment register 
is used. A default segment register is automatically 
chosen according to the rules of Table 2.12 (Seg- 
ment Register Selection Rules). In general, data ref- 
erences use the selector contained in the DS regis- 
ter; Stack references use the SS register and In- 


struction fetches use the CS register. The contents 
of the Instruction Pointer provide the offset. Special 
segment override prefixes allow the explicit use of a 
given segment register, and override the implicit 
rules listed in Table 2.12. The override prefixes also 
allow the use of the ES, FS and GS segment regis- 
ters. 


There are no restrictions regarding the overlapping 
of the base addresses of any segments. Thus, all 6 
segments could have the base address set to zero 
and create a system with a four gigabyte linear ad- 
dress space. This creates a system where the virtual 
address space is the same as the linear address 
space. Further details of segmentation are dis- 
cussed in Section 4.1. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor 
have 
two 
distinct 
physical 
address 
spaces: Memory and I/O. Generally, peripherals are 
placed in I/O space although the Intel486 SX micro- 
processor/Intel OverDrive Processor also supports 
memory-mapped peripherals. The I/O space con- 
sists of 64 Kbytes, it can be divided into 64K 8-bit 
ports, 32K 16-bit ports, or 16K 32-bit ports, or any 
combination of ports which add up to less than 
64 Kbytes. The 64K I/O address space refers to 
physical memory rather than linear address since 
I/O instructions do not go through the segmentation 
or paging hardware. The M/IO# pin acts as an addi- 
tional address line thus allowing the system designer 
to easily determine which address space the proces- 
sor is accessing. 


• 


intel~ 


Type of 
Implied (Default) 
Segment 
Override 
Memory 
Reference 
Segment 
Use 
Prefixes 
Possible 


Code Fetch 
CS 
None 


Destination of PUSH, PUSHF, INT, 
SS 
None 
CALL, PUSHA Instructions 


Source of POP, POPA, POPF, 
SS 
None 
IRET, RET instructions 


Destination of STOS, MOVS, REP 
ES 
None 
STOS, REP MOVS Instructions 
(01 is Base Register) 


Other Data References, with 
Effective Address Using Base 
Register of: 


[EAX] 
OS 
[EBX] 
OS 
[ECX] 
OS 
[EDX] 
OS 
All 
[ESI] 
OS 
[EDI] 
OS 
[EBP] 
SS 
[ESP] 
SS 


The I/O ports are accessed via the IN and OUT I/O 
instructions, with the port address supplied as an 
immediate 8-bit constant in the instruction or in the 
OXregister. All 8- and 16-bit port addresses are zero 
extended on the upper address lines. The I/O in- 
structions cause the M/IO# 
pin to be driven low. 


I/O port addresses OOF8Hthrough OOFFHare re- 
served for use by Intel. 


The Intel486 SX microprocessor/lntel 
OverDrive 
Processor provide a total of 11 addressing modes 
for instructions to specify operands. The addressing 
modes are optimized to allow the efficient execution 
of high level languages such as C and FORTRAN, 
and they cover the vast majority of data references 
needed by high-level languages. 


Two of the addressing modes provide for instruc- 
tions that operate on register or immediate oper- 
ands: 


Register 
Operand 
Mode: The operand is located in 
one of the 8-, 16- or 32-bit general registers. 


Immediate 
Operand 
Mode: The operand is includ- 
ed in the instruction as part of the opcode. 


The remaining 9 modes provide a mechanism for 
specifying the effective address of an operand. The 
linear address consists of two components: the seg- 
ment base address and an effective address. The 
effective address is calculated by using combina- 
tions of the following four address elements: 


DISPLACEMENT: 
An 8-, or 32-bit immediate value, 
following the instruction. 


BASE: The contents of any general purpose regis- 
ter. The base registers are generally used by compil- 
ers to point to the start of the local variable area. 


INDEX: The contents of any general purpose regis- 
ter except for ESP. The index registers are used to 
access the elements of an array, or a string of char- 
acters. 


SCALE: The index register's value can be multiplied 
by a scale factor, either 1, 2, 4 or 8. Scaled index 
mode is especially useful for accessing arrays or 
structures. 


Combinations of these 4 components make up the 9 
additional addressing modes. There is no perform- 
ance penalty for using any of these addressing com- 
binations, since the effective address calculation is 
pipelined with the execution of other instructions. 
The one exception is the simultaneous use of Base 
and Index components which requires one addition- 
al clock. 


As shown in Figure 2.17, the effective address (EA) 
of an operand is calculated according to the follow- 
ing formula. 


Direct Mode: The operand's offset is contained as 
part of the instruction as an 8-, 16- or 32-bit dis- 
placement. 
EXAMPLE: 
INC Word 
PTR [500] 


Register Indirect Mode: A BASE register contains 
the address of the operand. 
EXAMPLE: 
MOV 
[ECX], 
EDX 


SS 
GS 
FS 


ES 
os 
-cs 


SS 
GS 


FS 
ES 
os 


ACCESS 
RIGHTS 
CS 


LIIIIT 


BASE 
ADDRESS 


Based Mode: A BASE register's contents is added 
to a DISPLACEMENT to form the operand's offset. 
EXAMPLE: 
MOV ECX, [EAX + 24] 


Index Mode: An INDEX register's contents is added 
to a DISPLACEMENT to form the operand's offset. 
EXAMPLE: 
ADD EAX, TABLE[ESI] 


Scaled Index Mode: An INDEX register's contents is 
multiplied by a scaling factor which is added to a 
DISPLACEMENT to form the operand's offset. 
EXAMPLE: 
IMUL EBX, TABLE[ES\'4],7 


Based Index Mode: The contents of a BASE register 
is added to the contents of an INDEX register to 
form the effective address of an operand. 
EXAMPLE: 
MOV EAX, 
[ESI] 
[EBX] 


Based Scaled Index Mode: The contents of an IN- 
DEX register is multiplied by a SCALING factor and 
the result is added to the contents of a BASE regis- 
ter to obtain the operand's offset. 
EXAMPLE: 
MOV ECX, [EDX'S] 
[EAX] 


SEGIIENT 
LIIIIT 
,/ 
" 


LINEAR 
ADDRESS • 
TARGET 
ADDRESS 


------~ 


SEGIIENT 
BASE 
ADDRESS 
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Based Index Mode with Displacement: The contents 
of an INDEX Register and a BASE register's con- 
tents and a DISPLACEMENT are all summed to- 
gether to form the operand offset. 
EXAMPLE: ADD EDX, [ESI] [EBP + OOFFFFFOH] 


Based Scaled Index Mode with Displacement: The 
contents of an INDEX register are multiplied by a 
SCALING factor, the result is added to the contents 
of a BASE register and a DISPLACEMENT to form 
the operand's offset. 
EXAMPLE: 
MOV 
EAX, 
LOCALTABLE[EDI*4] 
[EBP+80] 


2.5.4 DIFFERENCES BETWEEN 16- AND 32-BIT 
ADDRESSES 


In order to provide software compatibility with the 
80286 and the 8086, the Intel486 SX microproces- 
sor/Intel OverDrive Processor can execute 16-bit in- 
structions in Real and Protected Modes. The proc- 
essor determines the size of the instructions it is ex- 
ecuting by examining the D bit in the CS segment 
Descriptor. If the D bit is 0 then all operand lengths 
and effective addresses are assumed to be 16 bits 
long. If the D bit is 1 then the default length for oper- 
ands and addresses is 32 bits. In Real Mode the 
default size for operands and addresses is 16-bits. 


Regardless of the default precision of the operands 
or addresses, the Intel486 SX microprocessor/lntel 
OverDrive Processor is able to execute either 16- or 
32-bit instructions. This is specified via the use of 
override prefixes. Two prefixes, the Operand Size 
Prefix and the Address Length Prefix, override the 
value of the D bit on an individual instruction basis. 
These prefixes are automatically added by Intel as- 
semblers. 


Example: The 
Intel486 SX microprocessor/Intel 
OverDrive Processor is executing in Real Mode and 
the programmer needs to access the EAX registers. 
The assembler code for this might be MOV EAX, 32- 
bit MEMORYOP, ASM486 Macro Assembler auto- 
matically determines that an Operand Size Prefix is 
needed and generates it. 


Example: The D bit is 0, and the programmer wishes 
to use Scaled Index addressing mode to access an 
array. The Address Length Prefix allows the use of 
MOV DX, TABLE[ESI*2]. The assembler uses an 
Address Length Prefix since, with D=0, the default 
addressing mode is 16-bits. 


Example: The D bit is 1, and the program wants to 
store a 16-bit quantity. The Operand Length Prefix is 
used to specify only a 16-bit value; MOV MEM16, 
DX. 


The OPERAND LENGTH and Address Length Pre- 
fixes can be applied separately or in combination to 
any instruction. The Address Length Prefix does not 
allow addresses over 64 Kbytes to be accessed in 
Real Mode. A memory address which exceeds 
FFFFH will result in a General Protection Fault. An 
Address Length Prefix only allows the use of the ad- 
ditional Intel486 SX microprocessor/Intel OverDrive 
Processor addressing modes. 


When executing 32-bit code, the Intel486 SX micro- 
processor/lntel OverDrive Processor uses either 8-, 
or 32-bit displacements, and any register can be 
used as base or index registers. When executing 16- 
bit code, the displacements are either 8, or 16 bits, 
and the base and index register conform to the 
80286 model. Table 2.13 illustrates the differences. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor can support a wide-variety of data types. 
In the following descriptions. The central processing 
unit (CPU) consists of the base architecture regis- 
ters. 


For· the Intel OverDrive Processor, the on-chip 
floating point unit (FPU) consists of the floating 
point registers. 


For the Intel OverDrive Processor, the FPU does 
not support unsigned data types. Refer to Table 
2.14. 


Byte: 


Word: 
Dword: 


Unsigned 8-bit quantity 
Unsigned 16-bit quantity 
Unsigned 32-bit quantity 


The least significant bit (LSB) in a byte is bit 0, and 
the most significant bit is 7. 


16-Bit Addressing 
32-Blt Addressing 


BASE REGISTER 
BX,BP 
Any 32-bit GP Register 
INDEX REGISTER 
SI,DI 
Any 32-bit GP Register 
Except ESP 
SCALE FACTOR 
none 
1,2,4,8 
DISPLACEMENT 
0,8,16 bits 
0,8,32 bits 


All signed data types assume 2's complement nota- 
tion. The signed data types contain two fields, a sign 
bit and a magnitude. The sign bit is the most signifi- 
cant bit (MSB). The number is negative if the sign bit 
is 1. If the sign bit is 0, the number is positive. The 
magnitude field consists of the remaining bits in the 
number. Refer to Table 2.14. 
8-bit Integer: 
Signed 8-bit quantity 
16-bit Integer: Signed 16-bit quantity 
32-bit Integer: Signed 32-bit quantity 
64·bit Integer: Signed 64-bit quantity 


For the Intel OverDrive Processor, the FPU only 
supports 16·, 32- and 54-bit integers. 


The CPU section of the Intel486 SX microproces- 
sor/Intel OverDrive Processor only supports 8-, 16- 
and 32-bit integers. 


Floating point data type in the Intel OverDrive 
Processor contain three fields, sign, significand 
and exponent. The sign field is one bit and is the 
MSB of the floating point number. The number is 
negative if the sign bit is 1. If the sign bit is 0, the 
number is positive. The significand gives the sig- 
nificant bits of the number. The exponent field 
contains the power of 2 needed to scale the sig- 
nificand. Refer to Table 2.14. 


Only the FPU supports floating point data types. 
Single Precision Real: 
23·bit significand and 
8-blt exponent. 32 bits 
total. 


Double Precision Real: 
52-bit signiflC8nd and 
11·bit 
exponent 
64 
bits total. 


Extended Precision Real: 54-bit significand and 
15·bit 
exponent. 
SO 
bits total. 


2.6.1.4 BCD Data Types 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor support packed and unpacked binary 
coded decimal (BCD) data types. A packed BCD 
data type contains two digits per byte, the lower digit 
is in bits 0-3 
and the upper digit in bits 4-7. 
An 
unpacked BCD data type contains 1 digit per byte 
stored in bits 0-3. 


The CPU section of the Intel486 SX microproces- 
sor/Intel OverDrive Processor supports 8-bit packed 
and unpacked BCD data types. Refer to Table 2.14. 


For the Intel OverDrive Processor the FPU only 
supports SD-bitpacked BCD data types. 


2.6.1.5 String Data Types 


A string data type is a contiguous sequence of bits, 
bytes, words or dwords. A string may contain be· 
tween 1 byte and 4 Gbytes. Refer to Table 2.15. 


String data types are only supported by the CPU 
section of the Intel486 SX microprocessor/Intel 
OverDrive Processor. 


Bit String: A set of contiguous bits. In the Intel486 
SX microprocessor/Intel 
OverDrive Processor bit 
strings can be up to 4 gigabits long. 


2.6.1.6 ASCII Data Types 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor support ASCII (American Standard Code 
for Information Interchange) strings and can perform 
arithmetic operations (such as addition and division) 
on ASCII data. The CPU section of the Intel486 SX 
microprocessor/Intel OverDrive Processor can only 
operate on ASCII data. Refer to Table 2.15. 


•• 
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Table 2.15. String and ASCII Data Types 


String Data Type. 


A+N 
CJ 


A+l 
A 


~ 
~ 


A+2N+1 
A+2N 
A+3 
A+2 
A+l 
A 


115 


1 
J 
115 
0115 
01 
N 


A+7 
A+8 
A+5 
A+4 
A+3 
A+2 
A+l 
A 


131 
0131 
01 


D 


2.6.1.7 Pointer 
Data Types 


A pointer data type contains a value that gives the 
address of a piece of data. The Intel486 SX micro- 
processor/lntel 
OverDrive Processor support two 
types of pointers. Refer to Table 2.16. 


48-bit Pointer: 16-bit selector and 32-bit offset 


Table 2.16. Pointer 
Data Types 


~SIlIIlyte 


.I. 


47 
31 


Selector 


2.6.2 LITTLE 
ENDIAN 
vs BIG ENDIAN 
DATA 
FORMATS 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor, as well as all other members of the 86 
architecture use the "Iittle-endian" method for stor- 
ing data types that are larger than one byte. Words 
are stored in two consecutive bytes in memory with 
the low-order byte at the lowest address and the 
high order byte at the high address. Dwords are 
stored in four consecutive bytes in memory with the 
low-order byte at the lowest address and the high 
order byte at the highest address. The address of a 
word or dword data item is the byte address of the 
low-order byte. 


Figure 2.18 illustrates the differences between the 
big-endian and little-endian formats for dwords. The 
32 bits of data are shown with the low order bit num- 
bered bit 0 and the high order bit numbered 32. Big- 
endian data is stored with the high-order bits at the 
lowest addressed byte. Little-endian data is stored 
with the high-order bits in the highest addressed 
byte. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor has two instructions which can convert 
16- or 32-bit data between the two byte orderings. 
BSWAP (byte swap) handles four byte values and 
XCHG (exchange) handles two byte values. 


Interrupts and exceptions alter the normal program 
flow, in order to handle external events, to report 
errors or exceptional conditions. The difference be- 
tween interrupts and exceptions is that interrupts are 
used to handle asynchronous external events while 
exceptions handle instruction faults. Although a pro- 
gram can generate a software interrupt via an INT N 
instruction, the Intel486 SX microprocessor/Intel 
OverDrive Processor treats software interrupts as 
exceptions. 


Hardware interrupts occur as the result of an exter- 
nal event and are classified into two types: maskable 
or non-maskable. Interrupts are serviced after the 
execution of the current instruction. After the inter- 
rupt handler is finished servicing the interrupt, exe- 
cution proceeds with the instruction immediately af· 
ter the interrupted instruction. Sections 2.7.3 and 
2.7.4 discuss the differences between Maskable and 
Non-Maskable interrupts. 


Exceptions are classified as faults, traps, or aborts 
depending on the way they are reported, and wheth- 
er or not restart of the instruction causing the excep- 
tion is supported. Faults are exceptions that are de- 
tected and serviced before 
the execution of the 
faulting instruction. A fault would occur in a virtual 
memory system, when the processor referenced a 
page or a segment which was not present. The oper- 
ating system would fetch the page or segment from 
disk, and then the Intel486 SX microprocessor/lntel 
OverDrive Processor would restart the instruction. 
Traps are exceptions that are reported immediately 
after the execution of the instruction which caused 
the problem. User defined interrupts are examples 
of traps. Aborts are exceptions which do not permit 
the precise location of the instruction causing the 
exception to be determined. Aborts are used to re- 
port severe errors, such as a hardware error, or ille- 
gal values in system tables. 


Thus, when an interrupt service routine has been 
completed, execution proceeds from the instruction 
immediately following the interrupted instruction. On 
the other hand, the return address from an excep- 
tion fault routine will always point at the instruction 
causing the exception and include any leading in- 
struction prefixes. Table 2.17 summarizes the possi- 
ble interrupts for the Intel486 SX microprocessor/In- 
tel OverDrive Processor and shows where the return 
address points. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor have the ability to handle up to 256 differ- 
ent interrupts/ exceptions. In order to service the in- 
terrupts, a table with up to 256 interrupt vectors 
must be defined. The interrupt vectors are simply 
pointers to the appropriate interrupt service routine. 
In Real Mode (see Section 3.1), the vectors are 4 
byte quantities, a Code Segment plus a 16-bit offset; 
in Protected Mode, the interrupt vectors are 8 byte 
quantities, which are put in an Interrupt Descriptor 
Table (see Section 4.3.3.4). Of the 256 possible in- 
terrupts, 32 are reserved for use by Intel, the remain- 
ing 224 are free to be used by the system designer. 


When an interrupt occurs the following actions hap- 
pen. First, the current program address and the 
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Flags are saved on the stack to allow resumption 
of 
the interrupted 
program. 
Next, an 8-bit vector 
is sup- 
plied to the Intel486 
SX microprocessor/Intel 
Over- 
Drive Processor 
which 
identifies 
the appropriate 
en- 
try in the interrupt 
table. The table contains 
the start- 
ing address 
of the 
interrupt 
service 
routine. 
Then, 
the user supplied 
interrupt 
service 
routine 
is execut- 
ed. Finally, when an IRET instruction 
is executed 
the 
old 
Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
state 
is restored 
and program 
execution 
resumes 
at the appropriate 
instruction. 


Maskable 
interrupts 
are the most common 
way used 
by the Intel486 
SX microprocessor/Intel 
OverDrive 
Processor 
to 
respond 
to 
asynchronous 
external 
hardware 
events. 
A hardware 
interrupt 
occurs 
when 
the INTR is pulled high and the Interrupt 
Flag bit (IF) 
is enabled. 
The 
Intel486 
SX microprocessor/Intel 
OverDrive 
Processor 
only responds 
to interrupts 
be- 
tween instructions, 
(REPeat String instructions, 
have 
an 
"interrupt 
window", 
between 
memory 
moves, 


which 
allows 
interrupts 
during 
long 
string 
moves). 
When an interrupt 
occurs the Intel486 
SX microproc- 
essor/Intel 
OverDrive 
Processor 
CoProcessor 
reads 
an 8-bit vector 
supplied 
by the hardware 
which iden- 
tifies 
the source 
of the interrupt, 
(one of 224 
user 
defined 
interrupts). 
The exact nature of the interrupt 
sequence 
is discussed 
in Section 
7.2.10. 


The 8-bit interrupt 
vector 
is supplied 
to the Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
in 
several 
different 
ways: 
exceptions 
supply 
the inter- 
rupt vector 
internally; 
software 
INT instructions 
con- 
tain 
or imply 
the vector; 
maskable 
hardware 
inter- 
rupts 
supply 
the 
8-bit 
vector 
via the 
interrupt 
ac- 
knowledge 
bus sequence. 
Non-Maskable 
hardware 
interrupts 
are assigned 
to interrupt 
vector 
2. 


Instruction 
Which 
Return Address 


Function 
Interrupt 
Can Cause 
Points 
to 
Type 
Number 
Exception 
Faulting 
Instruction 


Divide Error 
0 
DIV,IDIV 
YES 
FAULT 


Debug Exception 
1 
Any Instruction 
YES 
TRAP' 


NMI Interrupt 
2 
INT 2 orNMI 
NO 
NMI 


One Byte Interrupt 
3 
INT 
NO 
TRAP 


Interrupt 
on Overflow 
4 
INTO 
NO 
TRAP 


Array Bounds Check 
5 
BOUND 
YES 
FAULT 


Invalid OP-Code 
6 
Any Illegal Instruction 
YES 
FAULT 


Device Not Available 
7 
ESC, WAIT 
YES 
FAULT 


Double Fault 
8 
Any Instruction 
That Can 
ABORT 
Generate 
an Exception 


Intel Reserved 
9 


InvalidTSS 
10 
JMP, CALL, IRET, INT 
YES 
FAULT 


Segment 
Not Present 
11 
Segment 
Register 
Instructions 
YES 
FAULT 


Stack Fault 
12 
Stack References 
YES 
FAULT 


General 
Protection 
Fault 
13 
Any Memory 
Reference 
YES 
FAULT 


Page Fault 
14 
Any Memory Access or Code Fetch 
YES 
FAULT 


Intel Reserved 
15 


Floating 
Point Error 
16 
Floating 
Point, WAIT 
YES 
FAULT 


Alignment 
Check Interrupt 
17 
Unaligned 
Memory Access 
YES 
FAULT 


Intel Reserved 
18-31 


Two Byte Interrupt 
0-255 
INTn 
NO 
TRAP 


• 


The IF bit in the EFLAG registers is reset when an 
interrupt is being serviced. This effectively disables 
servicing additional interrupts during an interrupt 
service routine. However, the IF may be set explicitly 
by the interrupt handler, to allow the nesting of inter- 
rupts. When an IRET instruction is executed the 
original state of the IF is restored. 


Non-maskable interrupts provide a method of servic- 
ing very high priority interrupts. A common example 
of the use of a non-maskable interrupt (NMI) would 
be to activate a power failure routine. When the NMI 
input is pulled high it causes an interrupt with an 
internally supplied vector value of 2. Unlike a normal 
hardware interrupt, no interrupt acknowledgment se- 
quence is performed for an NMI. 


While executing the NMI servicing procedure, the In- 
tel486 SX microprocessor/Intel OverDrive Proces- 
sor will not service further NMI requests until an in- 
terrupt return (IRET) instruction is executed or the 
processor is reset. If NMI occurs while currently 
servicing an NMI, its presence will be saved for serv- 
icing after executing the first IRET instruction. The IF 
bit is cleared at the beginning of an NMI interrupt to 
inhibit further INTR interrupts. 


A third type of interrupt/exception for the Intel486 
SX microprocessor/Intel OverDrive Processor is the 
software interrupt. An INT n instruction causes the 
processor to execute the interrupt service routine 
pointed to by the nth vector in the interrupt table. 


A special case of the two byte software interrupt INT 
n is the one byte INT 3, or breakpoint interrupt. By 
inserting this one byte instruction in a program, the 
user can set breakpoints in his program as a debug- 
ging tool. 


A final type of software interrupt is the single step 
interrupt. It is discussed in Section 9.2. 


Interrupts are externally-generated events. Maska- 
ble Interrupts (on the INTR input) and Non-Maskable 
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Interrupts (on the NMI input) are recognized at in- 
struction 
boundaries. When 
NMI and 
maskable 
INTR are both recognized at the same instruction 
boundary, the 
Intel486 
SX microprocessor/Intel 
OverDrive Processor invokes the NMI service rou- 
tine first. If, after the NMI service routine has been 
invoked, maskable interrupts are still enabled, then 
the 
Intel486 SX microprocessor/Intel 
OverDrive 
Processor will invoke the appropriate interrupt serv- 
ice routine. 


Table 2.18a.lnteI486 SX Mlcroprocessor/ 
Intel OverDrive Processor Priority for 
Invoking Service Routines in Case of 
Simultaneous External Interrupts 


1. NMI 
2.INTR 


Exceptions are internally-generated events. Excep- 
tions are detected by the Intel486 SX microproces- 
sor/lntel OverDrive Processor if, in the course of ex- 
ecuting an instruction, the Intel486 SX microproces- 
sor/Intel OverDrive Processor detects a problematic 
condition. The 
Intel486 SX microprocessor/Intel 
OverDrive Processor then immediately invokes the 
appropriate exception service routine. The state of 
the 
Intel486 SX microprocessor/Intel 
OverDrive 
Processor is such that the instruction causing the 
exception can be restarted. If the exception service 
routine has taken care of the problematic condition, 
the instruction will execute without causing the same 
exception. 


It is possible for a single instruction to generate sev- 
eral exceptions (for example, transferring a single 
operand could generate two page faults if the oper- 
and location spans two "not present" pages). How- 
ever, only one exception is generated upon each at- 
tempt to execute the instruction. Each exception 
service routine should correct its corresponding ex- 
ception, and restart the instruction. In this manner, 
exceptions are serviced until the instruction exe- 
cutes successfully.. 


As the Intel486 SX microprocessor/Intel OverDrive 
Processor executes instructions, it follows a consist- 
ent cycle in checking for exceptions, as shown in 
Table 2.18b. This cycle is repeated as each instruc- 
tion is executed, and occurs in parallel with instruc- 
tion decoding and execution. 


Consider 
the case of the Intel486 
SX microproc- 
essor/Intel 
OverDrive 
Processor 
having 
just 
completed 
an instruction. 
It then 
performs 
the 
following 
checks 
before reaching the point where 
the next instruction 
is completed: 


1. Check for Exception 
1 Traps from the instruc- 
tion just completed 
(single-step 
via Trap Flag, 
or Data Breakpoints 
set in the Debug Regis- 
ters). 


2. Check 
for Exception 
1 Faults 
in the next in- 
struction 
(Instruction 
Execution 
Breakpoint 
set in the 
Debug 
Registers 
for the 
next 
in- 
struction). 


3. Check for external 
NMI and INTR. 


4. Check 
for Segmentation 
Faults that prevent- 
ed fetching 
the entire next instruction 
(excep- 
tions 
11 or 13). 


5. Check 
for Page Faults 
that prevented 
fetch- 
ing the entire next instruction 
(exception 
14). 


6. Check 
for Faults 
decoding 
the 
next instruc- 
tion (exception 
6 if illegal 
opcode; 
exception 
6 if in Real Mode or in Virtual 8086 Mode and 
attempting 
to execute 
an instruction 
for Pro- 
tected 
Mode only (see Section 
4.6.4); or ex- 
ception 
13 if instruction 
is longer 
than 
15 
bytes, or privilege violation 
in Protected 
Mode 
(Le., not at IOPL or at CPL = 0). 


7. If WAIT opcode, 
check 
if TS=1 
and MP=1 
(exception 
7 if both are 1). 
~ 


8. If opcode 
for 
Floating 
Point 
Unit. 
check 
If 
EM = 1 or TS = 1 (exception 
7 if either are 1). 


9. If opcode 
for Floating 
Point Unit (FPU), check 
FPU error status 
(exception 
16 If error status 
is asserted). 


10. Check 
in the following 
order for each memo- 
ry reference 
required 
by the instruction: 


a. Check 
for Segmentation 
Faults 
that 
pre- 
vent transferring 
the entire memory quanti- 
ty (exceptions 
11, 12, 13). 


b. Check 
for Page Faults that prevent 
trans- 
ferring 
the entire memory 
quantity 
(excep- 
tion 14). 


NOTE: 
The 
order 
stated 
supports 
the 
concept 
of the 
paging 
mechanism 
being "underneath" 
the seg- 
mentation 
mechanism. 
Therefore, 
for any given 
code 
or data 
reference 
in memory, 
segmenta- 
tion exceptions 
are generated 
before 
paging ex- 
ceptions 
are generated. 


The 
Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
fully supports 
restarting 
all instructions 
af- 
ter faults. 
If an exception 
is detected 
in the instruc- 
tion to be executed 
(exception 
categories 
4 through 
10 in Table 2.18b), the Intel486 
SX microprocessor/ 
Intel 
OverDrive 
Processor 
invokes 
the 
appropriate 
exception 
service 
routine. 
The 
Intel486 
SX micro- 
processor/Intel 
OverDrive 
Processor 
is in a state 
that 
permits 
restart 
of the instruction, 
for all cases 
but those 
in Table 
2.18c. 
Note that 
all such 
cases 
are easily avoided 
by proper design of the operating 
system. 


Table 
2.18c. Conditions 
Preventing 
Instruction 
Restart 


An 
instruction 
causes 
a task 
switch 
to 
a task 
whose 
Task 
State 
Segment 
is 
partially 
"not 
present". 
(An entirely 
"not 
present" 
TSS is re- 
startable.) 
Partially 
present 
TSS's 
can be avoid- 
ed either 
by keeping 
the 
TSS's 
of such 
tasks 
present 
in memory, 
or by aligning 
TSS segments 
to reside entirely within a single 4K page (for TSS 
segments 
of 4 Kbytes or less). 


NOTE: 
These 
conditions 
are avoided 
by using the oper- 
ating system 
designs 
mentioned 
in this table. 


2.7.8 DOUBLE 
FAULT 


A Double 
Fault 
(exception 
8) results 
when 
the 
In- 
tel486 
SX microprocessor/Intel 
OverDrive 
Proces- 
sor attempts 
to invoke 
an exception 
service 
routine 
for the segment 
exceptions 
(10, 11, 12 or 13), but in 
the process 
of doing so, detects 
an exception 
other 
than a Page Fault (exception 
14). 


A Double 
Fault (exception 
8) will also be generated 
when 
the 
Intel486 
SX microprocessor/Intel 
Over- 
Drive Processor 
attempts 
to invoke 
the Page Fault 
(exception 
14) service 
routine, 
and detects 
an ex- 
ception 
other than a second 
Page Fault. In any func- 
tional 
system, 
the entire 
Page Fault service 
routine 
must remain 
"present" 
in memory. 


When a Double Fault occurs, the Intel486 
SX micro- 
processor/lntel 
OverDrive 
Processor 
invokes 
the 
exception 
service 
routine for exception 
8. 


2.7.9 FLOATING 
POINT 
INTERRUPT 
VECTORS 


several 
interrupt 
vectors 
of the Intel486 
SX mi- 
croprocessorllntel 
OverDrive 
Processor 
are 
used to report 
exceptional 
conditions 
while exe· 
cuting numeric programs 
In either real or protect- 
ed mode. Table 2.19 shows these 
interrupts 
and 
their causes. 


Interrupt 
C8uM of Interrupt 
Number 


7 
A FloatIng Point instruction was encountered 
when EM or TS of the Intel OverDrive 
Processor controf register zero (CRO) was set. EM = 1 indicates that software emulation 
of the In8tructIon Is required. When TS Is set, elther a Floating PoInt or WAlT instruction 
causes IntemJpt 7. ThIs indicates that the current FPU context may not belong to the 
current task. 


13 
The fInlt word or doubIeword of a numeric operand Is not entirely within the limit of Its 
segment 
The return address pushed onto the stack of the exception handler points at the 
Floating PoInt Instruction that cauaed the exception. tncludtng any prefixes. The FPU has 
not executed thts instruction; the Instruction pointer and data pointer register refer to a 
previous, correctly executed 1n8trucIlon. 


16 
The prevtous numerics instruction cauaed an unmasked exception. The address of the 
faulty tnstructton and the address of Its operand are stored in the Instruction pointer and 
data potnt:eers. 
Only Floating Point and WAIT tnstructtons can cause this interrupt 
The Intel 
Processor return address pushed onto the stack of the exception 
handler points to a WAlT or FloatIng Point tnstructton (mctudlng prefixes). This instruction 
can be restarted after clearing the exception condition In the FPU. The FNINIT. FNCLEX. 
FNSTSW. FNSTENV. and FNSA ve tnstructtons cannot cause thts interrupt 


When the Intel486 SX microprocessor/Intel 
Over- 
Drive Processor is reset or powered up, it is initial- 
ized in Real Mode. Real Mode has the same base 
architecture as the 8086, but allows access to the 
32-bit register set of the Intel486 SX microproces- 
sor/Intel 
OverDrive 
Processor. 
The 
addressing 
mechanism, memory size, interrupt handling, are all 
identical to the Real Mode on the 80286. 


All of the Intel486 SX microprocessor/Intel 
Over- 
Drive Processor instructions are available in Real 
Mode (except those instructions listed in Section 
4.6.4). The default operand size in Real Mode is 16 
bits, just like the 8086. In order to use the 32-bit 
registers and addressing modes, override prefixes 
must be used. In addition, the segment size on the 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor in Real Mode is 64 Kbytes so 32-bit effective 
addresses must have a value less thap OOOOFFFFH. 
The primary purpose of Real Mode is to set up the 
processor for Protected Mode Operation. 


The LOCK prefix on the Intel486 SX microproces- 
sor/Intel OverDrive Processor even in Real Mode, is 
more restrictive than on the 80286. This is due to the 
addition of paging on the Intel486 SX microproces- 
sor/Intel 
OverDrive Processor in Protected Mode 
and Virtual 8086 Mode. Paging makes it impossible 
to guarantee that repeated string instructions can be 
LOCKed. The 
Intel486 
SX microprocessor/Intel 
OverDrive Processor can't require that all pages 
holding the string be physically present in memory. 
Hence, a Page Fault (exception 14) might have to be 
taken during the repeated string instruction. There- 
fore the LOCK prefix can't be supported during re- 
peated string instructions. 


These are the only instruction forms where the 
LOCK prefix is legal on the Intel486 SX microproces- 
sor/lntel OverDrive Processor: 


Opcode 
Operands 
(Dest, Source) 


BITTest and 
Mem,Reg/immed 
SETIRESET/COMPLEMENT 
XCHG 
Reg,Mem 
XCHG 
Mem,Reg 
ADD,OR,ADC,SBB, 
Mem,Reg/immed 
AND,SUB,XOR 
NOT,NEG,INC,DEC 
Mem 
CMPXCHG,XADD 
Mem,Reg 


An exception 6 will be generated if a LOCK prefix is 
placed before any instruction form or opcode not 
listed above. The LOCK prefix allows indivisible 
read/modify/write 
operations on memory operands 
using the instructions above. For example, even the 
ADD Reg, Mem is not LOCKable, because the Mem 
operand is not the destination (and therefore no 
memory read/modify/operation is being performed). 


Since, on the 
Intel486 SX microprocessor/Intel 
OverDrive Processor, repeated string instructions 
are not LOCKable, it is not possible to LOCK the bus 
for a long period of time. Therefore, the LOCK prefix 
is not IOPL-sensitive on the Intel486 SX microproc- 
essor/Intel OverDrive Processor. The LOCK prefix 
can be used at any privilege level, but only on the 
instruction forms listed above. 


3.2 
Memory Addressing 


In Real Mode the maximum memory size is limited to 
1 megabyte. Thus, only address lines A2-A 19 are 
active. (Exception, after RESET address lines A20- 
A31 are high during CS-relative memory cycles until 
an intersegment jump or call is executed (see Sec- 
tion 6.5». 


Since paging is not allowed in Real Mode the linear 
addresses are the same as physical addresses. 
Physical addresses are formed in Real Mode by 
adding the contents of the appropriate segment reg- 
ister which is shifted left by four bits to an effective 
address. This addition results in a physical address 
from OOOOOOOOH 
to 0010FFEFH. This is compatible 


with 80286 Real Mode. Since segment registers are 
shifted left by 4 bits, Real Mode segments always 
start on 16 byte boundaries. 


All segments in Real Mode are exactly 64 Kbytes 
long, and may be read, written, or executed. The 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor will generate an exception 13 if a data operand 
or instruction fetch occurs past the end of a seg- 
ment (i.e., if an operand has an offset great~than 
FFFFH, for example a word with a low byte at 
FFFFH and the high byte at OOOOH). 


Segments may be overlapped in Real Mode. Thus, if 
a particular segment does not use all 64 Kbytes an- 
other segment can be overlayed on top of the un- 
used portion of the previous segment. This allows 
the programmer to minimize the amount of physical 
memory needed for a program. 


There are two fixed areas in memory which are re- 
served in Real address mode: system initialization 
area and the interrupt table area. Locations OOOOOH 
through 003FFH are reserved for interrupt vectors. 
Each one of the 256 possible interrupts has a 4-byte 
jump vector reserved for it. Locations FFFFFFFOH 
through FFFFFFFFH are reserved for system initiali- 
zation. 


Many of the exceptions shown in Table 2.17 and 
discussed in Section 2.7 are not applicable to Real 
Mode operation, in particular exceptions 10, 11, 14, 
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17, will not happen in Real Mode. Other exceptions 
have slightly different meanings in Real Mode; Table 
3.1 identifies these exceptions. 


The HLT instruction stops program execution and 
prevents the Intel486 SX microprocessor/Intel Over- 
Drive Processor from using the local bus until re- 
started. Either NMI, INTR with interrupts enabled 
(IF= 1), or RESET will force the Intel486 SX micro- 
processor/Intel OverDrive Processor out of halt. If 
interrupted, the saved CS:IP will point to the next 
instruction after the HLT. 


As in the case in protected mode, the shutdown will 
occur when a severe error is detected that prevents 
further processing. In Real Mode, shutdown can oc- 
cur under two conditions: 


An interrupt or an exception occurs (exceptions 8 or 
13) and the interrupt vector is larger than the Inter- 
rupt Descriptor Table (i.e., there is not an interrupt 
handler for the interrupt). 


A CALL, INT or PUSH instruction attempts to wrap 
around the stack segment when SP is not even (i.e., 
pushing a value on the stack when SP = 0001 re- 
sulting in a stack segment greater than FFFFH). 


An NMI input can bring the processor out of shut- 
down if the Interrupt Descriptor Table limit is large 
enough to contain the NMI interrupt vector (at least 
0017H) and the stack has enough room to contain 
the vector and flag information (i.e., SP is greater 
than 0005H). If these conditions are not met, the 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor is unable to execute the NMI and executes an- 
other shutdown cycle. In this case, the Intel486 SX 
microprocessor/Intel OverDrive Processor remains 
in the shutdown and can only exit via the RESET 
input. 


Function 
Interrupt 
Related 
Return 


Number 
Instructions 
Address 
Location 


Interrupt table limit too small 
8 
INT Vector is not 
Before 


within table limit 
Instruction 


CS, OS,ES, FS, GS 
13 
Word memory reference 
Before 


Segment overrun exception 
beyond offset = FFFFH. 
Instruction 


An attempt to execute 
past the end of CS segment. 


SS Segment overrun exception 
12 
Stack Reference 
Before 


beyond offset = FFFFH 
Instruction 
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4.0 
PROTECTED 
MODE 
ARCHITECTURE 


The complete capabilities of the Intel486 SX micro- 
processorlintel 
OverDrive Processor are unlocked 
when the Intel486 SX microprocessor/Intel 
Over- 
Drive Processor operates in Protected Virtual Ad- 
dress Mode (Protected Mode). Protected Mode 
vastly increases the linear address space to four gig- 
abytes (232 bytes) and allows the running of virtual 
memory programs of almost unlimited size (64 tera- 
bytes or 246 bytes). In addition Protected Mode al- 
lows the Intel486 SX microprocessor/Intel 
Over- 
Drive Processor to run all of the existing 8086, 
80286 and Intel386 microprocessor software, while 
providing a sophisticated memory management and 
a hardware-assisted protection mechanism. Protect- 
ed Mode allows the use of additional instructions es- 
pecially optimized for supporting multitasking operat- 
ing systems. The base architecture of the Intel486 
SX microprocessorlintel 
OverDrive Processor re- 
mains the same, the registers, instructions, and ad- 
dressing modes described in the previous sections 
are retained. The main difference between Protect- 
ed Mode, and Real Mode from a programmer's view 
is the increased address space, and a different ad- 
dressing mechanism. 


ACCESS RIGHTS 


LIMIT 


BASE ADDRESS 


4.2 Addressing 
Mechanism 


Like Real Mode, Protected Mode uses two compo- 
nents to form the logical address, a 16-bit selector is 
used to determine the linear base address of a seg- 
ment, the base address is added to a 32-bit effective 
address to form a 32-bit linear address. The linear 
address is then either used as the 32-bit physical 
address, or if paging is enabled the paging mecha- 
nism maps the 32-bit linear address into a 32-bit 
physical address. 


The difference between the two modes lies in calcu- 
lating the base address. In Protected Mode the se- 
lector is used to specify an index into an operating 
system defined table (see Figure 4.1). The table 
contains the 32-bit base address of a given seg- 
ment. The physical address is formed by adding the 
base address obtained from the table to the offset. 


Paging provides an additional memory management 
mechanism which operates only in Protected Mode. 
Paging provides a means of managing the very large 
segments of the Intel486 SX microprocessorllntel 
OverDrive Processor. As such, paging operates be- 
neath segmentation. The paging mechanism trans- 
lates the protected linear address which comes from 
the segmentation unit into a physical address. Figure 
4.2 shows the complete Intel486 SX microproces- 
sor/Intel 
OverDrive Processor addressing mecha- 
nism with paging enabled. 
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Int.I.S6™ 
SX 
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Intel 
OverOrive™ 
Processor 


PAGING 


MECHANISM 
PAGE 
FRAME 


ADDRESS 


Segmentation is one method of memory manage- 
ment. Segmentation provides the basis for protec- 
tion. Segments are used to encapsulate regions of 
memory which have common attributes. For exam- 
ple, all of the code of a given program could be con- 
tained in a segment, or an operating system table 
may reside in a segment. All information about a 
segment is stored in an 8 byte data structure called 
a descriptor. All of the descriptors in a system are 
contained in tables recognized by hardware. 


The following terms are used throughout the discus- 
sion of descriptors, privilege levels and protection: 


PL: PriVilege Level-Cne 
of the four hierarchical 
privilege levels. Level 0 is the most privileged level 
and level 3 is the least privileged. More privileged 
levels are numerically smaller than less privileged 
levels. 


RPL: Requestor Privilege Level-The 
privilege level 
of the original supplier of the selector. RPL is deter- 
mined by the least two significant bits of a selector. 


DPL: Descriptor Privilege Level-This 
is the least 
privileged level at which a task may access that de- 
scriptor (and the segment associated with that de- 
scriptor). Descriptor Privilege Level is determined by 
bits 6:5 in the Access Right Byte of a descriptor. 


CPL: Current Privilege Level-The 
privilege level at 
which a task is currently executing, which equals the 
privilege level of the code segment being executed. 
CPL can also be determined by examining the low- 
est 2 bits of the CS register, except for conforming 
code segments. 


EPL: Effective Privilege Level-The 
effective privi- 
lege level is the least privileged of the RPL and DPL. 
Since smaller privilege level values indicate greater 
privilege, EPL is the numerical maximum of RPL and 
DPL. 


Task: One instance of the execution of a program. 
Tasks are also referred to as processes. 


4.3.3.1 Descriptor 
Tables 
Introduction 


The descriptor tables define all of the segments 
which are used in an Intel486 SX microprocessor/In- 
tel OverDrive Processor system. There are three 


types of tables on the Intel486 SX microprocessor/ 
Intel OverDrive Processor which hold descriptors: 
the Global Descriptor Table, Local Descriptor Table, 
and the Interrupt Descriptor Table. All of the tables 
are variable length memory arrays. They can range 
in size between 8 bytes and 64 Kbytes. Each table 
can hold up to 8192 8-byte descriptors. The upper 
13 bits of a selector are used as an index into the 
descriptor table. The tables have registers associat- 
ed with them which hold the 32-bit linear base ad- 
dress, and the 16-bit limit of each table. 


Each of the tables has a register associated with it, 
the GDTR, LDTR, and the IDTR (see Figure 4.3). 
The LGDT, LLDT, and L1DT instructions, load the 
base and limit of the Global, Local, and Interrupt De- 
scriptor Tables, respectively, into the appropriate 
register. The SGDT, SLOT, and SlOT store the base 
and limit values. These tables are manipulated by 
the operating system. Therefore, the load descriptor 
table instructions are privileged instructions. 


II 
I 
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4.3.3.2 
Global Descriptor 
Table 


The Global Descriptor Table (GOT) contains de- 
scriptors which are possibly available to all of the 
tasks in a system. The GOT can contain any type of 
segment descriptor except for descriptors which are 
used for servicing interrupts (Le., interrupt and trap 
descriptors). Every Intel486 SX microprocessor/Intel 
OverDrive Processor system contains a GOT. Gen- 
erally the GOT contains code and data segments 
used by the operating systems and task state seg- 
ments, and descriptors for the LOTs in a system. 


The first slot of the Global Descriptor Table corre- 
sponds to the null selector and is not used. The null 
selector defines a null pointer value. 


4.3.3.3 
Local Descriptor 
Table 


LOTs contain descriptors which are associated with 
a given task. Generally, operating systems are de- 
signed so that each task has a separate LOT. The 
LOT may contain only code, data, stack, task gate, 
and call gate descriptors. LOTs provide a mecha- 
nism for isolating a given task's code and data seg- 
ments from the rest of the operating system, while 
the GOT contains descriptors for segments which 
are common to all tasks. A segment cannot be ac- 
cessed by a task if its segment descriptor does not 
exist in either the current LOT or the GOT. This pro- 
vides both isolation and protection for a task's seg- 
ments, while still allowing global data to be shared 
among tasks. 


Unlike the 6 byte GOT or lOT registers which contain 
a base address and limit, the visible portion of the 
LOT register contains only a 16-bit selector. This se- 
lector refers to a Local Descriptor Table descriptor in 
the GOT. 


• 


4.3.3.4 Interrupt 
Descriptor Table 


The third table needed for Intel486 SX microproces- 
sor/Intel OverDrive Processor systems is the Inter- 
rupt Descriptor Table. (See Figure 4.4.) The lOT con- 
tains the descriptors which point to the location of 
up to 256 interrupt service routines. The lOT may 
contain only task gates, interrupt gates, and trap 
gates. The lOT should be at least 256 bytes in size in 
order to hold the descriptors for the 32 Intel Re- 
served Interrupts. Every interrupt used by a system 
must have an entry in the lOT. The lOT entries are 
referenced via INT instructions, external interrupt 
vectors, and exceptions. (See Section 2.7 Inter- 
rupts). 


tHTERAUPT 
DESCRIPTOR 
TABLE 
(lOT) 
ClATEFOR 
INTERRUPT 
.1 


aATEFOR 


INTERRUPT "0 lid 


Figure 4.4. Interrupt Descriptor 
Table Register Use 
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4.3.4.1 Descriptor 
Attribute 
Bits 


The object to which the segment selector points to 
is called a descriptor. Descriptors are eight byte 
quantities which contain attributes about a given re- 
gion of linear address space (Le.,a segment). These 
attributes include the 32-bit base linear address of 
the segment, the 20-bit length and granularity of the 
segment, the protection level, read, write or exe~ute 
privileges, the default size of the operands (16-blt or 
32-bit), and the type of segment. All of the attribute 
information about a segment is contained in 12 bits 
in the segment descriptor. Figure 4.5 shows the gen- 
eral format of a descriptor. All segments on the In- 
tel486 SX microprocessor/Intel OverDrive Proces- 
sor have three attribute fields in common: the P bit, 
the DPL bit, and the S bit. The Present P bit is 1 if 
the segment is loaded in physical memory, if P=0 
then any attempt to access this segment causes a 
not present exception (exception 11). The Descrip- 
tor Privilege Level DPL is a two-bit field which speci- 
fies the protection level 0-3 associated with a seg- 
ment. 


The Intel486 SX microprocessor/Intel 
OverDrive 


Processor have two main categories of segments: 
system segments and non-system segments (for 
code and data). The segment S bit in the segment 
descriptor determines if a given segment is a system 
segment or a code or data segment. If the S bit is ~ 
then the segment is either a code or data segment, If 
it is 0 then the segment is a system segment. 


4.3.4.2 Intel486™ SX Microprocessor! 


Intel OverDrive Processor 
Code, Data Descriptors 
(S= 1) 


Figure 4.6 shows the general format of a code and 
data descriptor and Table 4.1 illustrates how the bits 
in the Access Rights Byte are interpreted. 


o 
BYTE 
ADDRESS 
o 


BASE 
23 ... 16 


BASE 
Base Address of the segment 
LIMIT 
The length of the segment 
P 
Present Bit 
1= Present 
0 = Not Present 
DPL 
Descriptor Privilege Level 0-3 
S 
Segment Descriptor 
0 = System Descriptor 
1= Code or Data Segment Descriptor 
TYPE 
Type of Segment 
A 
Accessed Bit 
G 
Granularity Bit 
1= Segment length is page granular 
0= Segment length is byte granular 
D 
Default Operation Size (recognized in code segment descriptors only) 
1= 32-bit segment 
0 = 16-bit segment 
o 
Bit must be zero (0) for compatibility with future processors 
AVL 
Available field for user or as 


NOTE: 
In a maximum-size segment (I.e., a segment with G= 1 and segment limit 19...0 = FFFFFH), the lowest 12 bits of the 
segment base should be zero (I.e., segment base 11...000= OOOH). 


SEGMENT 
BASE 15 ... 
0 
SEGMENT 
LIMIT 15 ... 
0 


LIMIT 
ACCESS 
BASE 
BASE 31 ... 
24 
G 
D 
0 
AVL 
RIGHTS 
19 ... 
16 
BYTE 
23 ... 
16 


D/B 
1= Default Instruction Attributes are 32-Bits 
0= Default Instruction Attributes are 16-Bits 
AVL 
Available field for user or as 
G 
Granularity Bit 
1= Segment length is page granular 
0= Segment length is byte granular 
o 
Bit must be zero (0) for compatibility with future processors 
• 


Bit 
Name 
Function 
Position 


7 
Present 
(P) 
P = 1 
Segment 
is mapped 
into physical 
memory. 
p=o 
No mapping to physical 
memory exits, base and limit are 
not used. 
6-5 
Descriptor 
Privilege 
Segment 
privilege 
attribute 
used in privilege tests. 
Level (DPL) 
4 
Segment 
Descrip- 
S = 1 
Code or Data (includes 
stacks) segment 
descriptor. 


tor (S) 
S=O 
System Segment 
Descriptor 
or Gate Descriptor. 


3 
Executable 
(E) 
E - 0 
De""pto, typ, ;, data "'!Im,nt 
} " 
2 
Expansion 
Direc- 
ED = 0 Expand up segment, 
offsets 
must be 
:5:: limit. 
Data 
tion (ED) 
ED = 
1 Expand down segment, 
offsets 
must be > limit. 
Segment 
1 
Writeable 
(W) 
W = 0 
Data segment 
may not be written into. 
(S = 
1, 
W = 1 
Data segment 
may be written into. 
E = 0) 


3 
Executable 
(E) 
E = 1 
Descriptor 
type is code segment: 
r 


2 
Conforming 
(C) 
C = 
1 
Code segment 
may only be executed 
Code 
when CPL ~ DPL and CPL 
Segment 
remains unchanged. 
(S = 
1, 
1 
Readable 
(R) 
R=O 
Code segment 
may not be read. 
E = 1) 
R = 1 
Code segment 
may be read. 


0 
Accessed 
(A) 
A=O 
Segment 
has not been accessed. 
A = 1 
Segment 
selector 
has been loaded into segment 
register 
or used by selector 
test instructions. 


Type 
Field 
Definition 


Code 
and 
data 
segments 
have 
several 
descriptor 
fields in common. 
The accessed 
A bit is set whenev- 
er the processor 
accesses 
a descriptor. 
The A bit is 
used by operating 
systems 
to keep usage statistics 
on a given 
segment. 
The 
G bit, or granularity 
bit, 
specifies 
if a segment 
length 
is byte-granular 
or 
page-granular. 
Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
segments 
can be one 
mega- 
byte 
long with byte granularity 
(G = 0) or four giga- 
bytes with page granularity 
(G = 1), (Le., 220 pages 
each 
page is 4 Kbytes 
in length). 
The granularity 
is 
totally 
unrelated 
to paging. 
An Intel486 
SX micro- 
processor/Intel 
OverDrive 
Processor 
system 
can 
consist 
of segments 
with byte granularity, 
and page 
granularity, 
whether 
or not paging is enabled. 


The executable 
E bit tells if a segment 
is a code or 
data segment. 
A code segment 
(E = 1, S = 1) may be 
execute-only 
or execute/read 
as determined 
by the 
Read 
R bit. 
Code 
segments 
are 
execute 
only 
if 
R = 0, and 
execute/read 
if R = 1. Code 
segments 
may never be written 
into. 


NOTE: 
Code segments 
may be modified 
via aliases. 
Alias- 
es are writeable 
data 
segments 
which 
occupy 
the 
same 
range 
of linear 
address 
space 
as the 
code 
segment. 


The D bit indicates 
the default 
length 
for operands 
and effective 
addresses. 
If 0= 
1 then 
32-bit 
oper- 
ands and 32-bit addressing 
modes 
are assumed. 
If 
0=0 
then 
16-bit 
operands 
and 
16-bit 
addressing 
modes 
are assumed. 
Therefore 
all existing 
80286 
code segments 
will execute 
on the Intel486 
SX mi- 
croprocessor/Intel 
OverDrive 
Processor 
assuming 
the 0 bit is set O. 


Another 
attribute 
of code segments 
is determined 
by 
the conforming 
C bit. Conforming 
segments, 
C = 1, 


can be executed 
and shared 
by programs 
at differ- 
ent privilege 
levels. 
(See Section 
4.4 Protection.) 


Segments 
identified 
as data segments 
(E = 0, S = 1) 
are used for two types of Intel486 
SX microproces- 
sor/Intel 
OverDrive 
Processor 
segments: 
stack 
and 
data 
segments. 
The 
expansion 
direction 
(ED) 
bit 
specifies 
if a segment 
expands 
downward 
(stack) or 
upward 
(data). 
If a segment 
is a stack 
segment 
all 


offsets 
must be greater than the segment 
limit. On a 
data segment 
all offsets 
must be less than or equal 
to the limit. In other words, 
stack 
segments 
start at 
the base linear address 
plus the maximum 
segment 
limit and grow down to the base linear address 
plus 
the limit. On the other 
hand, data segments 
start at 
the base linear address 
and expand 
to the base lin- 
ear address 
plus limit. 


The write W bit controls the ability to write into a 
segment. Data segments are read-only if W= O. The 
stack segment must have W= 1. 


The B bit controls the size of the stack pointer regis- 
ter. If B= 1, then PUSHes, POPs, and CALLs all use 
the 32-bit ESP register for stack references and as- 
sume an upper limit of FFFFFFFFH. If B= 0, stack 
instructions all use the 16-bit SP register and as- 
sume an upper limit of FFFFH. 


4.3.4.3 System 
Descriptor 
Formats 


System segments describe information about oper- 
ating system tables, tasks, and gates. Figure 4.7 
shows the general format of system segment de- 
scriptors, and the various types of system segments. 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor system descriptors contain a 32-bit base linear 
address and a 20-bit segment limit. 80286 system 
descriptors have a 24-bit base address and a 16-bit 
segment limit. 80286 system descriptors are identi- 
fied by the upper 16 bits being all zero. 


4.3.4.4 LDT Descriptors 
(S = 0, TYPE = 2) 


LDT descriptors (S=0, TYPE=2) contain informa- 
tion about Local Descriptor Tables. LDTs contain a 
table of segment descriptors, unique to a particular 
task. Since the instruction to load the LDTR is only 
available at privilege level 0, the DPL field is ignored. 
LDT descriptors are only allowed in the Global De- 
scriptor Table (GDT). 
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SEGMENT BASE 15 ... 0 


LIMIT 
19".16 


Type 
o 
1 
2 
3 
4 
5 


Defines 
Invalid 
Available 80286 TSS 
LDT 
Busy 80286 TSS 
80286 Call Gale 
Task Gale (for 80286, 
Inlel486 SX Microprocessorllnlel 
OverDrive 
Processor Task) 
80286 Inlerrupl Gale 
80286 Trap Gale 


BASE 
23" . 16 


Dellnes 
Invalid 
Available Inlel486 SX Microprocessorllnlel 
OverDrive Processor TSS 
Undefined (Inlel Reserved) 
Busy Inlel486 SX Microprocessor/lnlel 
OverDrive Processor TSS 
Inlel486 SX Microprocessor/lnlel 
OverDrive Processor Call Gale 
Undefined (Inlel Reserved) 
Inlel486 SX Microprocessor/lnlel 
OverDrive Processor 
Inlel486 SX Microprocessor/lnlel 
OverDrive Processor 


Figure 4.8 shows the format of the four types of gate 
descriptors. Call gates are primarily used to transfer 
program control to a more privileged level. The call 
gate descriptor consists of three fields: the access 
byte, a long pointer (selector and offset) which 
points to the start of a routine and a word count 
which specifies how many parameters are to be cop- 
ied from the caller's stack to the stack of the called 
routine. The word count field is only used by call 
gates when there is a change in the privilege level, 
other types of gates ignore the word count field. 


Interrupt and trap gates use the destination selector 
and destination offset fields of the gate descriptor as 
a pointer to the start of the interrupt or trap handler 
routines. The difference between interrupt gates and 
trap gates is that the interrupt gate disables inter- 
rupts (resets the IF bit) while the trap gate does not. 


Task gates are used to switch tasks. Task gates 
may only refer to a task state segment (see Section 
4.4.6 Task Switching) therefore only the destination 
selector portion of a task gate descriptor is used, 
and the destination offset is ignored. 


Exception 13 is generated when a destination selec- 
tor does not refer to a correct descriptor type, i.e., a 
code segment for an interrupt, trap or call gate, a 
TSS for a task gate. 


The access byte format is the same for all gate de- 
scriptors. P= 1 indicates that the gate contents are 


valid. P= 0 indicates the contents are not valid and 
causes exception 11 if referenced. DPL is the de- 
scriptor privilege level and specifies when this de- 
scriptor may be used by a task (see Section 4.4 Pro- 
tection). The S field, bit 4 of the access rights byte, 
must be 0 to indicate a system control descriptor. 
The type field specifies the descriptor type as indi- 
c~d 
in Figure 4.8. 


4.3.4.7 Differences 
between InteI486T11SX 
Mlcroproce •• or lintel 
OverDrive 
Proceaor 
and 80286 Descriptors 


In order to provide operating system compatibility 
between the 80286 and Intel486 SX microproces- 
sorllntel OverDrive Processor, the Intel486 SX mi- 
croprocessorllntel OverDrive Processor supports all 
of the 80286 segment descriptors. Figure 4.9 shows 
the general format of an 80286 system segment de- 
scriptor. The only differences between 80286 and 
Intel486 SX microprocessorlintel OverDrive Proces- 
sor descriptor formats are that the values of the type 
fields, and the limit and base address fields have 
been expanded for the Intel486 SX microprocessorl 
Intel OverDrive Processor. The 80286 system seg- 
ment descriptors contained a 24-bit base address 
and 16-bit limit, while the Intel486 SX microproces- 
sorllntel OverDrive Processor system segment de- 
scriptors have a 32-bit base address, a 20-bit limit 
field, and a granularity bit. 


SELECTOR 
OFFSET 15 ... 0 


WORD 
OFFSET 31 ... 16 
P 
DPL 
0 
TYPE 
0 
0 
0 
COUNT 


I 
4 ... 0 


Value 
Deacrlptlon 
4 
80286call gate 
5 
Task gate (for 80286or Intel486SX microprocessor/IntelOverOriveProcessortask) 


6 
80286interruptgate 
7 
80286trap gale 
C 
Inlel486SX microprocessor/lnlelOverOriveProcessorcall gate 
E 
Inte1488SX microprocessor/IntelOverDriveProcessorinterruptgate 
F 
Inlel486SX microprocessor/IntelOverDriveProcessortrap gate 
o 
Descriptorcontentsare not valid 
1 
Descriptorcontentsare valid 


DPL~east privilegedlevelat whicha task mayaccessthe gate.WORDCOUNT0-31-the 
numberof parametersto copyfromcaller'sstack 
10 the calledprocedure'sstack.The parametersare32-bitquantitiesfor Intel486SXmicroprocessor/IntelOverOriveProcessorgates,and 16- 
bit quantitiesfor 80286gates. 


DESTINATION 
16-bit 
SELECTOR 
selector 
Selectorto the targetcode segment 
or 
Selectorto the targettask state segmentfor task gate 


offset 
Entrypoint withinthe target code segment 
16-bit80286 
32-bitIntel486SX microprocessor/IntelOverDriveProcessor 
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By supporting 80286 system segments the Intel486 
SX microprocessor/Intel 
OverDrive Processor is 
able to execute 80286 application programs on an 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor operating system. This is possible because the 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor automatically understands which descriptors are 
80286-style descriptors and which descriptors are 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor-style descriptors. In particular, if the upper word 
of a descriptor is zero, then that descriptor is a 
80286-style descriptor. 


The only other differences between 80286-style de- 
scriptors 
and 
Intel486 
SX 
microprocessor/Intel 
OverDrive Processor descriptors is the interpretation 
of the word count field of call gates and the B bit. 
The word count field specifies the number of 16-bit 
quantities to copy for 80286 call gates and 32-bit 
quantities 
for 
Intel486 
SX 
microprocessor/Intel 
OverDrive Processor call gates. The B bit controls 
the size of PUSHes when using a call gate; if B=0 
PUSHes are 16 bits, if B= 1 PUSHes are 32 bits. 


4.3.4.8 Selector 
Fields 


A selector in Protected Mode has three fields: Local 
or Global Descriptor Table Indicator (TI), Descriptor 
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SEGMENT BASE 15 ... 0 


Intel Reserved 
Set to 0 


BASE 
LIMIT 
P 


Base Address of the segment 
The length of the segment 
Present Bit 
1~ Present 
0 ~ Not Present 


Entry Index (Index), and Requestor (the selector's) 
Privilege Level (RPL) as shown in Figure 4.10. The 
TI bits select one of two memory-based tables of 
descriptors (the Global Descriptor Table or the Local 
Descriptor Table). The Index selects one of 8K de- 
scriptors in the appropriate descriptor table. The 
RPL bits allow high speed testing of the selector's 
privilege attributes. 


4.3.4.9 Segment 
Descriptor 
Cache 


In addition to the selector value, every segment reg- 
ister has a segment descriptor cache register asso- 
ciated with it. Whenever a segment register's con- 
tents are changed, the 8-byte descriptor associated 
with that selector is automatically loaded (cached) 
on the chip. Once loaded, all references to that seg- • 
ment use the cached descriptor information instead 
of reaccessing the descriptor. The contents of the 
descriptor cache are not visible to the programmer. 
Since descriptor caches only change when a seg- 
ment register is changed, programs which modify 
the descriptor tables must reload the appropriate 
segment registers after changing a descriptor's 
value. 


o 


BASE 
+4 
23,..16 


Oescriptor Privilege Level 0-3 
System Oescriptor 
0 ~ System 
1= User 
Type of Segment 


6 


5 
~ 
3:~~~~#( 


2 


. •. 


NULL 


LOCAL 


DESCRIPTOR 


TABLE 


GLOBAL 


DESCRIPTOR 


TABLE 


Figure 4.10. Example 
Descriptor 
Selection 


4.11. For compatibility with the 8086 architecture, 
the base is set to sixteen times the current selector 
value, the limit is fixed at OOOOFFFFH,and the attri- 
butes are fixed so as to indicate the segment is pres- 
ent and fully usable. In Real Address Mode, the in- 
ternal "privilege level" is always fixed to the highest 
level, level 0, so I/O and other privileged opcodes 
may be executed. 


The contents of the segment descriptor cache vary 
depending on the mode the Intel486 SX microproc- 
essor/Intel 
OverDrive Processor is operating in. 


When operating in Real Address Mode, the segment 
base, limit, and other attributes within the segment 
cache registers are defined as shown in Figure 


32 - 
BIT LIMIT 


(fiXED) 


OTHER 
ATTRIBUTES 


(fiXED) 
32- BIT BASE 


(UPDATED 
DURING 
SELECTOR 
LOAD 
INTO 
SEGMENT 
REGISTER) 


CONfORMING 
PR~V~LEGE---------------------.., 


STACK 
SIZE 
---------------------, 
EXECUTABLE 
--------------------, 
WRITEABLE 
-------------------, 
READABLE 
------------------, 


EXPANSION 
DIRECTION ----------------1 
GRANULARITY 
I 
ACCESSED 
1 


~:~V~i~~E_L:~E~ 
~A-~-~-_-_-_-_-_-_-_-_-_-_-_-_-L,-~-,!-_-_-_-~-J 
__ 
_ 
__ 


CS 
16X 
CURRENT 
CS SELECTOR· 
OOOOffffH 
Y 
0 
Y 
B 
U 
Y 
Y 
Y 
- 
N 


SS 
16X 
CURRENT 
SS SELECTOR 
OOOOffffH 
Y 
0 
Y 
B 
U 
Y 
Y 
N 
W 
- 


DS 
16X 
CURRENT 
DS SELECTOR 
OOOOffffH 
Y 
0 
Y 
B 
U 
Y 
Y 
N - - 


ES 
16X 
CURRENT 
ES SELECTOR 
OOOOffffH 
Y 
0 
Y 
B 
U 
Y 
Y 
N - - 


fS 
16X 
CURRENT 
fS 
SELECTOR 
OOOOffffH 
Y 
0 
Y 
B 
U 
Y 
Y 
N - - 


GS 
16X 
CURRENT 
GS SELECTOR 
OOOOffffH 
Y 
0 
y 
B 
U 
y 
y 
N - - 


'Except 
the 
32·bit 
CS 
base 
is initialized 
to 
FFFFFOOOH 
after 
reset 
until 
first 
intersegment 
control 
transfer 
(i.e., 
intersegment 
CALL, 
or 
intersegment 
JMP, 
or INT). 
(See 
Figure 
4.13 
Example.) 
Key: 
y 
~ 
yes 
N 
~ 
no 
o 
~ 
privilege 
level 
0 
1 
~ 
privilege 
level 
1 
2 
= privilege 
level 
2 


3 
~ 
privilege 
level 
3 
U 
= expand 
up 


o 
= expand 
down 
B 
= byte granularity 
P 
= page granularity 
W 
~ 
push/pop 
16-bit 
words 
F 
= push/pop 
32·bit 
dwords 
- 
= does 
not apply 
to that segment 
cache 
register 


Figure 4.11. Segment Descriptor Caches for Real Address Mode 
(Segment Limit and Attributes 
are Fixed) 


When operating in Protected Mode, the segment 
base, limit, and other attributes within the segment 
cache registers are defined as shown in Figure 4.12. 
In Protected Mode, each of these fields are defined 


according to the contents of the segment descriptor 
indexed by the selector value loaded into the seg- 
ment register. 
• 
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32 - BIT BASE 


(UPDATED 
DURING 
SELECTOR LOAD INTO 
SEGWENT REGISTER) 


CONrORWING 
PRIVILEGE ----------------------, 
STACK SIZE 
---------------------, 
EXECUTABLE 
--------------------, 
WRITEABLE 
-------------------, 
READABLE 
------------------, 


EXPANSION 
DIRECTION ----------------1 
GRANULARITY 
1 
ACCESSED 
1 


~:~~~~~E 
_L:~E~ 
!~s: 
_-_-_-_-_-_-_-_-_-_-_-;-I~-I!-_-_-_-_-_-_-l-_ t____ 


32 - BIT LIWIT 


(UPDATED 
DURING 
SELECTOR LOAD INTO 
SEGWENT REGISTER) 


OTHER ATIRIBUTES 


(UPDATED 
DURING 
SELECTOR LOAD INTO 
SEGWENT REGISTER) 


CS 
BASE PER SEG DESCR 
LIWIT PER SEG DESCR 
P 
d 
d 
d 
d 
d 
N 
Y 
- 
d 


SS 
BASE PER SEG DESCR 
LIWIT PER SEG DESCR 
P 
d 
d 
d 
d 
r 
w 
N d - 
DS 
BASE PER SEG DESCR 
LIWIT PER SEG DESCR 
P 
d 
d 
d 
d 
d 
d 
N - - 
ES 
BASE PER SEG DESCR 
LIWIT PER SEG DESCR 
P 
d 
d 
d 
d 
d 
d 
N - - 


rs 
BASE PER SEG DESCR 
LIWIT PER SEG DESCR 
P 
d 
d 
d 
d 
d 
d 
N - - 


GS 
BASE PER SEG DESCR 
L1WIT PER SEG DESCR 
P 
d 
d 
d 
d 
d 
d 
N - - 


Key: 
Y = fixed yes 
N ~ fixed no 
d = per segment descriptor 
p = per segment descriptor, descriptor must indicate "present" to avoid exception 11 
(exception 12 in case of SS) 
r = per segment descriptor, but descriptor must indicate "readable" to avoid exception 13 
(special case for SS) 
w ~ per segment descriptor, but descriptor must indicate "writable" to avoid exception 13 
(special case for SS) 
- = does not apply to that segment cache register 


When operating in a Virtual 8086 Mode within the 
Protected Mode, the segment base, limit, and other 
attributes within the segment cache registers are de- 
fined as shown in Figure 4.13. For compatibility with 
the 8086 architecture, the base is set to sixteen 
times the current selector value, the limit is fixed at 


OOOOFFFFH,and the attributes are fixed so as to 
indicate the segment is present and fully usable. The 
virtual program executes at lowest privilege level, 
level 3, to allow trapping of all IOPL-sensitive in- 
structions and level-O-onlyinstructions. 
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(UPDATEDDURINGSELECTOR 
(fiXED) 
(fiXED) 
LOADINTOSEGWENTREGISTER) 


CONfORWINGPRIVILEGE 
---------------------..., 
STACKSIZE------------------------..., 
EXECUTABLE------------...,.----------., 
WRITEABLE 
---------------------...., 
READABLE---------------------, 
EXPANSIONDIRECTION-----------------l 
GRANULARITY 
1 
ACCESSED 
1 


~:~~i~~E_L~E~-_-_-_-B_-A-:E-_-_-_-_-_-_-_-_-_-_-_-_-:-I~-I~-_-_-_-~-J 
_ 


CS 
16X CURRENTCSSELECTOR 
OOOOHHH 
y 
3 
Y 
B U 
Y 
Y 
y 
- 
N 


SS 
16X CURRENTSS SELECTOR 
OOOOffffH 
y 
3 
y 
B U y 
y 
N W - 
DS 
16X CURRENTDSSELECTOR 
OOOOffHH 
Y 
3 
Y 
B U y 
y 
N 


ES 
16X CURRENTESSELECTOR 
OOOOffffH 
y 
3 
y 
B U y 
y 
N 
fS 
16X CURRENTFSSELECTOR 
OOOOffHH 
Y 
3 
y 
B U y 
y 
N 
GS 
16X CURRENTGSSELECTOR 
OOOOfffFH 
y 
3 
y 
B U 
Y 
Y 
N 


Key: 
Y 
= yes 
N 
= no 
o 
~ 
privilege 
level 
0 


1 
= privilege 
level 
1 


2 
= privilege 
level 
2 
3 
= privilege 
level 
3 
U 
~ 
expand 
up 


o 
~ 
expand 
down 
B 
= byte granularity 
p 
= page 
granularity 
W 
= push/pop 
16-bit 
words 
F 
~ 
push/pop 
32-bit 
dwords 
- 
= does 
not apply 
to that segment 
cache 
register 


Figure 4.13. Segment Descriptor Caches for Virtual 8086 Mode within Protected Mode 
(Segment Limit and Attributes 
are Fixed) 
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The Intel486 SX microprocessor/Intel 
OverDrive 
Processor has four levels of protection which are 
optimized to support the needs of a multi-tasking op- 
erating system to isolate and protect user programs 
from each other and the operating system. The privi- 
lege levels control the use of privileged instructions, 
I/O instructions, and access to segments and seg- 
ment descriptors. Unlike traditional microprocessor- 
based systems where this protection is achieved 
only through ~heuse of complex external hardware 
and software the Intel486 SX microprocessor/Intel 
OverDrive Processor provides the protection as part 
of its integrated Memory Management Unit. The In- 
tel486 SX microprocessor/lntel 
OverDrive Proces- 
sor offers an additional type of protection on a page 
basis, when paging is enabled (See Section 4.5.3 
Page Level Protection). 


The four-level hierarchical privilege system is illus- 
trated in Figure 4-14. It is an extension of the user/ 
supervisor privilege mode commonly used by mini- 
computers and, in fact, the user/supervisor mode is 


• 


fully supported by the Intel486 SX microprocessor/ 
Intel OverDrive Processor paging mechanism. The 
privilege levels (PL) are numbered 0 through 3. Lev- 
el 0 is the most privileged or trusted level. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor controls access to both data and proce- 
dures between levels of a task, according to the fol- 
lowing rules. 
• Data stored in a segment with privilege level p can 
be accessed only by code executing at a privilege 
level at least as privileged as p. 
• A code segment/procedure with privilege level p 
can only be called by a task executing at the same 
or a lesser privilege level than p. 


4.4.3.1 Task Privilege 


At any point in time, a task on the Intel486 SX micro- 
processor/lntel 
OverDrive Processor always exe- 
cutes at one of the four privilege levels. The Current 
Privilege Level (CPL) specifies the task's privilege 
level. A task's CPL may only be changed by control 
transfers through gate descriptors to a code seg- 
ment with a different privilege level. (See Section 
4.4.4 Privilege Level Transfers.) Thus, an applica- 
tion program running at PL = 3 may call an operat- 
ing system routine at PL = 1 (via a gate) which 
would cause the task's CPL to be set to 1 until the 
operating system routine was finished. 


4.4.3.2 Selector Privilege (RPL) 


The privilege level of a selector is specified by the 
RPL field. The RPL is the two least significant bits of 
the selector. The selector's RPL is only used to es- 
tablish a less trusted privilege level than the current 
privilege level for the use of a segment. This level is 
called the task's effective privilege level (EPL). The 
EPL is defined as being the least privileged (i.e. nu- 
merically larger) level of a task's CPL and a selec- 
tor's RPL. Thus, if selector's RPL = 0 then the CPL 
always specifies the privilege level for making an ac- 
cess using the selector. On the other hand if RPL = 


3 then a selector can only access segments at level 
3 regardless of the task's CPL. The RPL is most 
commonly used to verify that pointers passed to an 
operating system procedure do not access data that 
is of higher privilege than the procedure that origi- 
nated the pointer. Since the originator of a selector 
can specify any RPL value, the Adjust RPL (ARPL) 
instruction is provided to force the RPL bits to the 
originator's CPL. 


4.4.3.3 110 Privilege and 1/0 Permission Bltmap 


The I/O privilege level (IOPL, a 2-bit field in the 
EFLAG register) defines the least privileged level at 
which I/O instructions can be unconditionally per- 
formed. I/O instructions can be unconditionally per- 
formed when CPL :<;; IOPL. (The I/O instructions are 
IN, OUT, INS, OUTS, REP INS, and REP OUTS.) 
When CPL > 10PL,and the current task is associat- 
ed with a 286 TSS, attempted I/O instructions cause 
an exception 13 fault. When CPL > 10PL, and the 
current task is associated with an Intel486 SX micro- 
processor/lntel 
OverDrive Processor TSS, the I/O 
Permission Bitmap (part of an Intel486 SX micro- 
processor/lntel 
OverDrive Processor TSS) is con- 
sulted on whether I/O to the port is allowed, or an 
exception 13 fault is to be generated instead. For 
diagrams of the I/O Permission Bitmap, refer to Fig- 
ures 4.15a and 4.15b. For further information on how 
the I/O 
Permission Bitmap is used in Protected 
Mode or in Virtual 8086 Mode, refer to Section 4.6.4 
Protection and I/O Permission Bitmap. 


The I/O privilege level (IOPL) also affects whether 
several other instructions can be executed or cause 
an exception 13 fault instead. These instructions are 
called "IOPL-sensitive" 
instructions and they are 
CLI and STI. (Note that the LOCK prefix is not 10PL- 
sensitive on the Intel486 SX microprocessor/lntel 
OverDrive Processor.) 


The 10PL also affects whether the IF (interrupts en- 
able flag) bit can be changed by loading a value into 
the EFLAGS register. When CPI- :<;; IOPL, then the 
IF bit can be changed by loading a new value into 
the EFLAGS register. When CPL > 10PL,the IF bit 
cannot be changed by a new value POP'ed into (or 
otherwise loaded into) the EFLAGS register; the IF 
bit merely remains unchanged and no exception is 
generated. 
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Instruction 
Operands 
Function 


ARPL 
Selector, 
Adjust Requested 
Register 
Privilege Level: adjusts 
the RPL of the selector 
to the numeric 
maximum of current 
selector RPL value and 
the RPL value in the 
register. Set zero flag if 
selector RPL was 
changed. 


VERR 
Selector 
VERify for Read: sets 
the zero flag if the 
segment referred to by 
the selector can be 
read. 


VERW 
Selector 
VERify for Write: sets 
the zero flag if the 
segment referred to by 
the selector can be 
written. 


LSL 
Register, 
Load Segment Limit: 
Selector 
reads the segment limit 
into the register if 
privilege rules and 
descriptor type allow. 
Set zero flag if 
successful. 


LAR 
Register, 
Load Access Rights: 
Selector 
reads the descriptor 
access rights byte into 
the register if privilege 
rules allow. Set zero 
flag if successful. 


4.4.3.4 Privilege Validation 


The Intel486 SX microprocessor/Intel 
OverDrive 


Processor provides several instructions to speed 
pointer testing and help maintain system integrity by 
verifying that the selector value refers to an appro- 
priate segment. Table 4.2 summarizes the selector 
validation procedures available for the Intel486 SX 
microprocessor/Intel OverDrive Processor. 


This pointer verification prevents the common prob- 
lem of an application at PL = 3 calling a operating 
systems routine at PL = 0 and passing the operat- 
ing system routine a "bad" pointer which corrupts a 


data structure belonging to the operating system. If 
the operating system routine uses the ARPL instruc- 
tion to ensure that the RPL of the selector has no 
greater privilege than that of the caller, then this 
problem can be avoided. 


4.4.3.5 Descriptor Access 


There are basically two types of segment accesses: 
those involving code segments such as control 
transfers, and those involving data accesses. Deter- 
mining the ability of a task to access a segment in- 
volves the type of segment to be accessed, the in- 
struction used, the type of descriptor used and CPL, 
RPL, and DPL as described above. 


Any time an instruction loads data segment registers 2 
(OS, ES, FS, GS) the Intel486 SX microprocessor/ 
Intel OverDrive Processor makes protection valida- 
tion checks. Selectors loaded in the OS, ES, FS, GS 
registers must refer only to data segments or read- 
able code segments. The data access rules are 
specified in Section 4.4.2 Rules of Privilege. The 
only exception to those rules is readable conforming 
code segments which can be accessed at any privi- 
lege level. 


Finally the privilege validation checks are performed. 
The CPL is compared to the EPL and if the EPL is 
more privileged than the CPL an exception 13 (gen- 
eral protection fault) is generated. 


The rules regarding the stack segment are slightly 
different than those involving data segments. In- 
structions that load selectors into SS must refer to 
data segment descriptors for writeable data seg- 
ments. The DPL and RPL must equal the CPL. All 
other descriptor types or a privilege level violation 
will cause exception 13. A stack not present fault 
causes exception 12. Note that an exception 11 is 
used for a not-present code or data segment. 


Inter-segment control transfers occur when a selec- 
tor is loaded in the CS register. For a typical system 
most of these transfers are simply the result of a call 
or a jump to another routine. There are five types of 
control transfers which are summarized in Table 4.3. 
Many of these transfers result in a privilege level 
transfer. Changing privilege levels is done only via 
control transfers, by using gates, task switches, and 
interrupt or trap gates. 
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Control Transfer 
Types 
Operation 
Types 
Descriptor 
Descriptor 
Referenced 
Table 


Intersegment within the same privilege level 
JMP, CALL, RET, IREP 
Code Segment 
GOT/LOT 


Intersegment to the same or higher privilege level 
CALL 
Call Gate 
GOT/LOT 
Interrupt within task may change CPL 
Interrupt Instruction, 
Trap or 
lOT 
Exception, External 
Interrupt 
Interrupt 
Gate 


Intersegment to a lower privilege level 
RET,IREP 
Code Segment 
GOT/LOT 
(changes task CPL) 


CALL,JMP 
Task State 
GOT 
Segment 


Task Switch 
CALL, JMP 
Task Gate 
GOT/LOT 


IREP' 
Task Gate 
lOT 
Interrupt Instruction, 
Exception, External 
Interrupt 


'NT (Nested Task bit of flag register) = 0 
"NT 
(Nested Task bit of flag register) = 1 


Control transfers can only occur if the operation 
which loaded the selector references the correct de- 
scriptor type. Any violation of these descriptor usage 
rules will cause an exception 13 (e.g. JMP through a 
call gate, or IRET from a normal subroutine call). 


In order to provide further system security, all control 
transfers are also subject to the privilege rules. 


The privilege 
rules require 
that: 
- 
Privilege level transitions can only occur via 
gates. 
- 
JMPs can be made to a non-conforming code 
segment with the same privilege or to a conform- 
ing code segment with greater or equal privilege. 
- 
CALLs can be made to a non-conforming code 
segment with the same privilege or via a gate to a 
more privileged level. 


- 
Interrupts handled within the task obey the same 
privilege rules as CALLs. 


- 
Conforming Code segments are accessible by 
privilege levels which are the same or less privi- 
leged than the conforming-code segment's OPL. 


- 
Both the requested privilege level (RPL) in the 
selector pointing to the gate and the task's CPL 
must be of equal or greater privilege than the 
gate's OPL. 


- 
The code segment selected in the gate must be 
the same or more privileged than the task's CPL. 


- 
Return instructions that do not switch tasks can 
only return control to a code segment with same 
or less privilege. 


- 
Task switches can be performed by a CALL, 
JMP, or INT which references either a task gate 
or task state segment who's OPL is less privi- 
leged or the same privilege as the old task's CPL. 


Any control transfer that changes CPL within a task 
causes a change of stacks as a result of the privi- 
lege level change. The initial values of SS:ESP for 
privilege levels 0, 1, and 2 are retained in the task 
state segment (see Section 4.4.6 Task Switching). 
During a JMP or CALL control transfer, the new 
stack pointer is loaded into the SS and ESP regis- 
ters and the previous stack pointer is pushed onto 
the new stack. 


When RETurning to the original privilege level, use 
of the lower-privileged stack is restored as part of 
the RET or IRET instruction operation. For subrou- 
tine calls that pass parameters on the stack and 
cross privilege levels, a fixed number of words (as 
specified in the gate's word count field) are copied 
from the previous stack to the current stack. The 
inter-segment RET instruction with a stack adjust- 
ment value will correctly restore the previous stack 
pointer upon return. 
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Figure 4.15b. Sample I/O Permission Bit Map 


processor/Intel 
OverDrive Processor directly sup- 
ports this operation by providing a task switch in- 
struction in hardware. The Intel486 SX microproces- 
sor/Intel OverDrive Processor task switch operation 
saves the entire state of the machine (all of the reg· 
isters, address space, and a link to the previous 
task), loads a new execution state, performs protec- 
tion checks, and commences execution in the new 
task, in about 10 microseconds. Like transfer of con- 
trol via gates, the task switch operation is invoked by 
executing an inter-segment JMP or CALL instruction 
which refers to a Task State Segment (TSS), or a 
task gate descriptor in the GOT or LOT. An INT n 
instruction, exception, trap, or external interrupt may 
also invoke the task switch operation if there is a 
task gate descriptor in the associated lOT descriptor 
slot. 


Gates provide protected, indirect CALLs. One of the 
major uses of gates is to provide a secure method of 
privilege transfers within a task. Since the operating 
system defines all of the gates in a system, it can 
ensure that all gates only allow entry into a few trust· 
ed procedures (such as those which allocate memo- 
ry, or perform I/O). 


Gate descriptors follow the data access rules of priv- 
ilege; that is, gates can be accessed by a task if the 
EPL, is equal to or more privileged than the gate 
descriptor's DPL. Gates follow the control transfer 
rules of privilege and therefore may only transfer 
control to a more privileged level. 


Call Gates are accessed via a CALL instruction and 
are syntactically identical to calling a normal subrou- 
tine. When an inter-level Intel486 SX microproces- 
sor/Intel OverDrive Processor call gate is activated, 
the following actions occur. 
1. Load CS:EIP from gate check for validity 
2. SS is pushed zero-extended to 32 bits 
3. ESP is pushed 
4. Copy Word Count 32-bit parameters from the 
old stack to the new stack 
5. Push Return address on stack 


The procedure is identical for 80286 Call gates, ex- 
cept that 16·bit parameters are copied and 16-bit 
registers are pushed. 


Interrupt Gates and Trap gates work in a similar 
fashion as the call gates, except there is no copying 
of parameters. The only difference between Trap 
and Interrupt gates is that control transfers through 
an Interrupt gate disable further interrupts (Le.the IF 
bit is set to 0), and Trap gates leave the interrupt 
status unchanged. 


A very important attribute of any multi-tasking/multi- 
user operating system is its ability to rapidly switch 
between tasks or processes. The Intel486 SX micro- 


The TSS descriptor points to a segment (see Figure 
4.15) containing the entire Intel486 SX microproces- 
sor/Intel OverDrive Processor execution state while 
a task gate descriptor contains a TSS selector. The 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor supports both 80286 and Intel486 SX microproc- 
essor/Intel OverDrive Processor style TSSs. Figure 
4.16 shows a 80286 TSS. The limit of an Intel486 SX 
microprocessorlintel 
OverDrive 
Processor 
TSS 
must be greater than 0064H (002BH for a 80286 
TSS), and can be as large as 4 Gigabytes. In the 
additional TSS space, the operating system is free 
to store additional information such as the reason 
the task is inactive, time the task has spent running, 
and open files belong to the task. 


Each task must have a TSS associated with it. The 
current TSS is identified by a special register in the 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor called the Task State Segment Register (TR). 
This register contains a selector referring to the task 
state segment descriptor that defines the current 
TSS. A hidden base and limit register associated 
with TR are loaded whenever TR is loaded with a 
new selector. Returning from a task is accomplished 
by the IRET instruction. When IRET is executed, 
control is returned to the task which was interrupted. 
The current executing task's state is saved in the 
TSS and the old task state is restored from its TSS. 
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Several bits in the flag register and machine status 
word (CRO) give information about the state of a 
task which are useful to the operating system. The 
Nested Task (NT) (bit 14 in EFLAGS) controls the 
function of the IRET instruction. If NT = 0, the IRET 
instruction performs the regular return; when NT = 
1, IRET performs a task switch operation back to the 
previous task. The NT bit is set or reset in the follow- 
ing fashion: 


15 
0 


BACK 
LINK 
SELECTOR 
TO TSS 


SP 
FOR 
CPL 
0 


SS 
FOR 
CPL 
0 


SP 
FOR 
CPL 
1 


SS 
FOR 
CPL 
1 


SP 
FOR 
CPL 
2 


SS 
FOR 
CPL 
2 


IP (ENTRY 
POINT) 


FLAGS 


AX 


CX 
ox 


BX 


SP 


BP 


SI 


01 


ES 
SELECTOR 


CS 
SELECTOR 


SS 
SELECTOR 


OS 
SELECTOR 


TASK'S 
LOT 
SELECTOR 


AVAILABLE 


4 


6 
INITIAL 


STACKS 
8 
FOR 
CPL 
0,1,2 


10 


12 


14 


16 


18 
CURRENT 
lA 
TASK 


IC 
STATE 


IE 


20 


22 


24 


28 


28 


2A 


When a CALL or INT instruction initiates a task 
switch, the new TSS will be marked busy and the 
back link field of the new TSS set to the old TSS 
selector. The NT bit of the new task is set by CALL 
or INT initiated task switches. An interrupt that does 
not cause a task switch will clear NT. (The NT bit will 
be restored after execution of the interrupt handler) 
NT may also be set or cleared by POPF or IRET 
instructions. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor task state segment is marked busy by 
changing the descriptor type field from TYPE 9H to 
TYPE BH. An 80286 TSS is marked busy by chang- 
ing the descriptor type field from TYPE 1 to TYPE 3. 
Use of a selector that references a busy task state 
segment causes an exception 13. 


The Virtual Mode (VM) bit 17 is used to indicate if a 
task, is a virtual 8086 task. If VM = 1, then the tasks 


will use the Real Mode addressing mechanism. The 
virtual 8086 environment is only entered and exited 
via a task switch (see Section 4.6 Virtual Mode). 


The FPU's state is not automatically saved when 
a task switch occurs, because the incoming task 
may not use the FPU. The Task Switched (TS) 
Bit (bit 3 in the CRO) helps deal with the FPU's 
state in a multitasking environment 
Whenever 
the Intel OverDrive Processor switches tasks, it 
sets the TS bit. The Intel OverDrive Processor 
detects the first use of a processor extension in- 
struction after a task switch and causes the proc- 
essor extension not available exception 7. The 
exception handler for exception 7 may then de- 
cide whether to save the state of the FPU. A 
processor extension not present exception (7) 
2 


will occur when attempting to execute a Floating 
Point or WAIT instruction if the Task Switched 
and Monitor coprocessor extension bits are both 
set (Le., TS = 1 and MP = 1). 


The T bit in the Intel486 SX microprocessor/lntel 
OverDrive Processor TSS indicates that the proces- 
sor 
should 
generate 
a debug 
exception 
when 
switching to a task. If T = 1 then upon entry to a 
new task a debug exception 1 will be generated. 


4.4.7 INITIALIZATION AND TRANSITION TO 
PROTECTED MODE 


Since the Intel486 SX microprocessor/Intel 
Over- 
Drive Processor begins executing in Real Mode im- 
mediately after RESET it is necessary to initialize the 
system tables and registers with the appropriate val- 
ues. 


The GOTand lOT registers must refer to a valid GOT 
and lOT. The lOT should be at least 256 bytes long, 
and GOT must contain descriptors for the initial 
code, and data segments. Figure 4.17 shows the ta- 
bles and Figure 4.18 the descriptors needed for a 
simple Protected Mode Intel486 SX microproces- 
sor/lntel OverDrive Processor system. It has a sin- 
gle code and single data/stack segment each four 
gigabytes long and a single privilege level PL = O. 


The actual method of enabling Protected Mode is to 
load CROwith the PE bit set, via the MOV CRO,R/M 
instruction. This puts the Intel486 SX microproces- 
sor/lntel OverDrive Processor in Protected Mode. 


After enabling Protected Mode, the next instruction 
should execute an intersegment JMP to load the CS 
register and flush the instruction decode queue. The 
final step is to load all of the data segment registers 
with the initial selector values. 
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Figure 4.18. GDT Descriptors 
for Simple System 


An alternate approach to entering Protected Mode 
the system designer an easy method of constructing 
which is especially appropriate for multi-tasking op- 
the data structures needed for a Protected Mode 
erating systems, is to use the built in task-switch to 
Intel486 SX microprocessor/Intel OverDrive Proces- 
load all of the registers. In this case the GOT would 
sor system. This tool is the builder BLD-386. BLD- 
contain two TSS descriptors in addition to the code 
386 lets the operating system writer specify all of the 
and data descriptors needed for the first task. The 
segment descriptors discussed in the previous sec- 


first JMP instruction in Protected Mode would jump 
tions (LOTs, lOTs, GDTs, Gates. and TSSs) in a 
to the TSS causing a task switch and loading all of 
•high-level language. 
the registers with the values stored in the TSS. The 
Task State Segment Register should be initialized to 
point to a valid TSS descriptor since a task switch 
4.5 Paging 
saves the state of the current task in a task state 
segment. 
4.5.1 PAGING 
CONCEPTS 


4.4.8 TOOLS 
FOR BUILDING 
PROTECTED 
SYSTEMS 


Paging is another type of memory management 
useful for virtual memory multitasking operating sys- 


In order to simplify the design of a protected multi- 
tasking system, Intel provides a tool which allows 


tems. Unlike segmentation which modularizes pro- 
grams and data into variable length segments, pag- 
ing divides programs into multiple uniform size 
pages. Pages bear no direct relation to the logical 
structure of a program. While segment selectors can 
be considered the logical "name" 
of a program 
module or data structure, a page most likely corre- 
sponds to only a portion of a module or data struc- 
ture. 


By taking advantage of the locality of reference dis- 
played by most programs, only a small number of 
pages from each active task need be in memory at 
anyone moment. 


4.5.2.1 Page Mechanism 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor uses two levels of tables to translate the 
linear address (from the segmentation unit) into a 
physical address. There are three components to 
the paging mechanism of the Intel486 SX microproc- 
essor/Intel OverDrive Processor: the page directory, 
the page tables, and the page itself (page frame). All 
memory-resident elements of the Intel486 SX micro- 
processor/lntel OverDrive Processor paging mecha- 
nism are the same size, namely, 4 Kbytes. A uniform 
size for all of the elements simplifies memory alloca- 


tion and reallocation schemes, since there is no 
problem with memory fragmentation. Figure 4.19 
shows how the paging mechanism works. 


4.5.2.2 Page Descriptor 
Base Register 


CR2 is the Page Fault Linear Address register. It 
holds the 32-bit linear address which caused the last 
page fault detected. 


CR3 is the Page Directory Physical Base Address 
Register. It contains the physical starting address of 
the Page Directory. The lower 12 bits of CR3 are 
always zero to ensure that the Page Directory is al- 
ways page aligned. Loading it via a MOV CR3, reg 
instruction causes the Page Table Entry cache to be 
flushed, as will a task switch through a TSS which 2 
changes 
the value of CRO.(See 4.5.5 Translation 
Lookaslde 
BUffer.) 


4.5.2.3 Page Directory 


The Page Directory is 4 Kbytes long and allows up to 
1024 Page Directory Entries. Each Page Directory 
Entry contains the address of the next level of ta- 
bles, the Page Tables and information about the 
page table. The contents of a Page Directory Entry 
are shown in Figure 4.20. The upper 10 bits of the 
linear address (A22-A31) are used as an index to 
select the correct Page Directory Entry. 
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Figure 4.21. Page Table Entry (Points to Page) 


The (User/Supervisor) U/S bit 2 and the (Read/ 
Write) R/W bit 1 are used to provide protection attri- 
butes for individual pages. 
Each Page Table is 4 Kbytes and holds up to 1024 
Page Table Entries. Page Table Entries contain the 
starting address of the page frame and statistical 
information about the page (see Figure 4.21). Ad- 
dress bits A12-A21 are used as an index to select 
one of the 1024 Page Table Entries. The 20 upper- 
bit page frame address is concatenated with the 
lower 12 bits of the linear address to form the physi- 
cal address. Page tables can be shared between 
tasks and swapped to disks. 


4.5.2.5 Page Directory/Table 
£ntrles 


The lower 12 bits of the Page Table Entries and 
Page Directory Entries contain statistical information 
about pages and page tables respectively. The P 
(Present) bit 0 indicates if a Page Directory or Page 
Table entry can be used in address translation. If 
P = 1 the entry can be used for address translation, 
if P = 0 the entry cannot be used for translation, 
and all of the other bits are available for use by the 
software. For example the remaining 31 bits could 
be used to indicate where on the disk the page is 
stored. 


The A (Accessed) bit 5, is set by the Intel486 SX 
microprocessor/Intel OverDrive Processor for both 
types of entries before a read or write access occurs 
to an address covered by the entry. The D (Dirty) bit 
6 is set to 1 before a write to an address covered by 
that page table entry occurs. The 0 bit is undefined 
for Page Directory Entries. When the P, A and 0 bits 
are updated by the Intel486 SX microprocessor/Intel 
OverDrive Processor, a Read-Modify-Write cycle is 
generated which locks the bus and prevents con- 
flicts with other processors or peripherals. Software 
which modifies these bits should use the LOCK pre- 
fix to ensure the integrity of the page tables in multi- 
master systems. 


The 3 bits marked OS Reserved 
in Figure 4.20 and 


Figure 4.21 (bits 9-11) are software definable. OSs 
are free to use these bits for whatever purpose they 
wish. An example use of the OS 
Reserved 
bits 


would be to store information about page aging. By 
keeping track of how long a page has been in mem- 
ory since being accessed, an operating system can 
implement a page replacement algorithm like Least 
Recently Used. 


4.5.3 PAGE 
LEVEL 
PROTECTION 
(R/W, U/S BITS) 


The Intel486 SX microprocessor/Intel 
OverDrive 


Processor provides a set of protection attributes for 
paging systems. The 
paging mechanism distin- 


guishes between two levels of protection: User 
which corresponds to level 3 of the segmentation 
based protection, and supervisor which encompass- 
es all of the other protection levels (0, 1, 2). 


The R/W and U/S bits are used in conjunction with 
the WP bit in the flags register (EFLAGS). The In- 
tel386 microprocessor does not contain the WP bit. 
The WP bit has been added to the Intel486 SX mi- 
croprocessor/Intel 
OverDrive Processor to protect 


read-only pages from supervisor write accesses. 
The Intel386 microprocessor allows a read-only 
page to be written from protection levels 0, 1 or 2. 
WP= 0 is the Intel386 microprocessor compatible 
mode. When WP= 0 the supervisor can write to a 
read-only page as defined by the U/S and R/W bits. 
When WP= 1 supervisor access to a read-only page 
(R/W=O) will cause a page fault (exception 14). 


Table 4.4 shows the affect of the WP, U/S and R/W 
bits on accessing memory. When WP=O, the super- 
visor can write to pages regardless of the state of 
the R/W bit. When WP= 1 and R/W = 0 the supervi- 
sor cannot write to a read-only page. A user attempt 
to access a supervisor only page (U/S = 0), or write 
to a read only page will cause a page fault (excep- 
tion 14). 


The R/W and U/S bits provide protection from user 
access on a page by page basis since the bits are 
contained in the Page Table Entry and the Page Di- 
rectory Table. The U/S and R/W bits in the first level 
Page Directory Table apply to all entries in the page 
table pointed to by that directory entry. The U/S and 
R/W bits in the second level Page Table Entry apply 
only to the page described by that entry. The most 
restrictive of the U/S and R/W bits from the Page 
Directory Table and the Page Table Entry are used 
to address a page. 


Example: If the U/S and R/W bits for the Page Di- 
rectory entry were 10 (user read/execute) and the 
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U/S and R/W 
bits for the Page Table Entry were 01 
(no 
user 
access 
at all), the 
access 
rights 
for 
the 
page 
would 
be 01, the 
numerically 
smaller 
of the 
two. 


Note that a given segment 
can be easily made read- 
only for level 0, 1 or 2 via use of segmented 
protec- 
tion mechanisms. 
(Section 
4.4 Protection.) 


4.5.4 PAGE 
CACHEABILITY 
(PWT 
AND 
PCD BITS) 


PWT (page write through) 
and PCD (page cache dis- 
able) are two new bits defined 
in entries 
in both lev- 
els of the page table 
structure, 
the Page Directory 
Table and the Page Table Entry. PCD and PWT con- 
trol page cacheability 
and write policy. 


PWT controls 
write policy. 
PWT = 1 defines 
a write- 
through 
policy for the current 
page. PWT = 0 allows 
the possibility 
of write-back. 
PWT is ignored 
internal- 
ly because 
the 
Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
has 
a write-through 
cache. 
PWT can 
be used 
to control 
the 
write 
policy 
of a 
second 
level cache. 


PCD controls 
cacheability. 
PCD=O 
enables 
caching 
in the 
on-chip 
cache. 
PCD alone 
does 
not enable 
caching, 
it must be conditioned 
by the KEN,* 
(cache 
enable) 
input signal and the state of the CD (cache 
disable 
bit) and NW (no write-through) 
bits in control 
register 
0 (CRO). When PCD = 1, caching 
is disabled 
regardless 
of the state of KEN '*, CD and NW. (See 
Section 
5.0, On-Chlp 
Cache.) 


The state of the PCD and PWT bits are driven out on 
the PCD and PWT pins during a memory 
access. 


The PWT and PCD bits for a bus cycle are obtained 
either from control 
register 
3 (CR3), the Page Direc- 
tory Entry or the Page Table Entry, depending 
on the 
type of cycle run. However, 
when paging is disabled 
(PG = 
0 in CRO) or for cycles which 
bypass 
paging 
(Le., I/O 
(input/output) 
references, 
INTR 
(interrupt 
request) 
and HALT 
cycles), 
the PCD and PWT bits 
of CR3 are ignored. 
The 
Intel486 
SX microproces- 
sor/Intel 
OverDrive 
Processor 
assumes 
PCD 
= 
0 
and PWT 
= 
0 and drives these 
values 
on the PCD 
and)'WT 
pins. 


When 
paging 
is enabled 
(PG = 1 in CRO), the 
bits 
from the page table entry are cached 
in the transla- 


. tion lookaside 
buffer 
(TLB), and are driven 
any time 
the page 
mapped 
by the TLB 
entry 
is referenced. 
For normal 
memory 
cycles 
run with paging enabled, 
the PWT and PCD bits are taken from the Page Ta- 
ble Entry. During TLB refresh 
cycles when the Page 
Directory 
and Page Table entries 
are read, the PWT 
and PCD bits must be obtained 
elsewhere. 
The bits 
are taken from CR3 when a Page Directory 
Entry is 
being read. The bits are taken from the Page Direc- 
tory Entry when the Page Table Entry is being updat- 
ed. 


The PCD or PWT bits in CR3 are initialized 
to zero at 
reset, 
but can be set to any value 
by level 
0 soft- 
ware. 


4.5.5 TRANSLATION 
LOOKASIDE 
BUFFER 


The 
Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
paging 
hardware 
is designed 
to support 
demand 
paged 
virtual 
memory 
systems. 
However, 


performance 
would 
degrade 
substantially 
if the 
In- 
tel486 
SX microprocessor/Intel 
OverDrive 
Proces- 
sor was required 
to access 
two levels 
of tables 
for 
every memory 
reference. 
To solve this problem, 
the 
Intel486 
SX microprocessor/Intel 
OverDrive 
Proces- 
sor keeps 
a cache 
of the 
most 
recently 
accessed 
pages. 
This cache 
is called 
the Translation 
Looka- 
side Buffer (TLB). The TLB is a four-way 
set associa- 
tive 
32-entry 
page 
table 
cache. 
It 
automatically 
keeps the most commonly 
used Page Table 
Entries 
in the 
Intel486 
SX microprocessor/Intel 
OverDrive 
Processor. 
The 
32-entry 
TLB 
coupled 
with 
a 4K 
page 
size, 
results 
in coverage 
of 
128 
Kbytes 
of 
memory 
addresses. 
For many common 
multi-tasking 
systems, 
the TLB will have a hit rate of about 98%. 
This means that the Intel486 
SX microprocessor/In- 
tel OverDrive 
Processor 
will only have to access the 
two-level 
page structure 
on 2% of all memory 
refer- 
ences. 
Figure 4.22 illustrates 
how the TLB comple- 
ments 
the 
Intel486 
SX microprocessor/Intel 
Over- 
Drive Processor's 
paging 
mechanism. 


Reading 
a new entry into the TLB (TLB refresh) 
is a 
two step process 
handled 
by the Intel486 
SX micro- 
processor/Intel 
OverDrive 
Processor 
hardware. 
The 
sequence 
of data cycles 
to perform 
a TLB 
refresh 
are: 


U/S 
R/W 
WP 
User Access 
Supervisor 
Access 


0 
0 
0 
None 
Read/Write/Execute 
0 
1 
0 
None 
Read/Write/Execute 
1 
0 
0 
Read/Execute 
Read/Write/Execute 
1 
1 
0 
Read/Write/ 
Execute 
Read/Write/Execute 
0 
0 
1 
None 
Read/Execute 
0 
1 
1 
None 
Read/Write/Execute 
1 
0 
1 
Read/Execute 
Read/Execute 
1 
1 
1 
Read/Write/Execute 
Read/Write/Execute 


1. Read the correct Page Directory Entry, as point- 
ed to by the page base register and the upper 
10 bits of the linear address. The page base reg- 
ister is in control register 3. 
1a. Optionally perform a locked read/write to set the 
accessed bit in the directory entry. The directory 
entry will actually get read twice if the Intel486 
SX microprocessor/Intel 
OverDrive Processor 
needs to set any of the bits in the entry. If the 
page directory entry changes between the first 
and second reads, the data returned for the sec- 
ond read will be used. 
2. 
Read the correct entry in the Page Table and 
place the entry in the TLB. 
2a. Optionally perform a locked read/write to set the 
accessed and/or dirty bit in the page table entry. 
Again, note that the page table entry will actually 
get read twice if the Intel486 SX microproces- 
sor/Intel OverDrive Processor needs to set any 
of the bits in the entry. Like the directory entry, if 
the data changes between the first and second 
read the data returned for the second read will 
be used. 


Note that the directory entry must always be read 
into the Intel486 SX microprocessor/Intel OverDrive 
Processor, since directory entries are never placed 
in the paging TLB. Page faults can be signaled from 
either the page directory read or the page table 
read. Page directory and page table entries may be 
placed 
in the 
Intel486 SX microprocessor/Intel 
OverDrive Processor on-chip cache just like normal 
data. 
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Figure 4.22. Translation 
Lookaslde 
Buffer 


The paging hardware operates in the following fash- 
ion. The paging unit hardware receives a 32·bit Iin· 
ear address from the segmentation unit. The upper 
20 linear address bits are compared with all 32 en· 
tries in the TLB to determine if there is a match. If 


there is a match (i.e., a TLB hit), then the 32-bit 
physical address is calculated and will be placed on 
the address bus. 


However, if the page table entry is not in the TLB, 
the 
Intel486 
SX microprocessor/Intel 
OverDrive 
Processor will read the appropriate Page Directory 
Entry. If P = 1 on the Page Directory Entry indicat- 
ing that the page table is in memory, then the In· 
tel486 SX microprocessor/Intel 
OverDrive Proces- 
sor will read the appropriate Page Table Entry and 
set the Access bit. If P = 1 on the Page Table Entry 
indicating that the page is in memory, the Intel486 
SX microprocessor/Intel 
OverDrive Processor will 
update the Access and Dirty bits as needed and 
fetch the operand. The upper 20 bits of the linear 
address, read from the page table, will be stored in 
the TLB for future accesses. However, if P = 0 for 
either the Page Directory Entry or the Page Table 
Entry, then the Intel486 SX microprocessor/Intel 
OverDrive Processor will generate a page fault, an 
Exception 14. 


The Intel486 SX microprocessor/lntal 
OverDrive 
Processor will also generate an exception 14 page 
fault if the memory reference violated the page pro- 
tection attributes (i.e., U/S or R/W) (e.g., trying to 
write to a read-only page). CR2 will hold the linear 
address which caused the page fault. If a second 
page fault occurs, while the Intel486 SX microproc- 
essor/Intel OverDrive Processor is attempting to en- 
ter the service routine for the first, then the Intel486 
SX microprocessor/Intel 
OverDrive Processor will 
invoke the page fault (exception 14) handler a sec· 
ond time, rather than the double fault (exception 8) 
handler. Since Exception 14 is classified as a fault, 
CS: EIP will point to the instruction causing the page 
fault. The 16-bit error code pushed as part of the 
page fault handler will contain status bits which indi- 
cate the cause of the page fault. 


The 16·bit error code is used by the operating sys- 
tem to determine how to handle the page fault. Fig- 
ure 4.23a shows the format of the page-fault error 
code and the interpretation of the bits. 


NOTE: 
Even though the bits in the error code (U/S, W/R, 
and P) have similar names as the bits in the Page 
Directory/Table Entries, the interpretation of the er- 
ror code bits is different. Figure 4.23b indicates 
what type of access caused the page fault. 


U 
U U U U U U U U U U U U U U 
W P 
S R 
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U/S: 
The 
U/S 
bit 
indicates 
whether 
the 
access 
causing 
the fault occurred 
when the Intel486 
SX mi- 
croprocessorllntel 
OverDrive 
Processor 
was 
exe- 
cuting 
in User 
Mode 
(U/S 
= 
1) or in Supervisor 
mode (U/S 
= 0). 


W/R: 
The 
W/R 
bit indicates 
whether 
the 
access 
causing 
the fault was a Read (W/R 
= 0) or a Write 
(W/R 
= 1). 


P: The 
P bit indicates 
whether 
a page 
fault 
was 
caused 
by a not-present 
page (P = 0), or by a page 
level protection 
violation 
(P = 1). 


U/S 
W/R 
Access 
Type 


0 
0 
Supervisor' 
Read 
0 
1 
Supervisor 
Write 
1 
0 
User Read 
1 
1 
User Write 


'Descriptor table access will fault with VIS 
= 0, aven if the program 
is executing at level 3. 


Figure 
4.23b. Type of Access 
Causing 
Page Fault 


The 
Intel486 
SX 
microprocessorllntel 
OverDrive 
Processor 
takes 
care of the page address 
transla- 
tion process, 
relieving 
the burden from an operating 
system 
in a demand-paged 
system. 
The operating 
system 
is responsible 
for setting 
up the initial page 
tables, 
and handling 
any page faults. 
The operating 
system 
also is required 
to invalidate 
(i.e., flush) the 
TLB when any changes 
are made to any of the page 
table 
entries. 
The 
operating 
system 
must 
reload 
CR3 to cause the TLB to be flushed. 


Setting 
up the tables 
is simply 
a matter 
of loading 
CR3 
with 
the 
address 
of the 
Page 
Directory, 
and 
allocating 
space 
for 
the 
Pa.\je Directory 
and 
the 
Page Tables. 
The primary 
responsibility 
of the oper- 
ating system 
is to implement 
a swapping 
policy and 
handle 
all of the page faults. 


A final concern 
of the operating 
system 
is to ensure 
that the TLB cache 
matches 
the information 
in the 
paging 
tables. 
In particular, 
any time the operating 
system 
sets the P present 
bit of page table entry to 
zero, the TLB 
must be flushed. 
Operating 
systems 
may want to take advantage 
of the fact that CR3 is 
stored 
as part of a TSS, to give every task or group 
of tasks its own set of page tables. 


4.6.1 EXECUTING 
8086 PROGRAMS 


The 
Intel486 
SX 
microprocessor 
lintel 
OverDrive 
Processor 
allows 
the 
execution 
of 
8086 
applic- 


ation programs 
in both Real Mode and in the Virtual 
8086 Mode (Virtual Mode). Of the two methods, 
Vir- 
tual 8086 Mode offers the system designer 
the most 
flexibility. 
The Virtual 
8086 
Mode allows 
the execu- 
tion of 8086 applications, 
while still allowing 
the sys- 
tem designer 
to take full advantage 
of the Intel486 
SX microprocessor 
lintel 
OverDrive 
Processor 
pro- 
tection 
mechanism. 
In particular, 
the Intel486 
SX mi- 
croprocessorllntel 
OverDrive 
Processor 
allows 
the 
simultaneous 
execution 
of 8086 operating 
systems 
and its applications, 
and an Intel486 
SX microproc- 
essor/Intel 
OverDrive 
Processor 
operating 
system 
and both 80286 and Intel486 
SX microprocessorlln- 
tel OverDrive 
Processor 
applications. 
Thus, in a mul- 
ti-user 
Intel486 
SX microprocessor/Intel 
OverDrive 
Processor 
computer, 
one 
person 
could 
be running 
an MS-DOS 
spreadsheet, 
another 
person 
using MS- 
DOS, and a third 
person 
could 
be running 
multiple 
Unix utilities 
and 
applications. 
Each 
person 
in this 
scenario 
would 
believe 
that 
he had the 
computer 
completely 
to 
himself. 
Figure 
4.24 
illustrates 
this 
concept. 


4.6.2 VIRTUAL 
8086 MODE 
ADDRESSING 
MECHANISM 


One of the major 
differences 
between 
Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
Real and 
Protected 
modes 
is how the segment 
selectors 
are 
interpreted. 
When the Intel486 
SX microprocessor/ 
Intel 
OverDrive 
Processor 
is executing 
in 
Virtual 
8086 
Mode 
the 
segment 
registers 
are used 
in an 
identical 
fashion 
to Real Mode. The contents 
of the 
segment 
register 
is shifted 
left 4 bits and added 
to 
the offset 
to form the segment 
base linear address. 


The 
Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
allows 
the 
operating 
system 
to specify 
which 
programs 
use the 8086 style address 
mecha- 
nism, and which 
programs 
use Protected 
Mode ad- 
dressing, 
on a per task 
basis. 
Through 
the 
use of 
paging, the one megabyte 
address 
space of the Vir- 
tual Mode task can be mapped 
to anywhere 
in the 
4 gigabyte 
linear address 
space 
of the Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor. 
Like 
Real 
Mode, 
Virtual 
Mode 
effective 
addresses 
(i.e., 
segment 
offsets) 
that exceed 
64 Kbyte will cause an 
exception 
13. However, 
these restrictions 
should not 
prove to be important, 
because 
most tasks 
running 
in Virtual 
8086 
Mode 
will 
simply 
be existing 
8086 
application 
programs. 


The paging 
hardware 
allows 
the concurrent 
running 
of multiple 
Virtual 
Mode tasks, and provides 
protec- 


tion 
and 
operating 
system 
isolation. 
Although 
it is 
not strictly 
necessary 
to have the paging 
hardware 
enabled 
to run Virtual 
Mode 
tasks, 
it is needed 
in 
order 
to run multiple 
Virtual 
Mode 
tasks 
or to relo- 


• 


cate the address space of a Virtual Mode task to 
physical address space greater than one megabyte. 


The paging hardware allows the 20-bit linear ad- 
dress produced by a Virtual Mode program to be 
divided into up to 256 pages. Each one of the pages 
can be located anywhere within the maximum 4 gig- 
abyte physical address space of the Intel486 SX mi- 
croprocessor/Intel OverDrive Processor. In addition, 
since CR3 (the Page Directory Base Register) is 
loaded by a task switch, each Virtual Mode task can 
use a different mapping scheme to map pages to 
different physical locations. Finally, the paging hard- 
ware allows the sharing of the 8086 operating sys- 
tem code between multiple 8086 applications. Fig- 
ure 4.24 shows how the Intel486 SX microproces- 
sor/Intel OverDrive Processor paging hardware en- 
ables multiple 8086 programs to run under a virtual 
memory demand paged system. 


4.6.4 PROTECTION 
AND 110 PERMISSION 
BITMAP 


All Virtual 8086 Mode programs execute at privilege 
level 3, the level of least privilege. As such, Virtual 
8086 Mode programs are subject to all of the protec- 
tion checks defined in Protected Mode. (This is dif- 
ferent from Real Mode which implicitly is executing 


at privilege level 0, the level of greatest privilege.) 
Thus, an attempt to execute a privileged instruction 
when in Virtual 8086 Mode will cause an exception 
13 fault. 


The following are privileged instructions, which may 
be executed only at Privilege Level O. Therefore, at- 
tempting to execute these instructions in Virtual 
8086 Mode (or anytime CPL > 0) causes an excep- 
tion 13 fault: 


LIDT; 
LGDT; 
LMSW; 
CLTS; 
HLT; 


MOV DRn,reg; 
MOV TRn,reg; 
MOV CRn,reg; 


MOV reg,DRn; 
MOV reg,TRn; 
MOV reg,CRn. 


Several instructions, particularly those applying to 
the multitasking model and protection model, are 
available only in Protected Mode. Therefore, at- 
tempting to execute the following instructions in 
Real Mode or in Virtual 8086 Mode generates an 
exception 6 fault: 


LTR; 
LLDT; 
LAR; 
LSL; 
ARPL. 


STR; 
SLDT; 
VERR; 
VERW; 


• 
TASK 1 
• 
8086 
OS 


""EWORY 
WEWORY 


f117J TASK 2 
~ 
Inte1386T"" CPU OS 


~ 
WEWORY 
~ 
WEWORY 


The instructions which are 10PL-sensitive in Protect- 
ed Mode are: 


IN; 
STI; 
OUT; 
CLI 
INS; 
OUTS; 
REP 
INS; 
REP 
OUTS; 


In Virtual 8086 Mode, a slightly different set of in- 
structions are made 10PL-sensitive.The following in- 
structions are 10PL-sensitive in Virtual 8086 Mode: 


INT 
n; 
STI; 
PUSHF; 
CLI; 
POPF; 
IRET 


The PUSHF, POPF, and IRET instructions are 10PL- 
sensitive in Virtual 8086 Mode only. This provision 
allows the IF flag (interrupt enable flag) to be virtual- 
ized to the Virtual 8086 Mode program. The INT n 
software interrupt instruction is also 10PL-sensitive 
in Virtual 8086 Mode. Note, however, that the INT 3 
(opcode OCCH),INTO, and BOUND instructions are 
not 10PL-sensitive in Virtual 8086 mode (they aren't 
10PL sensitive in Protected Mode either). 


Note that the I/O instructions (IN, OUT, INS, OUTS, 
REP INS, and REP OUTS) are not 10PL-sensitive in 
Virtual 8086 mode. Rather, the I/O instructions be- 
come automatically sensitive to the I/O Permission 
Bltmap contained in the Intel486 SX microproces- 
sor/Intel 
OverDrive 
Processor 
Task State Seg- 
ment. The I/O 
Permission Bitmap, automatically 
used by the Intel486 SX microprocessor/Intel Over- 
Drive Processor in Virtual 8086 Mode, is illustrated 
by Figures 4.15a and 4.15b. 


The I/O Permission Bitmap can be viewed as a 
0-64 Kbit bit string, which begins in memory at off- 
set BiLMap_Offset 
in the current TSS. BiL 
Map_Offset 
must be ~ DFFFH so the entire bit 
map and the byte FFI;l which follows the bit map are 
all at offsets ~ FFFFH from the TSS base. The 
16-bit pointer BiLMap_Offset 
(15:0) is found in the 
word beginning at offset 66H (102 decimal) from the 
TSS base, as shown in Figure 4.15a. 


Each bit in the I/O Permission Bitmap corresponds 
to a single byte-wide I/O port, as illustrated in Figure 
4.15a. If a bit is 0, I/O to the corresponding byte- 
wide port can occur without generating an excep- 
tion. Otherwise the I/O instruction causes an excep- 
tion 13 fault. Since every byte-wide I/O port must be 
protectable, all bits corresponding to a word-wide or 
dword-wide port must be 0 for the word-wide or 
dword-wide I/O to be permitted. If all the referenced 
bits are 0, the I/O will be allowed. If any referenced 
bits are 1, the attempted I/O will cause an exception 
13 fault. 


Due to the use of a pointer to the base of the I/O 
Permission Bitmap, the bitmap may be located any- 
where within the TSS, or may be ignored completely 
by pointing the BiLMap_Offset 
(15:0) beyond the 
limit of the TSS segment. In the same manner, only 
a small portion of the 64K I/O space need have an 
associated map bit, by adjusting the TSS limit to 
truncate the bitmap. This eliminates the commitment 
of 8K of memory when a complete bitmap is not 
required, while allowing the fully general case if de- 
sired. 


EXAMPLE OF BITMAP FOR I/O PORTS 0-255: 
Setting the TSS limit to (biLMap_Offset 
+ 31 
+ 1"1 
[" 
see note below) will allow a 32-byte bit- 
map for the I/O ports #0-255, 
plus a terminator 
byte of all 1's [" 
see note below). This allows the 
I/O bitmap to control I/O Permission to I/O port 
0-255 
while causing an exception 13 fault on at- 
tempted I/O to any I/O port 80256 through 65,565. 


"IMPORTANT 
IMPLEMENTATION NOTE: Beyond 
the last byte of I/O mapping information in the I/O 
Permission Bitmap must be a byte containing all 1's. 
The byte of all 1's must be within the limit of the 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor TSS segment (see Figure 4.15a). 


4.6.5 INTERRUPT HANDLING 


In order to fully support the emulation of an 8086 
machine, interrupts in Virtual 8086 Mode are han- 
dled in a unique fashion. When running in Virtual 
Mode all interrupts and exceptions involve a privi- 
lege change back to the host Intel486 SX microproc- 
essor/Intel OverDrive Processor operating system. 
The Intel486 SX microprocessor/Intel 
OverDrive 
Processor operating system determines if the inter- 
rupt comes from a Protected Mode application or 
from a Virtual Mode program by examining the VM 
bit in the EFLAGS image stored on the stack. 


When a Virtual Mode program is interrupted and ex- 
ecution passes to the interrupt routine at level 0, the 
VM bit is cleared. However, the VM bit is still set in 
the EFLAG image on the stack. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor operating system in turn handles the ex- 
ception or interrupt and then returns control to the 
8086 program. The Intel486 SX microprocessor/In- 
tel 
OverDrive 
Processor operating 
system 
may 
choose to let the 8086 operating system handle the 
interrupt or it may emulate the function of the inter- 
rupt handler. For example, many 8086 operating 
system calls are accessed by PUSHing parameters 
on the stack, and then executing an INT n instruc- 
tion. If the 10PL is set to 0 then all INT n instructions 
will be intercepted by the Intel486 SX microproces- 


sor/lntel 
OverDrive Processor operating system. 
The Intel486 SX microprocessor/Intel 
OverDrive 
Processor operating system could emulate the 8086 
operating system's call. Figure 4.25 shows how the 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor operating system could intercept an 8086 oper- 
ating system's call to "Open a File". 


An 
Intel486 
SX microprocessor/Intel 
OverDrive 
Processor operating system can provide a Virtual 
8086 Environment which is totally transparent to the 
application software via intercepting and then emu- 
lating 8086 operating system's calls, and intercept- 
ing IN and OUT instructions. 


4.6.6 ENTERING 
AND LEAVING 
VIRTUAL 
8086 MODE 


Virtual 8086 mode is entered by executing an IRET 
instruction (at CPL=0), or Task Switch (at any CPL) 
to an Intel486 SX microprocessor/Intel 
OverDrive 
Processor task whose Intel486 SX microprocessor/ 
Intel OverDrive Processor TSS has a FLAGS image 
containing a 1 in the VM bit position while the In- 
tel486 SX microprocessor/Intel 
OverDrive Proces- 
sor is executing in Protected Mode. That is, one way 
to enter Virtual 8086 mode is to switch to a task with 
an 
Intel486 
SX 
microprocessor/lntel 
OverDrive 
Processor TSS that has a 1 in the VM bit in the 
EFLAGS image. The other way is to execute a 32-bit 
IRET instruction at privilege level 0, where the stack 
has a 1 in the VM bit in the EFLAGS image. POPF 
does not affect the VM bit, even if the Intel486 SX 
microprocessor/Intel OverDrive Processor is in Pro- 
tected Mode or level 0, and so cannot be used to 
enter Virtual 8086 Mode. PUSHF always pushes a a 
in the VM bit, even if the Intel486 SX microproces- 
sor/Intel 
OverDrive Processor is in Virtual 8086 
Mode, so that a program cannot tell if it is executing 
in REAL mode, or in Virtual 8086 mode. 


The VM bit can be set by executing an IRET instruc- 
tion only at privilege level 0, or by any instruction or 
Interrupt which causes a task switch in Protected 
Mode (with VM = 1 in the new FLAGS image), and 
can be cleared only by an interrupt or exception in 
Virtual 8086 Mode. IRET and POPF instructions exe- 
cuted in REAL mode or Virtual 8086 mode will not 
change the value in the VM bit. 


The transition out of virtual 8086 mode to Intel486 
SX microprocessor/Intel 
OverDrive Processor pro- 
tected mode occurs only on receipt of an interrupt or 
exception (such as due to a sensitive instruction). In 
Virtual 8086 mode, all interrupts and exceptions vec- 
tor through the protected mode lOT, and enter an 
interrupt handler in protected Intel486 SX microproc- 
essor/Intel OverDrive Processor mode. That is, as 
part of interrupt processing, the VM bit is cleared. 


Because the matching IRET must occur from level 0, 
if an Interrupt or Trap Gate is used to field an inter- 
rupt or exception out of Virtual 8086 mode, the Gate 
must perform an inter-level interrupt only to level O. 
Interrupt or Trap Gates through conforming seg- 
ments, or through segments with DPL> 0, will raise a 
GP fault with the CS selector as the error code. 


4.6.6.1 Task Switches 
To/From 
Virtual 
8086 Mode 


Tasks which can execute in virtual 8086 mode must 
be described by a TSS with the new Intel486 SX 
microprocessor/lntel 
OverDrive Processor format 
(TYPE 9 or 11 descriptor). 


A task switch out of virtual 8086 mode will operate 
exactly the same as any other task switch out of a 
task with an Intel486 SX microprocessor/Intel Over- 
Drive Processor TSS. All of the programmer visible 
state, including the FLAGS register with the VM bit 
set to 1, is stored in the TSS. 


The segment registers in the TSS will contain 8086 
segment base values rather than selectors. 


A task switch into a task described by an Intel486 
SX microprocessor/Intel OverDrive Processor TSS 
will have an additional check to determine if the in- 
coming task should be resumed in virtual 8086 
mode. Tasks described by 80286 format TSSs can- 
not be resumed in virtual 8086 mode, so no check is 
required there (the FLAGS image in 80286 format 
TSS has only the low order 16 FLAGS bits). Before 
loading the segment register images from an In- 
tel486 SX microprocessor/Intel OverDrive Proces- 
sor TSS, the FLAGS image is loaded, so that the 
segment registers are loaded from the TSS image 
as 8086 segment base values. The task is now 
ready to resume in virtual 8086 execution mode. 


4.6.6.2 Transitions 
Through 
Trap and Interrupt 
Gates, and IRET 


A task switch is one way to enter or exit virtual 8086 
mode. The other method is to exit through a Trap or 
Interrupt gate, as part of handling an interrupt, and 
to enter as part of executing an IRET instruction. 
The transition out must use an Intel486 SX micro- 
processor/Intel 
OverDrive 
Processor Trap 
Gate 
(Type 
14), or 
Intel486 SX microprocessor/Intel 
OverDrive Processor Interrupt Gate (Type 15), which 
must point to a non-conforming level a segment 
(DPL=0) in order to permit the trap handler to IRET 
back to the Virtual 8086 program. The Gate must 
point to a non-conforming level a segment to per- 
form a level switch to level a so that the matching 
IRET can change the VM bit. Intel486 SX microproc- 
essor/Intel 
OverDrive Processor gates must be 
used, since 80286 gates save only the low 16 bits of 
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Figure 4.25. Virtual 8086 Environment 
Interrupt 
and Call Handling 


the FLAGS register, so that the VM bit will not be 
(3) Pushthe 8086 segmentregistervalues onto the 
saved on transitionsthrough the 80286 gates.Also, 
new stack, in the order: GS, FS, OS, ES. These 
the 16-bit IRET (presumably)used to terminate the 
are pushed as 32-bit quantities,with undefined 
80286 interrupt handler will pop only the lower 16 
values in the upper 16 bits. Then load these 4 
bits from FLAGS,and will not affect the VM bit. The 
registerswith null selectors (0). 
actiontakenfor an Intel486SXmicroprocessor/lntel 
(4) Push the old 8086 stack pointer onto the new 
OverDriveProcessorTrap or Interruptgate if an in- 
stack by pushingthe SS register(as 32-bits,high 
terrupt occurs while the task is executing in virtual 
bits undefined),then pushingthe 32-bit ESPreg- 
8086 mode is given by the following sequence. 
ister saved above. 
(1) Savethe FLAGSregisterin a temp to push later. 
(5) Pushthe 32-bit FLAGSregister saved in step 1. 
Turn off the VM andTF bits,and if the interruptis 
(6) Push the old 8086 instruction pointer onto the 
serviced by an InterruptGate, turn off IF also. 
new stack by pushingthe CSregister(as32-bits, 


(2) Interrupt and Trap gates must perform a level 
high bits undefined),then pushingthe 32-bit EIP 
switch from 3 (where the VM86 program exe- 
register. 


cutes) to level 0 (so IRET can return).This pro- 
(7) Load up the new CS:EIPvalue from the interrupt 
cess involvesa stack switchto the stack givenin 
gate, and beginexecutionof the interruptroutine 
the TSS for privilege level O. Save the Virtual 
in protected Intel486 SX microprocessor/Intel 
8086 Mode SS and ESP registers to push in a 
OverDriveProcessormode. 
later step. The segment register load of SS will 
be done as a Protected Mode segment load, 
since the VM bit was turned off above. 


• 


The transition out of virtual 8086 mode performs a 
level change and stack switch, in addition to chang- 
ing back to protected mode. In addition, all of the 
8086 segment register images are stored on the 
stack (behind the SS:ESP image), and then loaded 
with null (0) selectors before entering the interrupt 
handler. This will permit the handler to safely save 
and restore the OS, ES, FS, and GS registers as 
80286 selectors. This is needed so that interrupt 
handlers which don't care about the mode of the 
interrupted program can use the same prolog and 
epilog code for state saving (Le.,push all registers in 
prolog, pop all in epilog) regardless of whether or not 
a "native" mode or Virtual 8086 mode program was 
interrupted. Restoring null selectors to these regis- 
ters before executing the IRET will not cause a trap 
in the interrupt handler. Interrupt routines which ex- 
pect values in the segment registers, or return val- 
ues in segment registers will have to obtain/return 
values from the 8086 register images pushed onto 
the new stack. They will need to know the mode of 
the interrupted program in order to know where to 
find/return segment registers, and also to know how 
to interpret segment register values. 


The IRET instruction will perform the inverse of the 
above sequence. Only the extended Intel486 SX mi- 
croprocessor/Intel 
OverDrive Processor IRET in- 
struction (operand size=32) can be used, and must 
be executed at level 0 to change the VM bit to 1. 
(1) If the NT bit in the FLAGs register is on, an inter- 
task return is performed. The current state is 
stored in the current TSS, and the link field in the 
current TSS is used to locate the TSS for the 
interrupted task which is to be resumed. 
Otherwise, continue with the following sequence. 


(2) Read the FLAGS image from SS:8[ESP) into the 
FLAGS register. This will set VM to the value ac- 
tive in the interrupted routine. 
(3) Pop off the instruction pointer CS:EIP. EIP is 
popped first, then a 32-bit word is popped which 
contains the CS value in the lower 16 bits. If 
VM =0, this CS load is done as a protected 
mode segment load. If VM= 1, this will be done 
as an 8086 segment load. 
(4) Increment the ESP register by 4 to bypass the 
FLAGS image which was "popped" in step 1. 


(5) If VM= 1, load segment registers ES, OS, FS, 


and GS from memory locations SS:[ESP+8], 
SS:[ESP+ 12), 
SS:[ESP+ 16], 
and 
SS:[ESP+20), 
respectively, where the new val- 
ue of ESP stored in step 4 is used. Since VM = 1, 
these are done as 8086 segment register loads. 
Else if VM =0, check that the selectors in ES, 
OS, FS, and GS are valid in the interrupted rou- 
tine. Null out invalid selectors to trap if an at- 
tempt is made to access through them. 


(6) If (RPL(CS) > 
CPL), pop the stack pointer 
SS:ESP from the stack. The ESP register is 
popped first, followed by 32-bits containing SS in 
the lower 16 bits. If VM =0, SS is loaded as a 
protected mode segment register load. If VM= 1, 
an 8086 segment register load is used. 
(7) Resume execution of the interrupted routine. The 
VM bit in the FLAGS register (restored from the 
interrupt routine's stack image in step 1) deter- 
mines whether the Intel486 SX microprocessor/ 
Intel OverDrive Processor resumes the interrupt- 
ed routine in Protected mode of Virtual 8086 
mode. 
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To meet its performance goals the Intel486 SX mi- 
croprocessor/Intel OverDrive Processor contain an 
eight Kbyte cache. The cache is software transpar- 
ent to maintain binary compatibility with previous 
generations of the Inte1386™/lnte1486TMfamily ar- 
chitecture. 


The on-chip cache has been designed for maximum 
flexibility and performance. The cache has several 
operating modes offering flexibility during program 
execution and debugging. Memory areas can be de- 
fined as non-cacheable by software and external 
hardware. Protocols for cache line invalidations and 
replacement are implemented in hardware, easing 
system design. 


The on-chip cache is a unified code and data cache. 
The cache is used for both instruction and data ac- 
cesses and acts on physical addresses. 


The cache organization is 4-way set associative and 
each line is 16 bytes wide. The eight Kbytes of 
cache memory are logically organized as 128 sets, 
each containing four lines. 


The cache memory is physically split into four 
2-Kbyte blocks each containing 128 lines (see Fig- 
ure 5.1). Associated with each 2-Kbyte block are 
128 21-bit tags. There is a valid bit for each line in 
the cache. Each line in the cache is either valid or 
not valid. There are no provisions for partially valid 
lines. 
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The write strategy of on-chip cache is write-through. 
All writes will drive an external write bus cycle in 
addition to writing the information to the internal 
cache if the write was a cache hit. A write to an 
address not contained in the internal cache will only 
be written to external memory. Cache allocations 
are not made on write misses. 


Control of the cache is provided by the CD and NW 
bits in CRO.CD enables and disables the cache. NW 
controls memory write-through and invalidates. 


The CD and NW bits define four operating modes of 
the on-chip cache as given in Table 5.1. These 
modes provide flexibility in how the on-chip cache is 
used. 


The CD and NW bits define four operating modes of 
the on-chip code and data cache, as given in the 
following table: 


Table 5.1. Cache Operating 
Modes 


CD 
NW 
Operating 
Mode 


1 
1 
Cache fills disabled, write-through and 
invalidates disabled 
1 
0 
Cache fills disabled, write-through and 
invalidates enabled 
0 
1 
INVALID. IF CROis loaded with this 
configuration of bits, a GP fault with 
error code of 0 is raised. 


0 
0 
Cache fills enabled, write-through and 
invalidates enabled 


CD=1, 
NW=1 
The cache is completely disabled by setting 
CD= 1 and NW= 1 and then flushing the 
cache. This mode may be useful for debug- 
ging programs where it is important to see 
all memory cycles at the pins. Writes which 
hit in the cache will not appear on the exter- 
nal bus. 
It is possible to use the on-chip cache as 
fast static RAM by "pre-loading" 
certain 
memory areas into the cache and then set- 
ting CD= 1 and NW= 1. Pre-loading can be 
done by careful choice of memory refer- 
ences with the cache turned on or by use of 
the testability functions (see Section 8.2). 
When the cache is turned off the memory 
mapped by the cache is "frozen" into the 
cache since fills and invalidates are dis- 
abled. 


CD=1, 
NW=O 
Cache fills are disabled but write-throughs 
and invalidates are enabled. This mode is 
the same as if the KEN/I pin was strapped 
HIGH disabling cache fills. Write-throughs 
and invalidates may still occur to keep the 
cache valid. This mode is useful if the soft- 
ware must disable the cache for a short pe- 
riod of time, and then re-enable it without 
flushing the original contents. 


CD=O, NW=1 


INVALID. If CROis loaded with this bit con- 
figuration, a General Protection fault with 
error code of 0 is raised. Note that this 
mode would imply a non-transparent write- 
back cache. A future processor may define 
this combination of bits to implement a 
write-back cache. 


CD=O, NW=O 
This is the normal operating mode. 


Completely disabling the cache is a two step pro- 
cess. First CD and NW must be set to 1 and then the 
cache must be flushed. If the cache is not flushed, 
cache hits on reads will still occur and data will be 
read from the cache. 


Any area of memory can be cached in the Intel486 
SX microprocessor/Intel OverDrive Processor. Non- 
cacheable portions of memory can be defined by the 
external system or by software. The external system 
can inform the Intel486 SX microprocessor/Intel 
OverDrive Processor that a memory address is non- 
cacheable by returning the KEN/I pin inactive during 
a memory access (refer to Section 7.2.3). Software 
can prevent certain pages from being cached by set- 
ting the PCD bit in the page table entry. 


A read request can be generated from program op- 
eration or by an instruction pre-fetch. The data will 
be supplied from the on-chip cache if a cache hit 
occurs on the read address. If the address is not in 
the cache, a read request for the data is generated. 
on the external bus. 


If the read request is to a cacheable portion of mem- 
ory, the Intel486 SX microprocessor/Intel OverDrive 
Processor initiates a cache line fill. During a line fill a 
16-byte line is read into the Intel486 SX microproc- 
essor/Intel OverDrive Processor. 


Cache fills will only be generated for read misses. 
Write misses will never cause a line in the internal 
cache to be allocated. If a cache hit occurs on a 
write, the line will be updated. 
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Cache 
line fills can be performed 
over 8- and 16-bit 


busses 
using the dynamic 
bus sizing feature. 
Refer 


to Section 
7.1.3 
for 
a description 
of dynamic 
bus 


sizing. 


Refer 
to 
Section 
7.2.3 
for 
further 
information 
on 


cacheable 
cycles. 


The 
Intel486 
SX 
microprocessor/Intel 
OverDrive 


Processor 
contain 
both 
a hardware 
and 
software 


mechanism 
for invalidating 
lines in its internal 
cache. 


Cache 
line invalidations 
are needed 
to keep the In- 


tel486 
SX microprocessor/Intel 
OverDrive 
Proces- 


sor cache 
contents 
consistent 
with external 
memo- 


ry. 


Refer 
to 
Section 
7.2.8 
for 
further 
information 
on 


cache 
line invalidations. 


When a line needs to be placed in its internal 
cache 


the 
Intel486 
SX 
microprocessor/Intel 
OverDrive 


Processor 
first checks 
to see if there 
is a non-valid 


line in the set that can be replaced. 
If all four lines in 


the 
set 
are 
valid, 
a 
pseudo 
least-recently-used 


mechanism 
is used to determine 
which 
line should 


be replaced. 


A valid bit is associated 
with each line in the cache. 
When 
a line needs 
to be placed 
in a set, the four 


valid bits are checked 
to see if there 
is a non-valid 


line that can be replaced. 
If a non-valid 
line is found, 


that line is marked 
for replacement. 


The four lines in the set are labeled 
10, 11, 12,and 13. 


The order in which the valid bits are checked 
during 


an invalidation 
is 10, 11, 12 and 13. All valid 
bits are 


cleared 
when 
the 
processor 
is reset 
or when 
the 


cache 
is flushed. 


Replacement 
in the cache 
is handled 
by a pseudo 


least recently 
used (LRU) mechanism 
when all four 


lines in a set are valid. Three 
bits, BO, B1 and B2, 


are defined 
for each 
of the 
128 sets in the cache. 


These bits are called the LRU bits. The LRU bits are 
updated 
for every hit or replace 
in the cache. 


If the most recent 
access 
to the set was to 10 or 11, 


BO is set to 1. BO' is set to 0 if the most recent 
ac- 


cess was to 12 or 13. If the most 
recent 
access 
to 


10:11was to 10, B1 is set to 1, else B1 is set to O. If 
the most recent access to 12:13was to 12,B2 is set to 
1, else B2 is set to O. 


The pseudo 
LRU mechanism 
works 
in the following 


manner. 
When 
a line must be replaced, 
the cache 


will first select 
which 
of 10:11and 12:13was least re- 


cently used. Then the cache will determine 
which of 


the two lines was least recently 
used and mark it for 


replacement. 
This decision 
tree is shown 
in Figure 


5.2. 
When 
the 
Intel486 
SX 
microprocessor/Intel 


OverDrive 
Processor 
is reset or when 
the cache 
is 


flushed 
all 128 sets of three 
LRU bits are set to O. 


All four line. In the .et valid?~ 
Replacenon-valid line 


Yo. ! 


80 =o? 


Y•• : 10or 11lout recently u.ed 
No: 12or 13lout recently u.ed 


81=o? 


~ 
Replace 
Replace 


10 
11 


82 =o? 


~ 
Replace 
Replace 
12 
13 
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Two bits for cache control, PWT and PCD, are de- 
fined in the page table and page directory entries. 
The state of these bits are driven out on the PWT 
and PCD pins during memory access cycles. 


The PWT bit controls write policy for second level 
caches used with the Intel486 SX microprocessor/ 
Intel OverDrive Processor. Setting PWT= 1 defines 
a write-through policy for the current page while 
PWT= 0 allows the possibility of write-back. The 
state of PWT is ignored internally by the Intel486 SX 
microprocessor/Intel OverDrive Processor since the 
on-chip cache is write through. 


The PCD bit controls cacheability on a page by page 
basis. The PCD bit is internally ANDed with the 
KEN/I signal to control cacheability on a cycle by 
cycle basis (see Figure 5.3). PCD= 0 enables cach- 
ing while PCD= 1 forbids it. Note that cache fills are 


enabled when PCD=O AND KEN/I =0. This logical 
AND is implemented physically with a NOR gate. 


The state of the PCD bit in the page table entry is 
driven on the PCDpin when a page in external mem- 
ory is accessed. The state of the PCD pin informs 
the external system of the cacheability of the re- 
quested information. The external system then re- 
turns KEN/I telling the Inte1486SX microprocessor/ 
Intel OverDrive Processor if the area is cacheable. 
The Intel486 SX microprocessor/Intel 
OverDrive 
Processor initiates a cache line fill if PCD and KEN/I 
indicate that the requested information is cacheable. 


The PCD bit is masked with the CD (cache disable) 
bit in control register 0 to determine the state of the 
PCD pin. If CD= 1 the Intel486 SX microprocessor/ 
Intel OverDrive Processor forces the PCD pin HIGH. 
If CD= 0 the PCD pin is driven with the value for the 
page table entry/directory. See Figure 5.3. 
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The PWT and PCD bits for a bus cycle are obtained 
from either CR3, the page directory or page table 
entry. These bits are assumed to be zero during real 
mode, whenever paging is disabled, or for cycles 
that bypass paging, (I/O references, interrupt ac- 
knowledge and Halt cycles). 


When paging is enabled, the bits from the page table 
entry are cached in the TLB, and are driven any time 
the page mapped by the TLB entry is referenced. 
For normal memory cycles, PWT and PCD are taken 
from the page table entry. During TLB refresh cycles 
where the page table and directory entries are read, 
the PWT and PCD bits must be obtained elsewhere. 
During page table updates the bits are obtained from 
the page directory. When the page directory is up- 
dated the bits are obtained from CR3. 


5.7 Cache Flushing 


The on-chip cache can be flushed by external hard- 
ware or by software instructions. Flushing the cache 
clears all valid bits for all lines in the cache. The 
cache is flushed when external hardware asserts the 
FLUSH# pin. 


The flush pin needs to be asserted for one clock if 
driven synchronously or for two clocks if driven 
asynchronously. The flush input is asynchronous but 
setup and hold times must be met. The flush pin 
should be deasserted after the cache flush is com- 
plete. Failure to deassert the pin will cause execu- 
tion to stop as the processor will be repeatedly flush- 
ing the cache. If external hardware activates flush in 
response to an I/O write, flush must be asserted for 
at least two clocks prior to ready being returned for 
the I/O write. This ensures that the flush completes 
before the CPU begins execution of the instruction 
following the OUT instruction. 


Flush is recognized during HOLD just like EADS#. 


The instructions INVD and WBINVD cause the on- 
cache to be flushed. External caches connected to 
the 
Intel486 SX microprocessor/Intel 
OverDrive 


Processor are signaled to flush their contents when 
these instructions are executed. 


WBINVD will cause an external write-back cache to 
write back dirty lines before flushing its contents. 
The external cache is signalled using the bus cycle 
definition pins and the byte enables (refer to Section 
6.2.5 for the bus cycle definition pins and Section 
7.2.11 for special bus cycles). Refer to the Intel486 
microprocessor programmers reference manual for 
detailed instruction definitions. 


The results of the INVD and WBINVD instructions 
are identical for the operation of the Intel486 SX mi- 
croprocessorllntel 
OverDrive 
Processor 
on-chip 


cache since the cache is write-through. Note that 
the INVD and WBINVD instructions are machine de- 
pendent. Future members of the Intel486 SX micro- 
processorllntel 
OverDrive Processor family may 


change the definition of this instruction. 


5.8 Caching Translation 
Lookaside 
Buffer Entries 


The Intel486 SX microprocessorllntel 
OverDrive 


Processor contain an integrated paging unit with a 
translation lookaside buffer (TLB). The TLB contains 
32 entries. The TLB has been enhanced over the 
Intel386 microprocessor's TLB by upgrading the re- • 
placement strategy to a pseudo-LRU (least recently 
used) algorithm. The pseudo-LRU replacement algo- 
rithm is the same as that used in the on-chip cache. 


The paging TLB operation is automatic whenever 
paging is enabled. The TLB contains the most re- 
cently used page table entries. A page table entry 
translates the linear address pointing to a particular 
page to the physical address where the page is 
stored in memory (refer to Section 4.5, Paging). 


The paging unit will look up the linear address in the 
TLB in response to an internal bus request. The cor- 
responding physical address is passed on to the on- 
chip cache or the external bus (in the event of a 
cache miss) when the linear address is present in 
the TLB. 


The paging unit will access the page tables in exter- 
nal memory if the linear address is not in the TLB. 
The required page table entry will be read into the 
TLB and then the cache or bus cycle for the actual 
data will take place. The process of reading a new 
page table entry into the TLB is called a TLB refresh. 


A TLB refresh is a two step process. The paging unit 
must first read the page directory entry which points 
to the appropriate page table. The page table entry 
to be stored in the TLB is then read from the page 
table. Control register 3 (CR3) points to the base of 
the page directory table. 


The Intel486 SX microprocessorlintel 
OverDrive 


Processor will allow page directory and page table 
entries (returned during TLB refreshes) to be stored 
in the on-chip cache. Setting the PCD bits in CR3 
and the page directory entry to 1 will prevent the 
page directory and page table entries from being 
stored in the on-chip cache (see Section 5.6, Page 
Cacheablllty). 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor bus has been designed to be similar to 
the Intel386 microprocessor bus whenever possible. 
Several new features have been added to the In- 
tel486 SX microprocessor/Intel OverDrive Proces- 
sor bus resulting in increased performance and func- 
tionality. New features include a 1X clock, a burst 
bus mechanism for high-speed internal cache fills, a 
cache line invalidation mechanism, enhanced bus 
arbitration capabilities, a BS8# bus sizing mecha- 
nism and parity support. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor are driven by a 1X clock as opposed to a 
2X clock in the Intel386 microprocessor. A 1X clock 
allows simpler system design by cutting in half the 
clock speed required in the external system. 


Like the Intel386 microprocessor, the Intel486 SX 
microprocessor/Intel 
OverDrive 
Processor 
have 
separate parallel busses for data and addresses. 
The bidirectional data bus is 32 bits in width. The 
address bus consists of two components: 30 ad- 
dress lines (A2-A31) 
and 4 by1e enable lines 


(BEO#-BE3#). 
The address bus addresses ex- 


ternal memory in the same manner as the Intel386 
microprocessor: The address lines form the upper 
30 bits of the address and the by1eenables select 
individual by1eswithin a 4 by1elocation. The address 
lines are bidirectional for use in cache line invalida- 
tions. 


The Intel486 SX microprocessor/Intel 
OverDrive 


Processor burst bus mechanism enables high-speed 
cache fills from external memory. Burst cycles can 
strobe data into the Intel486 SX microprocessor/In- 
tel OverDrive Processor at a rate of one item every 
clock. Non-burst cycles have a maximum rate of one 
item every two clocks. Burst cycles are not limited to 
cache fills: all bus cycles requiring more than a sin- 
gle data cycle can be bursted. 


The Intel486 SX microprocessor/Intel 
OverDrive 


Processor have a bus hold feature similar to that of 
the Intel386 microprocessor. During bus hold, the 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor relinquish control of the local bus by floating its 
address, data and control busses. 


The Intel486 SX microprocessor/Intel 
OverDrive 


Processor have an address hold feature in addition 
to bus hold. During address hold only the address 
bus is floated, the data and control busses can re- 
main active. Address hold is used for cache line in- 
validations. 
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Ahead is a brief description of the Intel486 SX micro- 
processor/lntel OverDrive Processor input arid out- 
put signals arranged by functional groups. Before 
beginning the signal descriptions a few terms need 
to be defined. The /I symbol at the end of a signal 
name indicates the active, or asserted, state occurs 
when the signal is at a low voltage. When a /I is not 
present after the signal name, the signal is active at 
the high voltage level. The term "ready" is used to 
indicate that the cycle is terminated with RDY/I or 
BRDY#. 


Section 6 and 7 will discuss bus cycles and data 
cycles. A bus cycle is at least two clocks long and 
begins with ADS# active in the first clock and ready 
active in the last clock. Data is transferred to or from 
the 
Intel486 SX microprocessor/lntel 
OverDrive 
Processor during a data cycle. A bus cycle contains 
one or more data cycles. 


6.2.1 CLOCK 
(CLK) 


ClK provides the fundamental timing and the inter- 
nal operating frequency for the Intel486 SX micro- 
processor/lntel 
OverDrive Processor. All external 
timing parameters are specified with respect to the 
rising edge of ClK. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor can operate over a wide frequency range 
but ClK's 
frequency cannot change rapidly while 
RESET is inactive. ClK's frequency must be stable 
for proper chip operation since a single edge of ClK 
is used internally to generate two phases. ClK only 
needs TTl 
levels for proper operation. Figure 6.2 
illustrates the ClK waveform. 


6.2.2 Address Bus (A31-A2, BEO#-BE3#) 


A31-A2 
and BEO#-BE3# 
form the address bus 
and provide physical memory and I/O port address- 
es. The Intel486 SX microprocessor/Intel OverDrive 
Processor are capable of addressing 4 gigabytes of 
physical 
memory 
space 
(OOOOOOOOHthrough 
FFFFFFFFH), and 64 Kbytes of I/O address space 
(OOOOOOOOH 
through OOOOFFFFH).A31-A2 
identify 
addresses to a 4-byte location. BEO# - BE3/I identi- 
fy which bytes within the 4-byte location are involved 
in the current transfer. 


Addresses are driven back into the Intel486 SX mi- 
croprocessor/Intel OverDrive Processor over A31- 
A4 during cache line invalidations. The address lines 
are active HIGH. When used as inputs into the proc- 
essor, A31-A4 must meet the setup and hold times, 
t22 and t23' A31-A2 
are not driven during bus or 
address hold. 


The byte enable outputs, BEO#-BE3#, 
determine 
which bytes must be driven valid for read and write 
cycles to external memory. 


BE3# applies to D24-D31 
BE2# applies to D16-D23 
BE1# applies to D8-D15 
BEO# applies to DO-D7 


BEO#-BE3# 
can be decoded to generate AO, A1 


and BHE# signals used in 8- and 16-bit systems 
(see Table 7.5). BEO/l-BE3# 
are active lOW and 
are not driven during bus hold. 


The bidirectional lines, D31-DO, form the data bus 
for the Intel486 SX microprocessor/Intel OverDrive 
Processor DO-D7 define the least significant byte 
and 
D24-D31 
the 
most 
significant 
byte. 


tx = input setup times 
ty = input hold times, 
output 
float, valid and hold times 
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Data transfers to 8- or 16-bit devices is possible us- 
ing the data bus sizing feature controlled by the 
BS8# or BS16# input pins. 


031-00 
are active HIGH. For reads, 031-00 
must 
meet the setup and hold times, t22 and t23. 031-00 
are not driven during read cycles and bus hold. 


6.2.4 PARITY 


Data Parity Input/Outputs 
(DPO-DP3) 


DPO-DP3 are the data parity pins for the processor. 
There is one pin for each byte of the data bus. Even 
parity is generated or checked by the parity genera- 
tors/ checkers. Even parity means that there are an 
even number of HIGH inputs on the eight corre- 
sponding data bus pins and parity pin. 


Data parity is generated on all write data cycles with 
the same timing as the data driven by the Intel486 
SX microprocessor/Intel OverDrive Processor. Even 
parity information must be driven back to the In- 
tel486 SX microprocessorlintel 
OverDrive Proces- 
sor on these pins with the same timing as read infor- 
mation to insure that the correct parity check status 
is indicated by the Intel486 SX microprocessor/Intel 
OverDrive Processor. 


The values read on these pins do not affect program 
execution. It is the responsibility of the system to 
take appropriate actions if a parity error occurs. 


Input signals on DPO-DP3 must meet setup and 
hold times t22 and t23 for proper operation. 


Parity status is driven on the PCHK# pin, and a pari- 
ty error is indicated by this pin being LOW. PCHK# 
is driven the clock after ready for read operations to 
indicate the parity status for the data sampled at the 
end of the previous clock. Parity is checked during 
code reads, memory reads and I/O reads. Parity is 
not checked during interrupt acknowledge cycles. 
PCHK# only checks the parity status for enabled 
bytes as indicated by the byte enable and bus size 
signals. It is valid only in the clock immediately after 
read data is returned to the Intel486 SX microproc- 
essor/Intel OverDrive Processor. At all other times it 
is inactive (HIGH). PCHK# is never floated. 


Driving PCHK# is the only effect that bad input pari- 
ty has on the Intel486 SX microprocessor/Intel 
OverDrive Processor. The Intel486 SX microproces- 
sor/Intel OverDrive Processor will not vector to a 
bus error interrupt when bad data parity is returned. 
In systems that will not employ parity, PCHK# can 
be ignored. In systems not using parity, DPO-DP3 
should be connected to Vcc through a pullup resis- 
tor. 


6.2.5 BUS CYCLE DEFINITION 


MIIO#, 
D/C#, 
W/R# 
Outputs 


M/IO#, 
D/C# and W/R# 
are the primary bus cycle 
definition signals. They are driven valid as the ADS# 
signal is asserted. M/IO# 
distinguishes between 


memory and I/O cycles, D/C# 
distinguishes be- 
tween data and control cycles and W/R# 
distin- 
guishes between write and read cycles. 


Bus cycle definitions as a function of M/IO#, 
D/C# 
and W/R# 
are given in Table 6.1. Note there is a 
difference between the Intel486 SX microprocessor/ 
Intel OverDrive Processor and Intel386 microproces- 
sor bus cycle definitions. The halt bus cycle type has 
been moved to location 001 in the Intel486 SX mi- 
croprocessorllntel 
OverDrive Processor from loca- 
tion 101 in the Intel386 microprocessor. Location 
101 is now reserved and will never be generated by 
the 
Intel486 SX microprocessor/Intel 
OverDrive 
Processor. 


Table 6.1. AD5# Initiated Bus Cycle Definitions 


M/IO# 
D/C# 
W/R# 
Bus Cycle Initiated 


0 
0 
0 
InterruptAcknowledge 
0 
0 
1 
Halt/SpecialCycle 
0 
1 
0 
I/O Read 
0 
1 
1 
I/O Write 
1 
0 
0 
CodeRead 
1 
0 
1 
Reserved 
1 
1 
0 
MemoryRead 
1 
1 
1 
MemoryWrite 


LOCK# indicates that the Intel486 SX microproces- 
sor/Intel 
OverDrive Processor is running a read- 
modify-write cycle where the ex1ernalbus must not 
be relinquished between the read and write cycles. 
Read-modify-write cycles are used to implement 
memory-based semaphores. Multiple reads or writes 
can be locked. 


When LOCK# is asserted, the current bus cycle is 
locked and the Intel486 SX microprocessor/Intel 
OverDrive Processor should be allowed exclusive 
access to the system bus. LOCK# goes active in 
the first clock of the first locked bus cycle and goes 
inactive after ready is returned indicating the last 
locked bus cycle. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor will not acknowledge bus hold when 
LOCK# is asserted (though it will allow an address 
hold). LOCK# is active LOW and is floated during 
bus hold. Locked read cycles will not be transformed 
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into cache fill cycles if KEN# is returned active. Re- 
fer to Section 7.2.6 for a detailed discussion of 
locked bus cycles. 


The pseudo-lock feature allows atomic reads and 
writes of memory operands greater than 32 bits. 
These operands require more than one cycle to 
transfer. The Intel486 SX microprocessor asserts 
PLOCK# during segment table descriptor reads (64 
bits) and cache line fills (128 bits). 


The Intel OverDrive Processor asserts PLOCK# 
during floating point long reads and writes (64 
bits), segment table descriptor reads (64 bits) 
and cache line fills (128 bits). 


When PLOCK# is asserted no other master will be 
given control of the bus between cycles. A bus hold 
request (HOLD) is not acknowledged during pseudo- 
locked reads and writes, with one exception. During 
non-cacheable non-bursted code prefetches, HOLD 
is recognized on memory cycle boundaries even 
though PlOCK# 
is asserted. The Intel486 SX mi- 
croprocessor/Intel 
OverDrive Processor will drive 
PLOCK# active until the addresses for the last bus 
cycle of the transaction have been driven regardless 
of whether BRDY# or ROY# are returned. 


A pseudo-locked transfer is meaningful only if the 
memory operand is aligned and if its completely con- 
tained within a single cache line. 


A 
64·bit 
floating 
point 
number must 
be 
aligned to an 8-byte boundary to guarantee 
an atomic access. 


Nonnally PLOCK" and BLAST# are inverse 
of each other. However during the first cycle 
of a 64·bit floating point write, both PLOCK# 
and BLAST# will be asserted. 


Since PLOCK# is a function of the bus size and 
KEN# inputs, PLOCK# should be sampled only in 
the clock ready is returned. This pin is active lOW 
and is not driven during bus hold. Refer to Section 
7.2.7 for a detailed discussion of pseudo-locked bus 
cycles. 


The bus control signals allow the Intel486 SX micro- 
processor/Intel 
OverDrive Processor to 
indicate 
when a bus cycle has begun, and allow other system 
hardware to control burst cycles, data bus width and 
bus cycle termination. 


Address Status Output (ADS#) 


The ADS# output indicates that the address and 
bus cycle definition signals are valid. This signal will 
go active in the first clock of a bus cycle and go 
inactive in the second and subsequent clocks of the 
cycle. ADS# is also inactive when the bus is idle. 


ADS# is used by external bus circuitry as the indica- 
tion that the Intel486 SX microprocessor/Intel Over- 
Drive Processor has started a bus cycle. The exter- 
nal circuit must sample the bus cycle definition pins 
on the next rising edge of the clock after ADS# is 
driven active. 


ADS# is active lOW and is not driven during bus 
hold. 


Non-burst Ready Input (RDY#) 


ROY# indicates that the current bus cycle is com- 
plete. In response to a read, ROY# indicates that 
the external system has presented valid data on the 
data pins. In response to a write request, ROY# indi- 
cates that the external system has accepted the In- 
tel486 SX microprocessor/Intel 
OverDrive Proces- 
sor data. ROY# is ignored when the bus is idle and 
at the end of the first clock of the bus cycle. Since 
ROY# is sampled during address hold, data can be 
returned to the processor when AHOlD is active. 


ROY# is active lOW, and is not provided with an 
internal pullup resistor. This input must satisfy setup 
and hold times t16 and t17 for proper chip operation. 


6.2.7 BURST CONTROL 


Burst Ready Input (BRDY#) 


BRDY# performs the same function during a burst 
cycle that ROY# performs during a non-burst cycle. 
BRDY# indicates that the external system has pre- 
sented valid data on the data pins in response to a 
read or that the external system has accepted the 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor data in response to a write. BRDY# is ignored 
when the bus is idle and at the end of the first clock 
in a bus cycle. 


During a burst cycle, BRDY# will be sampled each 
clock, and if active, the data presented on the data 
bus pins will be strobed into the Intel486 SX micro- 
processor/Intel OverDrive Processor. ADS# is neg- 
ated during the second through last data cycles in 
the burst, but address lines A2-A3 and byte enables 
will change to reflect the next data item expected by 
the 
Intel486 SX microprocessor/Intel 
OverDrive 
Processor. 


If ROY# is returned simultaneously with BRDY#, 
BRDY# is ignored and the burst cycle is premature- 
ly aborted. An additional complete bus cycle will be 
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initiated after an aborted burst cycle if the cache line 
fill was not complete. BRDY# is treated as a normal 
ready for the last data cycle in a burst transfer or for 
non-burstable cycles. Refer to Section 7.2.2 for 
burst cycle timing. 


BRDY# is active LOW and is provided with a small 
internal pullup resistor. BRDY# must satisfy the set- 
up and hold times t16 and tH. 


BLAST# indicates that the next time BRDY# is re- 
turned it will be treated as a normal RDY#, terminat- 
ing the line fill or other multiple-data-cycle transfer. 
BLAST# is active for all bus cycles regardless of 
whether they are cacheable or not. This pin is active 
LOW and is not driven during bus hold. 


6.2.8 INTERRUPT SIGNALS (RESET, INTR, NMI) 


The interrupt signals can interrupt or suspend exe- 
cution of the processor's current instruction stream. 


Reset Input (RESET) 


RESET forces the Intel486 SX microprocessor/Intel 
OverDrive Processor to begin execution at a known 
state. For a power-up (cold start) reset, Vcc and 
CLK must reach their proper DC and AC specifica- 
tions for at least 1 ms before the Intel486 SX micro- 
processor/Intel OverDrive Processor begins instruc- 
tion execution. The RESET pin should remain active 
during this time to ensure proper Intel486 SX micro- 
processor/Intel 
OverDrive 
Processor 
operation. 
However, for a warm boot-up case, RESET is re- 
quired to remain active for a minimum of 15 clocks. 
The testability operating modes are programmed by 
the 
falling 
(inactive 
going) 
edge 
of 
RESET. (Refer to Section 8.0 for a description of the 
test modes during reset.) 


Maskable Interrupt 
Request Input (INTR) 


INTR indicates that an external interrupt has t;>een 
generated. Interrupt processing is initiated if the IF 
flag is active in the EFLAGS register. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor will generate two locked interrupt ac- 
knowledge bus cycles in response to asserting the 
INTR pin. An 8-bit interrupt number will be latched 
from an external interrupt controller at the end of the 
second interrupt acknowledge cycle. INTR must re- 
main active until the interrupt acknowledges have 
been performed to assure program interruption. Re- 
fer to Section 7.2.10 for a detailed discussion of in- 
terrupt acknowledge cycles. 


The INTR pin is active HIGH and is not provided with 
an internal pulldown resistor. INTR is asynchronous, 
but the INTR setup and hold times, t20and t21, must 
be met to assure recognition on any specific clock. 


Non-maskable Interrupt 
Request Input (NMI) 


NMI is the non-maskable interrupt request signal. 
Asserting NMI causes an interrupt with an internally 
supplied vector value of 2. External interrupt ac- 
knowledge cycles are not generated since the NMI 
interrupt vector is internally generated. When NMI 
processing begins, the NMI signal will be masked 
internally until the IRET instruction is executed. 


NMI is rising edge sensitive after internal synchroni- 
zation. NMI must be held LOW for at least four CLK 
periods before this rising edge for proper operation. 
NMI is not provided with an internal pulldown resis- 
tor. NMI is asynchronous but setup and hold times, 
t20and t21 must be met to assure recognition on any 
specific clock. 


6.2.9 BUS ARBITRATION SIGNALS 


This section describes the mechanism by which the 
processor relinquishes control of its local bus when 
requested by another bus master. 


Bus Request Output (BREQ) 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor asserts BREa whenever a bus cycle is 
pending internally. Thus, BREa is always asserted in 
the first clock of a bus cycle, along with ADS#. Fur- 
thermore, if the Intel486 SX microprocessor/lntel 
OverDrive Processor is currently not driving the bus 
(due to HOLD, AHOLD, or BOFF#), BREa is assert- 
ed in the same clock that ADS# would have been 
asserted if the Intel486 SX microprocessor/Intel 
OverDrive Processor were driving the bus. After the 
first clock of the bus cycle, BREa may change state. 
It will be asserted if additional cycles are necessary 
to complete a transfer (via BS8#, BS16#, KEN#), 
or if more cycles are pending internally. However, if 
no additional cycles are necessary to complete the 
current transfer, BREa can be negated before ready 
comes back for the current cycle. External logic can 
use the BREa signal to arbitrate among multiple 
processors. This pin is driven regardless of the state 
of bus hold or address hold. BREa is active HIGH 
and is never floated. During a hold state, internal 
events may cause BREa to be deasserted prior to 
any bus cycles. 


Bus Hold Request Input (HOLD) 


HOLD allows another bus master complete control 
of the Intel486 SX microprocessor/Intel OverDrive 
Processor bus. The Intel486 SX microprocessor/In- 
tel OverDrive Processor will respond to an active 
HOLD signal by asserting HLDA and placing most of 
its output and input/output pins in a high impedance 
state (floated) after completing its current bus cycle, 
burst cycle, or sequence of locked cycles. In addi- 
tion, if the Intel486 SX microprocessor/Intel Over- 
Drive Processor receives a HOLD request while 
performing a code fetch, and that cycle is backed 
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off (BOFF#), the Intel486 SX microprocessor/Intel 
OverDrive Processor will recognize HOLD before re- 
starting the cycle. The BREQ, HLDA, PCHK# and 
FERR# pins are not floated during bus hold. The 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor will maintain its bus in this state until the HOLD is 
deasserted. Refer to Section 7.2.9 for timing dia- 
grams for bus hold cycles and HOLD request ac- 
knowledge during BOFF#. 


Unlike the Intel386 microprocessor, the Intel486 SX 
microprocessor/Intel OverDrive Processor will rec- 
ognize HOLD during reset. Pullup resistors are not 
provided for the outputs that are floated in response 
to HOLD. HOLD is active HIGH and is not provided 
with an internal pulldown resistor. HOLD must satisfy 
setup and hold times t18 and t19 for proper chip op- 
eration. 


Bus Hold Acknowledge 
Output (HLDA) 


HLDA indicates that the Intel486 SX microproces- 
sor/Intel OverDrive Processor has given the bus to 
another local bus master. HLDA goes active in re- 
sponse to a hold request presented on the HOLD 
pin. HLDA is driven active in the same clock that the 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor floats its bus. 


HLDA will be driven inactive when leaving bus hold 
and the Intel486 SX microprocessor/Intel OverDrive 
Processor will resume driving the bus. The Intel486 
SX microprocessor/Intel 
OverDrive Processor will 
not cease internal activity during bus hold since the 
internal cache will satisfy the majority of bus re- 
quests. HLDA is active HIGH and remains driven 
during bus hold. 


Backoff Input (BOFF#) 


Asserting the BOFF# input forces the Intel486 SX 
microprocessor/Intel 
OverDrive Processor to re- 
lease control of its bus in the next clock. The pins 
floated are exactly the same as in response to 
HOLD. The response to BOFF# differs from the re- 
sponse to HOLD in two ways: First, the bus is float- 
ed immediately in response to BOFF# while the In- 
tel486 SX microprocessor/Intel 
OverDrive Proces- 
sor completes the current bus cycle before floating 
its bus in response to HOLD. Second the Intel486 
SX microprocessor/Intel OverDrive Processor does 
not assert HLDA in response to BOFF#. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor remains in bus hold until BOFF# is nega- 
ted. Upon negation, the Intel486 SX microproces- 
sor/Intel OverDrive Processor restarts the bus cycle 
aborted when BOFF# was asserted. To the internal 
execution engine the effect of BOFF# is the same 
as inserting a few wait states to the original cycle. 
Refer to Section 7.2.12 for a description of bus cycle 
restart. 


Any data returned to the Intel486 SX microproces- 
sor/Intel OverDrive Processor while BOFF# is as- 
serted is ignored. BOFF# has higher priority than 
ROY# or BRDY#. If both BOFF# and ready are 
returned in the same clock, BOFF# takes effect. If 
BOFF# is asserted while the bus is idle, the Intel486 
SX microprocessor/Intel 
OverDrive Processor will 
float its bus in the next clock. BOFF# is active LOW 
and must meet setup and hold times t18 and t19 for 
proper chip operation. 


6.2.10 CACHE INVALIDATION 


The AHOLD and EADS# 
inputs are used during 
cache invalidation cycles. AHOLD conditions the In- 
tel486 SX microprocessor/Intel 
OverDrive Proces- 
sor 
address 
lines, 
A4-A31, 
to 
accept 
an 
address input. EADS# indicates that an external • 
address is actually valid on the address inputs. Acti- 
vating EADS# will cause the Intel486 SX microproc- 
essor/Intel OverDrive Processor to read the external 
address bus and perform an internal cache invalida- 
tion 
cycle 
to 
the 
address 
indicated. 
Refer 
to 
Section 7.2.8 for cache invalidation cycle timing. 


Address Hold Request Input (AHOLD) 


AHOLD is the address hold request. It allows anoth- 
er bus master access to the Intel486 SX microproc- 
essor/Intel 
OverDrive Processor address bus for 
performing an internal cache invalidation cycle. As- 
serting AHOLD will force the Intel486 SX microproc- 
essor/Intel OverDrive Processor to stop driving its 
address bus in the next clock. While AHOLD is ac- 
tive only the address bus will be floated, the remain- 
der of the bus can remain active. For example, data 
can be returned for a previously specified bus cycle 
when AHOLD is active. The Intel486 SX microproc- 
essor/Intel OverDrive Processor will not initiate an- 
other bus cycle during address hold. Since the In- 
tel486 SX microprocessor/Intel 
OverDrive Proces- 


sor floats its bus immediately in response to AHOLD, 
an address hold acknowledge is not required. If 
AHOLD is asserted while a bus cycle is in progress, 
and no readies are returned during the time AHOLD 
is asserted, the Intel486 SX microprocessor/Intel 
OverDrive Processor will redrive the same address 
(that it originally sent out) once AHOLD is negated. 


AHOLD is recognized during reset. Since the entire 
cache is invalidated by reset, any invalidation cycles 
run during reset will be unnecessary. AHOLD is ac- 
tive HIGH and is provided with a small internal pull- 
down resistor. It must satisfy the setup and hold 
times t18 and t19 for proper chip operation. This pin 
determines whether or not the built in self test fea- 
tures of the Intel486 SX microprocessor/Intel Over- 
Drive Processor will be exercised on assertion of 
RESET. 


External Address Valid Input (EADS#) 


EADS# indicates that a valid external address has 
been driven onto the Intel486 SX microprocessor/ 
Intel OverDrive Processor address pins. This ad- 
dress will be used to perform an internal cache inval- 
idation cycle. The 9(cternaladdress will be checked 
with the current cache contents. If the address spec- 
ified matches any areas in the cache, that area will 
immediately be invalidated. 


An invalidation cycle 
may be run by asserting 
EADS# regardless of the state of AHOLD, HOLD 
and BOFF#. EADS# is active LOW and is provided 
with an internal pullup resistor. EADS# must satisfy 
the setup and hold times t12 and t13 for proper chip 
operation. 


6.2.11 CACHE CONTROL 
Cache Enable Input (KEN#) 


KEN# is the cache enable pin. KEN# is used to 
determine whether the data being returned by the 
current cycle is cacheable. When KEN# is active 
and the Intel486 SX microprocessor/Intel OverDrive 
Processor generates a cycle that can be cached 
(most any memory read cycle), the cycle will be 
transformed into a cache line fill cycle. 


A cache line is 16 bytes long. Duringthe first cycle of 
a cache line fill the byte-enable pins should be ig- 
nored and data should be returned as if all four byte 
enables were asserted. The Intel486 SX microproc- 
essor/Intel OverDrive Processor will run between 4 
and 16 contiguous bus cycles to fill the line depend- 
ing on the bus data width selected by BS8# and 
BS16#. Refer to Section 7.2.3 for a description of 
cache line fill cycles. 


The KEN# input is active LOW and is provided with 
a small internal pullup resistor. It must satisfy the 
setup and hold times t14 and t15 for proper chip op- 
eration. 


Cache Flush Input (FLUSH#) 


The FLUSH# input forces the Intel486 SX micro- 
processor/Intel OverDrive Processor to flush its en- 
tire internal cache. FLUSH# 
is active LOW and 
need only be asserted for one clock. FLUSH# is 
asynchronous but setup and hold times t20 and t21 
must be met for recognition on any specific clock. 


FLUSH# also determines whether or not the tristate 
test mode of the Intel486 SX microprocessor/Intel 
OverDrive Processor will be invoked on assertion of 
RESET. To initiate the tri-state test mode, FLUSH# 
should be active for 2 clks before and 2 clks after 
the reset is deasserted. However, if FLUSH# is as- 
serted synchronously keeping it active for 2 CLKs 
prior to the falling edge of RESET will initiate the tri- 
state mode. 
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6.2.12 PAGE CACHEABILITY (PWT, PCD) 
The PWT and PCD output signals correspond to two 
user attribute bits in the page table entry. When pag- 
ing is enabled, PWT and PCD correspond to bits 3 
and 4 of the page table entry respectively. For cy- 
cles that are not paged when paging is enabled (for 
example I/O cycles) PWT and PCD correspond to 
bits 3 and 4 in control register 3. When paging is 
disabled, 
the 
Intel486 
SX 
microprocessor/Intel 
OverDrive Processor ignores the PCD and PWT bits 
and assumes they are zero for the purpose of cach- 
ing and driving PCD and PWT. 


PCD is masked by the CD (cache disable) bit in con- 
trol register 0 (CRO).When CD= 1 (cache line fills 
disabled) 
the 
Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
forces 
PCD HIGH. When 
CD=0, PCD is driven with the value of the page 
table entry/directory. 


The purpose of PCD is to provide a cacheable/non- 
cacheable indication on a page by page basis. The 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor will not perform a cache fill to any page in which 
bit 4 of the page table entry is set. PWT corresponds 
to the write-back bit and·can be used by an external 
cache to provide this functionality. PCD and PWT 
bits are assigned to be zero during real mode or 
whenever paging is disabled. Refer to Sections 4.5.4 
and 5.6 for a discussion of non-cacheable pages. 


PCD and PWT have the same timing as the cycle 
definition pins (M/IO#, 
D/C#, 
W/R#). 
PCD and 
PWT are active HIGH and are not driven during bus 
hold. 


6.2.13.1 Intel OverDrive Processor IIn\81487 
SX Math COProcessor Signals 


Upgrade Present (UP#) 


The UP# pin is present on the Intel OverDrive 
Processor/lntel487 SX Math CoProcessor only. 
The Intel OverDrive Processor/lntel487 SX Math 
CoProcessor uses UP# to signal to the Intel486 
SX microprocessor that an Intel OverDrive Proc- 
essorllnte1487 SX Math CoProcessor is present 
in the system. An UP# driven interface circuit 
floats 
the 
Intel486 
SX 
microprocessor/Intel 
OverDrive Processor/lntel487 SX Math CoProc- 
essor then takes charge of the bus. The Intel 
OverDrive Processor/lntel487 SX Math CoProc- 
essor always drive UP# ae:tlve.UP# will never 
be floated if an Intel OverDrive Processor/In- 
tel487 SX Math CoProcessor is present In the 
system. SInce the Intel OverDrive Processor/In- 
tel487 SX Math CoProcessor always drives UP# 
active, UP# can be used to check whether the 
system contalns an Intel OverDrive Processor/ 
Intel487 SX Math CoProcessor. 
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The Key pin is electrically nonfunctional. Its only 
purpose is to insure correct Intel OverDrive Proc· 
essor/lntel487 SX Math CoProcessor orientation 
in a 169-pin socket. 


6.2.13.2 Numeric Error Reporting 
(FERR*, IONNE#) 


To allow PC-type floating point error reporting, 
the Intel OverDrive Processor provides two pins, 
FERR# and IGNNE#. 


Floating PoInt Error Output (FERR#) 


The Intel OverDrive Processor asserts FERR# 
whenever an unmasked floating point error is en- 
countered. FERR# is similar to the EAROR# pin 
on the Intel387 Math CoProcessor. FEAR# can 
be used by extemal logic for PC-type floating 
point error reporting in systems with an Intel 
OverDrive Processor. FERA # is active lOW 
and is not floated during bus hold. 
' 


In some cases, FERR# is asserted when the 
next floating point instruction is encountered and 
in other cases it is asserted before the next float- 
ing point instruction is encountered depending 
upon the execution state of the instruction caus- 
ing the exception. 


The following class of floating point exceptions 
drive FERR# at the time the exception occurs 
(i.e., before encountering the next floating point 
instruction). 
1. The stack fault, invalid operation, and denor- 
mal exceptions on all transcendental instruc- 
tions, integer arithmetic instructions, FSQAT, 
FSCALE, FPREM(1), FXTRACT, FBLD, and 
FBSTP. 
2. Any exceptions on store instructions (includ- 
ing integer store instructions). 


The following class of floating point exceptions 
drive FEAR# only after encountering the next 
floating point instruction. 
1. Exceptions other than on all transcendental 
instructions, integer arithmetic instructions, 
FSQRT, 
FSCALE, 
FPREM(1), 
FXTRACT, 
FBlD, and FBSTP. 
2. Any exception on all basic arithmetic, load, 
compare. and control instructions (i.e., all oth- 
er instructions). 


Ignore Numeric Error Input (IONNE#) 


The. Intel OverDrive Processor will ignore a nu- 
menc error and continue executing non-control 
floating point instructions when IGNNE# is as- 
serted, but FERR# will still be activated. When 


dea88erted, the Intel OverDrive Processor will 
freeze on a non-control floating point instruction 
If 
a 
pnMou8 
Instruction 
caused 
an 
error. 
IGNNE# has no effect when the NE bit in control 
register 0 Is tel 


The IGNNE# Input is active LOW and is provid- 
ed with a emeIIlntemal puIIup resistor. This input 
Is uynchronous, 
but must meet setup and hold 
times t2C) and t21 to Insure recognition on any 
apecIfIc clock. 


6.2.14 BUS SIZE CONTROL (BS16#, BS8#) 


The 8S16# and 8S8 # inputs allow external 16- and 
8-bit busses to be supported with a small number of 
external components. The Intel486 SX microproces- 2 
sorllntel 
OverDrive Processor samples these pins 
every clock. The value sampled in the clock before 
ready determines the bus size. When asserting 
8S16# or 8S8# only 16 or 8 bits of the data bus 
need be valid. If both 8S16# and 8S8# are assert- 
ed, an 8-bit bus width is selected. 


When 8S16# or 8S8# are asserted the Intel486 SX 
microprocessorlintel OverDrive Processor will con- 
vert a larger data request to the appropriate number 
of smaller transfers. The byte enables will also be 
modified appropriately for the bus size selected. 


8S16# and 8S8# are active LOW and are provided 
with small internal pullup resistors. 8S16# 
and 
8S8# must satisfy the setup and hold times t14 and 
t15 for proper chip operation. 


6.2.15 ADDRESS BIT 20 MASK (A20M#) 


Asserting the A20M # input causes the Intel486 SX 
microprocessor/Intel OverDrive Processor to mask 
physical address bit 20 before performing a lookup 
in the internal cache and before driving a memory 
cycle to the outside world. When A20M# is assert- 
ed, the Intel486 SX microprocessor/Intel OverDrive 
Processor emulates the 1 Mbyte address wrap- 
around that occurs on the 8086. A20M# is active 
LOW and must be a serted only when the processor 
is in real mode. The A20M# is not defined in Pro· 
tected Mode. A20M# is asynchronous but should 
meet setup and hold times t20 and t21 for recogni- 
tion in any specific clock. For correct operation of 
the chip, A20M# should not be active at the falling 
edge of RESET. When A20M# 
is asserted asyn- 
chronously, A20M# should be high (non-active) for 
2 clocks before and 2 clocks after RESET is deas· 
serted. When A20M# is asserted synchronously, 
A20M# should be high (non-active) at the clock pri- 
or to the falling edge of RESET. 


A20M# 
exhibits a minimum 4 clock latency, from 
time of assertion to masking of the A20 bit. A20M # 
is ignored during cache invalidation cycles. I/O 


writes require A20M # to be asserted a minimum of 
2 clocks prior to RDY being returned for the I/O 
write. This insures recognition of the address mask 
before the i486 SX microprocesor/lntel 
OverDrive 
Processor begins execution of the instruction follow- 
ing OUT. If A20M# is asserted after the ADS# of a 
data cycle, the A20 address signal is not masked 
during this cycle but is masked in the next cycle. 
During a prefetch (cacheable or not), if A20M# is 
asserted after the first ADS#, A20 is not masked for 
the duration of the prefetch; even if BS16# ro BS8# 
is asserted. 


6.2.16 PERFORMANCE UPGRADE SUPPORT 
SIGNAL DESCRIPTION (Available on 
PQFP Version Only) 


Upgrade Present Input (UP#) 


UP# is an input for the Intel486 SX microprocessor 
and is sampled at all times. UP# indicates that the 
performance upgrade component is present in the 
system. It is driven by the MP# pin of the Intel Over- 
Drive Processor. UP# forces the Intel486 SX CPU 
to tri-state all its outputs and enter the power-down 
mode. In the power down mode the Intel486 SX 
CPU keeps its outputs tri-stated and reduces its 
power dissipation to a minimal value (see DC specifi- 
cations). 


The following boundary scan test signals are only 
available on the PQFP version of the Intel486 SX 
CPU. 


Test Clock (TCK) 


TCK is an input to the Intel486™ SX CPU and pro- 
vides the clocking function required by the JTAG 
boundary scan feature. TCK is used to clock state 
information and data into and out of the component. 
State select information and data are clocked into 
the component on the rising eage of TCK on TMS 
and TDI, respectively. Data is clocked out of the part 
on the falling edge of TCK on TDO. 


In addition to using TCK as a free running clock, it 
may be stopped in a low, 0, state, indefinitely as 
described in IEEE 1149.1. While TCK is stopped in 
the low state, the boundary scan latches retain their 
state. 


When boundary scan is not used, TCK should be 
tied high or left as a NC. (This is important during 
power up to avoid the possibility of glitches on the 
TCK which could prematurely initiate boundary scan 
operations.) TCK is supplied with an internal pullup 
resistor. 


TCK is a clock signal and is used as a reference for 
sampling other JTAG signals. On the rising edge of 
TCK, TMS and TDI are sampled. On the falling edge 
of TCK, TDO is driven. 


Test Mode Select (TMS) 


TMS is decoded by the JTAG TAP (Tap Access 
Port) to select the operation of the test logic, as de- 
scribed in Section 5.2.4. 


To guarantee deterministic behavior of the TAP con- 
troller TMS is provided with an internal pull-up resis- 
tor. If boundary scan is not used, TMS may be tied 
high or left unconnected. TMS is sampled on the 
rising edge of TCK. TMS is used to select the inter- 
nal TAP states required to load boundary scan in- 
structions to data on TDI. For proper initialization of 
the JTAG logic, TMS should be driven high, "1", for 
at least four TCK cycles following the rising edge of 
RESET. 


Test Data Input (TDI) 


TDI is the serial input used to shift JTAG instructions 
and data into the component. The shifting of instruc- 
tions and data occurs during the SHIFT-IR and 
SHIFT-DR 
TAP 
controller 
states, 
respectively. 
These states are selected using the TMS signal as 
described in Section 5.2.4. 


An internal pullup resistor is provided on TDI to en- 
sure a known logic state if an open circuit occurs on 
the TDI path. Note that when "1" is continuously 
shifted into the instruction register, the BYPASS in- 
struction is selected. TDI is sampled on the rising 
edge of TCK, during the SHIFT-IR and the SHIFT- 
DR states. During all other TAP controller states, 
TDI is a "don't care". TDI is only sampled when 
TMS and TCK have been used to select the SHIFT- 
IR or SHIFT-DR states in the TAP controller. For 
proper initialization of JTAG logic, TDI should be 
driven high, "1", for at least four TCK cycles follow- 
ing the rising edge of RESET. 


Test Data Output (TOO) 


TDO is the serial output used to shift JTAG instruc- 
tions and data out of the component. The shifting of 
instructions and data occurs during the SHIFT-IR 
and SHIFT-DR TAP controller states, respectively. 
These states are selected using the TMS signal as 
described in Section 5.2.4. When not in SHIFT-IR or 
SHIFT-DR state, TDO is driven to a high impedance 
state to allow connecting TDO of different devices in 
parallel. TDO is driven on the falling edge of TCK 
during the SHIFT-IR and SHIFT-DR TAP controller 
states. At all other times TDO is driven to the high 
impedance state. TDO is only driven when TMS and 


TCK have been used to select the SHIFT-IR or 
SHIFT-DR states in the TAP controller. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor contain four write buffers to enhance the 
performance of consecutive writes to memory. The 
buffers can be filled at a rate of one write per clock 
until all four buffers are filled. 


When all four buffers are empty and the bus is idle, a 
write request will propagate directly to the external 
bus bypassing the write buffers, If the bus is not 
available at the time the write is generated internally, 
the write will be placed in the write buffers and prop- 
agate to the bus as soon as the bus becomes avail- 
able. The write is stored in the on-chip cache imme- 
diately if the write is a cache hit. 


Writes will be driven onto the external bus in the 
same order in which they are received by the write 
buffers. Under certain conditions a memory read will 
go onto the external bus before the memory writes 
pending in the buffer even though the writes oc- 
curred earlier in the program execution. 


A memory read will only be reordered in front of all 
writes in the buffers under the following conditions: If 
all writes pending in the buffers are cache hits and 
the read is a cache miss. Under these conditions the 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor will not read from an external memory location 
that needs to be updated by one of the pending 
writes. 


Reordering of a read with the writes pending in the 
buffers can only occur once before all the buffers 
are emptied. Reordering read once only maintains 
cache consistency. Consider the following example: 
The Intel486 SX microprocessor/Intel 
OverDrive 
Processor writes to location X. Location X is in the 
internal cache, so it is updated there immediately. 
However, the bus is busy so the write out to main 
memory is buffered (see Figure 6.3(a». At this point, 
any reads to location X would be cache hits and 
most up-to-date data would be read. 


Inle1486T'" SX 
Microprocessorl 


OverDrive Processor 


Cache 
'I-~"I'0 
I,--_~_::_:_;_...J 


The next instruction causes a read to location Y. 
Location Y is not in the cache (a cache miss). Since 
the write in the write buffer is a cache hit, the read is 
reordered. When location Y is read, it is put into the 
cache. The possibility exists that location Y will re- 
place location X in the cache. If this is true, location 
X would no longer be cached (see Figure 6.3(b». 
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Cache consistency has been maintained up to this 
point. If a subsequent read is to location X (now a 
cache miss) and it was reordered in front of the buff- 
ered write to location X, stale data would be read. 
This is why only 1 read is allowed to be reordered. 
Once a read is reordered, all the writes in the write 
buffer are flagged as cache misses to ensure that no 
more reads are reordered. Since one of the condi- 
tions to reorder a read is that all writes in the write 
buffer must be cache hits, no more reordering is al- 
lowed until all of those flagged writes propogate to 
the bus. Similarly, if an invalidation cycle is run all 
entries in the write buffer are flagged as cache 
misses. 


For multiple processor systems and/or systems us- 
ing DMA techniques, such as bus snooping, locked 
semaphores should be used to maintain cache con- 
sistency. 


6.3.1 WRITE 
BUFFERS 
AND 1/0 CYCLES 


Input/Output (I/O) cycles must be handled in a dif- 
ferent manner by the write buffers. 


I/O reads are never reordered in front of buffered 
memory writes. This insures that the Intel486 SX mi- 
croprocessor/Intel OverDrive Processor will update 
all memory locations before reading status from an 
I/O device. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor never buffers single I/O writes. When 
processing an OUT instruction, internal execution 
stops until the I/O write actually completes on the 
external bus. This allows time for the external sys- 
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tem to drive an invalidate into the Intel486 SX micro- 
processor/Intel OverDrive Processor or to mask in- 
terrupts before the Intel486 SX microprocessor/Intel 
OverDrive Processor progresses to the instruction 
following OUT. REP OUTS instructions will be buff- 
ered. 


I/O device recovery time must be handled slightly 
differently by the Intel486 SX microprocessor/Intel 
OverDrive Processor than with the Intel386 micro- 
processor. I/O device back-to-back write recovery 
times could be guaranteed by the Intel386 micro- 
processor by inserting a jump to the next instruction 
in the code that writes to the device. The jump 
forces the Intel386 microprocessor to generate a 
prefetch bus cycle which can't begin until the I/O 
write completes. 


Inserting a jump to the next write will not work with 
the 
Intel486 SX microprocessor/Intel 
OverDrive 
Processor because the prefetch could be satisfied 
by the on-chip cache. A read cycle must be explicitly 
generated to a non-cacheable location in memory to 
guarantee that a read bus cycle is performed. This 
read will not be allowed to proceed to the bus until 
after the I/O write has completed because I/O 
writes are not buffered. The I/O device will have 
time to recover to accept another write during the 
read cycle. 


6.3.2 WRITE 
BUFFERS 
IMPLICATIONS 
ON 
LOCKED 
BUS CYCLES 


Locked bus cycles are used for read-modify-write 
accesses to memory. During a read-modify-write ac- 
cess, a memory base variable is read, modified and 
then written back to the same memory location. It is 
important that no other bus cycles, generated by 
other bus masters or by the Intel486 SX microproc- 
essor/Intel OverDrive Processor be allowed on the 
external bus between the read and write portion of 
the locked sequence. 


During a locked read cycle the Intel486 SX micro- 
processor/Intel OverDrive Processor will always ac- 
cess external memory, it will never look for the loca- 
tion in the on-chip cache, but for write cycles, data is 
written in the internal cache (if cache hit) and in the 
external memory. All data pending in the Intel486 SX 
microprocessor/Intel 
OverDrive 
Processor 
write 
buffers will be written to memory before a locked 
cycle is allowed to proceed to the external bus. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor will assert the LOCK# pin after the write 
buffers are emptied during a locked bus cycle. With 
the LOCK# pin asserted, the Intel486 SX micro- 
processor/Intel 
OverDrive Processor will read the 
data, operate on the data and place the results in a 
write buffer. The contents of the write buffer will then 
be written to external memory. LOCK# will become 
inactive after the write part of the locked cycle. 


6.4 
Interrupt and Non-Maskable 
Interrupt 
Interface 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor provide two asynchronous interrupt in- 
puts, INTR (interrupt request) and NMI (non-maska- 
ble interrupt input). This section describes the hard- 
ware interface between the instruction execution 
unit and the pins. For a description of the algorithmic 
response to interrupts refer to Section 2.7. For inter- 
rupt timings refer to Section 7.2.10. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor contain a two-clock synchronizer on the 
interrupt line. An interrupt request will reach the in- 
ternal instruction execution unit two clocks after the 
INTR pin is asserted, if proper setup is provided to 
the first stage of the synchronizer. There is no spe- 
ciallogic in the interrupt path other than the synchro- 
nizer. The INTR signal is level sensitive and must 
remain active for the instruction execution unit to 
recognize it. The interrupt will not be serviced by the 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor if the INTR signal does not remain active. 


The instruction execution unit will look at the state of 
the synchronized interrupt signal at specific clocks 
during the execution of instructions (if interrupts are 
enabled). These specific clocks are at instruction 
boundaries, or iteration boundaries in the case of 
string move instructions. Interrupts will only be ac- 
cepted at these boundaries. 


An interrupt must be presented to the Intel486 SX 
microprocessor/Intel OverDrive Processor INTR pin 
three clocks before the end of an instruction for the 
interrupt to be acknowledged. Presenting the inter- 
rupt 3 clocks before the end of an instruction allows 
the interrupt to pass through the two clock synchro- 
nizer leaving one clock to prevent the initiation of the 
next sequential instruction and to begin interrupt 
service. If the interrupt is not received in time to pre- 
vent the next instruction, it will be accepted at the 
end of next instruction, assuming INTR is still held 
active. The interrupt service microcode will start af- 
ter two dead clocks. 


The longest latency between when an interrupt re- 
quest is presented on the INTR pin and when the 
interrupt service begins is: longest instruction used 
+ the two clocks for synchronization + one clock 
required to vector into the interrupt service micro- 
code. 


The NMI pin has a synchronizer like that used on the 
INTR line. Other than the synchronizer, the NMI log- 
ic is different from that of the maskable interrupt. 
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NMI is edge triggered 
as opposed 
to the level trig- 
gered INTR signal. The rising edge of the NMI signal 
is used to generate 
the interrupt 
request. 
The NMI 
input need not remain active until the interrupt 
is ac- 
tually 
serviced. 
The 
NMI pin only 
needs 
to remain 
active 
for a single 
clock 
if the 
required 
setup 
and 
hold times are met. NMI will operate 
properly 
if it is 
held active for an arbitrary 
number 
of clocks. 


The 
Intel486 
SX microprocessor 
has a built in self 
test (BIST) that can be run during reset. The BIST is 
invoked 
if the AHOLD 
pin is asserted 
for 2 clocks 
before 
and 
2 clocks 
after 
RESET 
is deasserted. 
RESET must be active for 15 clocks 
with or with no 
BIST 
being 
enabled. 
To 
ensure 
proper 
results, 
FLUSH II must not be asserted 
while BIST is execut- 
ing. Refer to Section 
8.0 for information 
on Intel486 
SX microprocessor/Intel 
OverDrive 
Processor 
testa- 
bility. 


The NMI input must be held inactive 
for at least four 
clocks 
after it is asserted 
to reset the edge triggered 
logic. A subsequent 
NMI may not be generated 
if the 
NMI is not held inactive 
for at least two clocks 
after 
being asserted. 
The 
Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
registers 
have the values 
shown 
in Table 
6.2 
after 
RESET 
is performed. 
The 
EAX 
register 
contains 
information 
on the success 
or failure of the 
BIST 
if the 
self test 
is executed. 
The 
OX register 
always 
contains 
a component 
identifier 
at the con- 
clusion 
of RESET. 
The 
upper 
byte of OX (DH) will 


The 
NMI 
input 
is internally 
masked 
whenever 
the 
NMI 
routine 
is entered. 
The 
NMI input 
will remain 
masked 
until an IRET (return from interrupt) 
instruc- 
tion 
is executed. 
Masking 
the 
NMI signal 
prevents 
recursive 
NMI calls. If another 
NMI occurs while the 
NMI is masked 
off, the pending 
NMI will be executed 
after the current 
NMI is done. Only one NMI can be 
pending 
while 
NMI is masked. 


Register 


EAX 
ECX 
EDX 
EBX 
ESP 
EBP 
ESI 
EDI 
EFLAGS 
EIP 
ES 
CS 
SS 
OS 
FS 
GS 
IDTR 
CRO 
DR7 


Table 
6.2. Register 
Values 
after 
Reset 


Initial 
Value 
(BIST) 


Zero (Pass) 
Undefined 
0400 + Revision 
10 
Undefined 
Undefined 
Undefined 
Undefined 
Undefined 
00000002h 
OFFFOh 
OOOOh 
FOOOh* 
OOOOh 
OOOOh 
OOOOh 
OOOOh 
Base = 0, Limit = 3FFh 
60000010h 
OOOOOOOOh 


Initial 
Value 
(No Blst) 


Undefined 
Undefined 
0400 + Revision 
10 
Undefined 
Undefined 
Undefined 
Undefined 
Undefined 
00000002h 
OFFFOh 
OOOOh 
FOOOh* 
OOOOh 
OOOOh 
OOOOh 
OOOOh 
Base = 0, Limit = 3FFh 
60000010h 
OOOOOOOOh 


cw 
SW 
TW 
FIP 
FEA 
FCS 
FOS 
FOP 
FSTACK 


037Fh 
OOOOh 
FFFFh 
OOOOOOOOh 
OOOOOOOOh 
OOOOh 
OOOOh 
OOOh 
Undefined 


Unchanged 
Unchanged 
Unchanged 
Unchanged 
Unchanged 
Unchanged 
Unchanged 
Unchanged 
UnchanQed 
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contain 
04 and the 
lower 
byte 
(DL) will contain 
a 
stepping 
identifier 
(see Table 6-3). 


The floating 
point registers 
are initialized 
as if the 
FINIT/FNINIT 
Onitlalize 
processor) 
Instruction 
was executed 
11the BIST was performed. 
If the 
BIST is not executed, 
the floating 
point registers 
are unchanged. 


Table 
6·3. Intel486TM 
SX Microprocessor/Intel 
OverDrive 
Processor/lntel487™ 
SX Math 
CoProcessor 
Revision 
10 


Product 
Stepping 
Component 
Revision 
10 
10 


i486SX 
AO 
04 
20 


Microprocessor 
80 
04 
22 
cAO 
04 
27 


c80 
04 
28 


i487 SX 
AO 
04 
20 


Math CoProcessor 
80 
04 
21 


OverDrive 
Processor 
A2 
04 
32 


81 
04 
33 


The 
Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
will start executing 
instructions 
at location 
FFFFFFFOH 
after 
RESET. 
When 
the first InterSeg- 
ment Jump or Call is executed, 
address 
lines A20- 
A31 will drop 
LOW for CS-relative 
memory 
cycles, 
and the Intel486 
SX microprocessor/Intel 
OverDrive 
Processor 
will only execute 
instructions 
in the lower 
one Mbyte of physical 
memory. 
This allows the sys- 
tem designer 
to use a ROM at the top of physical 
memory 
to initialize the system and take care of RE- 
SETs. 


RESET forces 
the Intel486 
SX microprocessor/Intel 
OverDrive 
Processor 
to terminate 
all execution 
and 
local 
bus activity. 
No instruction 
or bus activity 
will 
occur 
as long as RESET is active. 


The 
Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
recognizes 
and 
can 
respond 
to 
HOLD, 
AHOLD, 
and 
BOFF # 
requests 
regardless 
of 
the 
state of RESET. Thus, even though the processor 
is 
in reset, it can still float its bus in response 
to any of 
these 
requests. 
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While in reset, the Intel486 
SX microprocessor/Intel 
OverDrive 
Processor 
bus is in the 
state 
shown 
in 
Figure 
6.4 
if the 
HOLD, 
AHOLD 
and 
BOFF # 
re- 


quests 
are inactive. 
The figure shows 
the bus state 
for the 
Intel486 
SX microprocessor 
when 
the 
Intel 
OverDrive 
Processor 
is not present 
or for Intel Over- 
Drive Processor 
when 
it is installed 
in the system. 


Note that the address 
(A31-A2, 
BE3# -BEO#) 
and 
cycle definition 
(M/IO#, 
D/C#, 
W/R#) 
pins are un- 


defined 
from 
the 
time 
reset 
is 
asserted 
up 
to 
the 
start 
of the first 
bus cycle. 
All undefined 
pins 
(except 
FERRill) 
assume 
known 
values 
at the be- 
ginning 
of the first bus cycle. 
The first bus cycle 
is 
always a code fetch to address 
FFFFFFFOH. 


FERR# 
reflects 
the state of the ES (error sum- 
mary status) 
bit in the floating 
point 
unit status 
word. The ES bit is initialized 
whenever 
the float- 
ing point unit state Is initialized. 
The floating 
point 
unit's 
status 
word 
register 
can be initialized 
by 
BIST or by executing 
FINIT /FNINIT 
instruction. 
Thus, 
after 
reset 
and 
before 
executing 
the first 
FINIT 
or FNINIT 
instruction, 
the 
values 
of the 
FERR # and the numeric status word register bits 
0-7 
depends 
on whether 
or 
not 
BIST 
is per- 
formed. 
Table 
6-4 shows 
the 
state 
of FERR# 
signal after reset and before the execution 
of the 
FINIT IFNINIT 
instruction. 


BtST 
FERR# 
FPUStatus 


Performed 
Pin 
Word Register 
Bits 0-7 


YES 
Inactive 
Inactive 
(High) 
(Low) 


NO 
Undefined 
Undefined 
(Low or High) 
(Low or High) 


After 
the 
first 
FINIT 
or 
FNINIT 
instruction, 
FERR# 
pin and the FPU status word register 
bits 
(0-7) 
will be inactive 
irrrespective 
of the Built-In 
Self-Test 
(BIST). 


The bus state of the Intel486 
SX microprocessor 
during reset with Intel OverDrive 
Processor 
pres- 
ent in the system 
is shown 
in Figure 
6.5. Note 
that BOFF # is driven active 
to float most of the 
output 
signals 
during 
reset 
and 
remaining 
are 
floated 
after the falling edge of reset on sampling 
FLUSH # active. 
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NOTES: 
1. RESET is an asynchronous 
input. t20 must be met only to guarantee 
recognition 
on a specific 
clock 
edge. 
2a. When A20M# 
is driven synchronously, 
it must be driven high (inactive) 
for the ClK 
edge prior to the falling edge of RESET to ensure 
proper operation. 
A20M# 
setup and hold times 
must be met. 


2b. When A20M # is driven asynchronously, 
it must be driven 
high (inactive) 
for two ClKs 
prior to and two ClKs 
after the falling 
edge of RESET 
to ensure 
proper 
operation. 
3a. When 
FlUSH# 
is driven synchronously, 
it should 
be driven low (active) 
for the ClK 
edge prior to the falling 
edge of RESET to invoke 
the Tri-State 
Output 
Test 
Mode. All outputs 
are guaranteed 
tri-stated 
within 
10 ClKs 
of RESET being deasserted. 
FLUSH# 
setup and hold times 
must be met. 


3b. When 
FlUSH# 
is driven 
asynchronously, 
it must be driven 
low (active) 
for two ClKs 
prior to and two ClKs 
after the falling 
edge of RESET 
to invoke 
the Tri- 
State Output 
Test Mode. All outputs 
are guaranteed 
tri-stated 
within 
10 ClKs 
of RESET 
being deasserted. 


3c. FlUSH# 
must be driven high (inactive) 
during, 
Built-in-Self-Test 
(BIST). 


4. AHOlD 
should 
be driven 
high (active) 
for the ClK 
edge prior to the falling 
edge of RESET to invoke the Built-In-Self-Test 
(BIST). 
AHOlD 
setup 
and hold times 
must be met. 
5. Hold is recognized 
normally 
during 
RESET. 


6.15 
ClKs 
RESET pulse width for warm resets. 
Power-up 
resets require 
RESET to be asserted 
for at least 1 ms after Vcc 
and ClK 
are stable. 
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NOTES: 
1.15 
ClKs 
reset pulse width for warm reset. 
Power-up 
resets 
require reset to be asserted 
for at least 
1 ms after Vcc 
and ClK 
are stable. 
2. The outputs 
will be floated 
within 
10 ClKs 
of reset being deasserted. 


Figures 6.6 and 6.7 show the circuit to allow up- 
grades with an Intel OverDrive Processor or Intel487 
SX Match CoProcessor. 


6.6.1 
UPGRADE 
CIRCUIT 
FOR Intel486 
SX CPU 
IN PGA 


The circuit in Figure 6.6 is recommended for In- 
tel486 SX CPU in PGA package only. All address, 
data and control signals including CPU clock of the 
Intel486 SX microprocessor, are tied to the corre- 
sponding signals of the Intel OverDrive Processor 
Note that the UP# and FERR# pins in the Intel 
OverDrive Processor should be tied high to allow the 
system to function normally when the Intel Over- 
Drive Processor is not present. Additionally, HLDA 
of the Intel486 SX microprocessor and the Intel 
OverDrive Processor should be pulled-down and 
ORed as shown in the Figure 6.6. This will allow 
HLDA to be asserted normally when this Intel Over- 
Drive Processor is not installed in the system. This 
arrangement also takes care of the case of HOLD 
during reset in the presence of an Intel OverDrive 
Processor in the system. In this case the Intel486 SX 
microprocessor, under the influence of BOFF#, may 
not recognize HOLD, while the Intel OverDrive Proc- 
essor will sense HOLD and assert the HLDA signal. 
The OR gate on the HLDA signals avoids the con- 
tention that may have occurred from this. 


When the Intel OverDrive Processor is inserted in 
the system, the Intel OverDrive Processor UP# pin 
drives the BOFF# and FLUSH# pins of the Intel486 
SX microprocessor active. The BOFF# signal floats 
most of the outputs (except PCHK#, BREQ# and 
HLDA) of the Intel486 SX microprocessor on the 
next clock, while FLUSH# is sampled at the falling 
edge of the reset by the Intel486 SX microprocessor 
and all its outputs are floated. The Intel OverDrive 
Processor then takes charge of the bus and the sys- 
tem works normally. (See Figure 6.5 for timing de- 
tails during reset.) 


6.6.2 
UPGRADE 
CIRCUIT 
FOR Intel486 
SX CPU 
IN PQFP 


The PQFP version of the Intel486 SX CPU provides 
a direct interface with the OverDrive Processor. 
Thus the interface logic between the Intel486 SX 
CPU and the OverDrive Processor as shown in Fig- 
ure 6.6 is not required. Instead, the UP# pin of the 
OverDrive Processor is connected directly to the 
UP# pin of the Intel486 SX CPU. All other signals 
are connected together as shown in Figure 6.7. 


6.6.2.1 
Power 
Down Mode 


Power Down Mode is a new mode in the Intel486 SX 
CPU in PQFP. It is initiated by the Intel OverDrive 
Processor via the Upgrade Circuit. Upon sensing the 
presence of the Intel OverDrive Processor, Intel486 
SX microprocessor not only tri-states its outputs but 
also enters the "Power Down Mode", whereby it re- 
mains tri-stated and lowers its power consumption 
(See D.C. Specfications) 


The Intel486 SX CPU samples the UP# pin to enter 
the Power Down Mode (PDM). The UP# pin of the 
Intel486 SX microprocessor is driven active by the 
UP# pin of the Intel OverDrive Processor in the Per- 
formance Upgrade Circuit, as shown in Figure 6.7. 


UP# of the OverDrive Processor is driven active af- 
ter power-up. It drives the UP# pin of the Intel486 
SX CPU to signal that the upgrade component has 
been installed in the system. Within a few clocks the 
Intel486 SX CPU tri-states all of its outputs. This is 
followed by the power down mode after the falling 
edge of Reset. Once the Intel486 SX CPU enters the 
power down mode, it will remain in it until the next 
reset. For warm resets, the Intel486 SX CPU will re- 
main tri-stated and re-enter the power down mode 
after the reset has been de-asserted. Similarly for 
the power-up reset, if the Intel OverDrive Processor 
is not taken out of the system, the Intel486 SX mi- 
croprocessor will be tri-stated and will enter Power 
Down Mode after the falling edge of reset. 


The interfaces shown in Figure 6.6 and 6.7 are de- 
signed for a discrete OverDrive Processor socket. 
The following should be considered when designing 
an 
Intel486 
SX 
microprocesser/lntel 
OverDrive 
Processor system. 
1. The timing loops should be independent of the 
cpi. One way to attain this is to implement these 
loops in hardware and not in software (e.g., 
BIOS). 
2. Initialization routine should check the presence of 
a math coprocesser e.g., Intel OverDrive Proces- 
sor and should set the floating point related bits 
in the CRO register accordingly. Recommended 
bit pattern is given in Table 6-5. The FSTCW in- 
struction will give a value of FFFFh for the In- 
tel486 SX microprocessor and 037Fh for the Intel 
OverDrive Processor. 
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eRO Bit 
Intel486 
SX Microprocessor 
Intel OverDrive 
Processor 


EM 
1 
0 


MP 
0 
1 


NE 
1 
0, for DOS Systems 
1, for User-Defined Exception Handler 


Table 6·5. Recommended 
Values of the FP Related 
Bits for 
Intel486TM SX Microprocessor/Intel 
OverDrive 
Processor 
System 


Following is an example code to initialize the system and check for the presence of Intel486 SX microproces- 
sor/Intel OverDrive Processor. 


fninit 
fstcw 
mov 
cmp 
jz 
jmp 


mem_loc 
ax, 
mem_loc 
ax, 
037fh 
Intel 
OverDrive 
Processor_present 
Intel486 
SX microprocessor_present 
;ax=037fh 
;ax=ffffh 


If the Intel OverDrive Processor is not present, the following code can be run to set the eRO register for 
Intel486 SX microprocessor: 


mov 
and 
or 
mov 


eax, 
crO 
eax, 
fffffffdh 
eax, 
0024h 
crO, 
eax 


;make MP=O 
;make EM=l, 
NE=l 


The above initialization will cause any floating point 
instruction to generate the interrupt 7. The software 
emulation will then take control to execute these in- 
structions. This code is not required if Intel Over- 
Drive Processor is present in the system, there-upon 
the typical initialization routine for the Intel486 SX 
microprocessor will be adequate. 


Following is the interpretation of different combina- 
tions of the EM and MP bits. 


EM 
MP 
Interpretation 


0 
0 
Numeric Instructions are Passed to FPU; WAIT Ignores TS 
0 
1 
Numeric Instructions are Passed to FPU;WAIT Tests TS 
1 
0 
Numeric Instructions Trap to Emulator; WAIT Ignores TS 
1 
1 
Numeric Instructions Trap to Emulator; WAIT Tests TS 
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All data transfers occur as a result of one or more 
bus cycles. Logical data operands of byte, word and 
dword lengths may be transferred without restric- 
tions on physical address alignment. Data may be 
accessed at any byte boundary but two or three cy- 
cles may be required for unaligned data transfers. 
See Section 7.1.3 Dynamic Bus Sizing and 7.1.6 Op- 
erand Alignment. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor address signals are split into two compo- 
nents. High-order address bits are provided by the 
address lines, A2-A31. The byte enables, BEO#<- 
BE3#<,form the low-order address and provide lin- 
ear selects for the four bytes of the 32-bit address 
bus. 


The byte enable outputs are asserted when their as- 
sociated data bus bytes are involved with the pres- 
ent bus cycle, as listed in Table 7.1. Byte enable 
patterns which have a negated byte enable separat- 
ing two or three asserted byte enables will never 
occur (see Table 7.5). All other byte enable patterns 
are possible. 


Table 7.1. Byte Enables and Associated 
Data and Operand Bytes 


Byte 
Enable 
Associated Data Bus Signals 
Signal 


BEO#< 
00-07 
(byte Q-Ieast significant) 


BE1#< 
08-015 
(byte 1) 


BE2#< 
016-023 
(byte 2) 


BE3#< 
024-031 
(byte 3-most 
significant) 


Address bits AO and A1 of the physical operand's 
base address can be created when necessary. Use 
of the byte enables to create AOand A1 is shown in 
Table 7.2. The byte enables can also be decoded to 
generate BLE#<(byte low enable) and BHE#<(byte 
high enable). These signals are needed to address 
16-bit memory systems (see Section 7.1.4 Inter- 
facing with 8- and 16-bit memories). 


Table 7.2. Generating AO-A31 from 
BEO#<-BE3#<and A2-A31 


Intel486™ SX Mlcroprocell8or/ 
OverDrive Processor Address Signals 


A31 
......... 
A2 
BE3# 
BE2# 
BE1# 
BEO# 


Physical Base 
Address 


A31 
. ........ A2 
A1 
AD 


A31 
......... 
A2 
0 
0 
X 
X 
X 
Low 


A31 
......... 
A2 
0 
1 
X 
X 
Low 
High 


A31 
......... 
A2 
1 
0 
X 
Low 
High 
High 


A31 
......... 
A2 
1 
1 
Low 
High 
High 
High 


Bus cycles may access physical memory space or 
I/O space. Peripheral devices in the system may ei- 
ther be memory-mapped, or I/O-mapped, or both. 
Physical memory addresses range from OOOOOOOOH 
to FFFFFFFFH (4 gigabytes). I/O addresses range 
from OOOOOOOOH 
to OOOOFFFFH(64 Kbytes) for pro- 
grammed I/O. See Figure 7.1. 
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OOOOffffH I 
I}ACCESSiBlE 
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PROGRAMMED 
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••• 
I/O 
SPACE 


7.1.2 MEMORY AND 1/0 SPACE 
ORGANIZATION 
32·Blt Wide Organization 


ffffffffH ITIIJ 
fffffffCH 


o0000003H...... 
• 
• 
• 
• 
• 
•...J oOOOOOOOH 


BE3tJ BE2" 
BEl" 
BED" 


The Intel486 SX microprocessor/Intel 
OverOrive 
Processor datapath to memory and input! output 
(I/O) spaces can be 32-, 16- or 8-bits wide. The byte 
enable signals, BEO#-BE3#, 
allow byte granularity 
when addressing any memory or I/O 
structure 
whether 8, 16 or 32 bits wide. 


The Intel486 SX microprocessor/Intel 
OverOrive 
Processor includes bus control pins, BS16# 
and 
BS8#, which allow direct connection to 16- and 8-bit 
memories and 110 devices. Cycles to 32-, 16- and 
8-bit may occur in any sequence, since the BS8# 
and BS16# signals are sampled during each bus 
cycle. 


32-bit wide memory and I/O spaces are organized 
as arrays of physical 4-byte words. Each memory or 
I/O 4-byte word has four individually addressable 
bytes at consecutive byte addresses (see Figure 
7.2). The lowest addressed byte is associated with 
data signals 00-07; 
the highest-addressed byte 
with 024-031. 
Physical 4-byte words begin at ad- 
dresses divisible by four. 


16·Blt Wide Organization 


ffffffffH rnfffffffEH 


oOOOOOOIH...... 
• 
•...J oOOOOOOOH 


Figure 7.2. Physical Memory 
and I/O Space Organization 
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16-bit memories are organized as arrays of physical 
2-byte words. Physical 2-byte words begin at ad- 
dresses divisible by two. The byte enables 8EO#- 
8E3#, must be decoded to A1, 8lE# 
and 8HE# to 
address 16-bit memories (see 5ection 7.1.4). 


To address 8-bit memories, the two low order ad- 
dress bits AOand A1, must be decoded from 8EO#- 
8E3#. The same logic can be used for 8- and 16-bit 
memories since the decoding logic for 8lE # and AO 
are the same (see 5ection 7.1.4). 


Dynamic data bus sizing is a feature allowing proc- 
essor connection to 32-, 16- or 8-bit buses for mem- 
ory or I/O. The Intel486 5X microprocessor/Intel 
OverDrive Processor may connect to all three bus 
sizes. Transfers to or from 32-, 16- or 8-bit devices 
are supported by dynamically determining the bus 
width during each bus cycle. Address decoding cir- 
cuitry may assert 8516# 
for 16-bit devices, or 
8SS# for 8-bit devices during each bus cycle. 858 # 
and 8516# 
must be negated when addressing 
32-bit devices. An 8-bit bus width is selected if both 
8516# 
and 858# 
are asserted. 


8516# 
and 858# force the Intel486 5X microproc- 
essor/Intel 
OverDrive Processor to run additional 
bus cycles to complete requests larger than 16- or 8 
bits. A 32·bit transfer will be converted into two 16- 
bit transfers (or 3 transfers if the data is misaligned) 
when 8516# 
is asserted. Asserting 858# 
will con- 
vert a 32-bit transfer into four 8-bit transfers. 


Extra cycles forced by 8516# 
or 858# 
should be 
viewed as independent bus cycles. 8516# or 858# 
must be driven active during each of the extra cycles 
unless the addressed device has the ability to 
change the number of bytes it can return between 
cycles. 


The Intel486 5X microprocessor/Intel 
OverDrive 
Processor will drive the byte enables appropriate- 


Iy during extra cycles forced by 858# 
and 8516#. 


A2-A31 will not change if accesses are to a 32-bit 
aligned area. Table 7.3 shows the set of byte en· 
abies that will be generated on the next cycle for 
each of the valid possibilities of the byte enables on 
the current cycle. 


The dynamic bus sizing feature of the Intel486 5X 
microprocessor/Intel OverDrive Processor is signifi- 
cantly different than that of the Intel386 microproc- 
essor. Unlike the Intel386 microprocessor, the In- 
tel486 5X microprocessor/Intel 
OverDrive Proces- 
sor requires that data bytes be driven on the ad- 
dressed data pins. The simplest example of this 
function is a 32-bit aligned, 8516# 
read. When the 
Intel486 5X microprocessor/Intel OverDrive Proces- 
sor reads the two high order bytes, they must be 
driven on the data bus pins 016-031. 
The Intel486 
2 


5X microprocessor/Intel 
OverDrive Processor ex- 
pects the two low order bytes on 00-015. 
The In- 
tel386 microprocessor expects both the high and 
low order bytes on 00-015. 
The Intel386 micro- 
processor always reads or writes data on the lower 
16 bits of the data bus when 8516# 
is asserted. 


The external system must contain buffers to enable 
the 
Intel486 5X 
microprocessor/Intel 
OverDrive 
Processor to read and write data on the appropriate 
data bus pins. Table 7.4 shows the data bus lines 
where the Intel486 5X microprocessor/Intel 
Over- 
Drive Processor expects data to be returned for 
each valid combination of byte enables and bus siz- 
ing options. 


Valid data will only be driven onto data bus pins cor- 
responding to active byte enables during write cy- 
cles. Other pins in the data bus will be driven but 
they will not contain valid data. Unlike the Intel386 
microprocessor, the Intel486 5X microprocessor/In- 
tel OverDrive Processor will not duplicate write data 
onto parts of the data bus for which the correspond- 
ing byte enable is negated. 
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Next with BS8# 
Next with B516# 
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II 
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BE3# 
BE2# 
BE1# 
BEO# 
w/o BS8#/BS16# 
wBS8# 
WBS16# 


1 
1 
1 
0 
07-00 
07-00 
07-00 
1 
1 
0 
0 
015-00 
07-00 
015-00 
1 
0 
0 
0 
023-00 
07-00 
015-00 
0 
0 
0 
0 
031-00 
07-00 
015-00 
1 
1 
0 
1 
015-08 
015-08 
015-08 
1 
0 
0 
1 
023-08 
, 
015-08 
015-08 
0 
0 
0 
1 
031-08 
015-08 
015-08 
1 
0 
1 
1 
023-016 
023-016 
023-016 
0 
0 
1 
1 
031-016 
023-016 
031-016 
0 
1 
1 
1 
031-024 
031-024 
031-024 


7.1.4 INTERFACING WITH 8-, 16- AND 32-BIT 
MEMORIES 


In 32-bit physical memories such as Figure 7.3, each 
4-byte word begins at a byte address that is a multi- 
ple of four. A2-A31 are used as a 4-byte word se- 
lect. BEO# - BE3# select individual bytes within the 
4-byte word. B88# and B816# are negated for all 
bus cycles involving the 32-bit array. 


Figure 7.3.lnteI486™ SX Microprocessor/ 
Intel OverDrive Processor 
with 32-Blt Memory 


16- and 8·bit memories require external byte swap- 
ping logic for routing data to the appropriate data 
lines and logic for generating BHE#, BLE# and A1. 
In systems where mixed memory widths are used, 
extra address decoding logic is necessary to assert 
B816# or B88#. 


Figure 7.4 shows the Intel486 8X microprocessor/ 
Intel OverOrive Processor address bus interface to 
32-,16- and 8-bit memories. To address 16-bit mem- 
ories the byte enables must be decoded to produce 
A1, BHE# and BLE# (AO).For 8-bit wide memories 
the byte enables must be decoded to produce AO 
and A1. The same byte select logic can be used in 
16- and 8-bit systems since BLE# is exactly the 
same as AO(see Table 7.5). 


BEO#-BE3# 
can be decoded as shown in Table 
7.5 to generate A1, BHE# and BLE#. The byte se- 
lect logic necessary to generate BHE# and BLE# is 
shown in Figure 7.5. 


Intel486™ SX Microprocessorl 
8·, 16·Blt 
Intel OverDrive Processor 
Bus Signals 
Comments 


BE3# 
BE2# 
BE1'*' 
BED# 
A1 
BHE# 
BlE'*' (AD) 


H* 
H* 
H* 
H* 
x 
x 
x 
x-no 
active bytes 
H 
H 
H 
L 
L 
H 
L 
H 
H 
L 
H 
L 
L 
H 
H 
H 
L 
L 
L 
L 
L 
H 
L 
H 
H 
H 
H 
L 
H* 
L* 
H* 
L* 
x 
x 
x 
x-not 
contiguous bytes 
H 
L 
L 
H 
L 
L 
H 
H 
L 
L 
L 
L 
L 
L 
L 
H 
H 
H 
H 
L 
H 
L* 
H* 
H* 
L* 
x 
x 
x 
x-not 
contiguous bytes 
L* 
H* 
L* 
H* 
x 
x 
x 
x-not 
contiguous bytes 
L* 
H* 
L* 
L* 
x 
x 
x 
x-not 
contiguous bytes 
L 
L 
H 
H 
H 
L 
L 
L* 
L* 
H* 
L* 
x 
x 
x 
x-not 
contiguous bytes 
L 
L 
L 
H 
L 
L 
H 
L 
L 
L 
L 
L 
L 
L 


BLE# asserted when 00-07 
of 16-bit bus is active. 
BHE# asserted when 08-015 
of 16-bit bus is active. 


A1 low for all even words; A1 high for all odd words. 


Key: 
x = don't care 
H = high voltage level 
L = low voltage level 
* = a non-occurring pattern of Byte Enables; either none are asserted, 
or the pattern has Byte Enables asserted for non-contiguous bytes 


~ 
~ 
240950-42 


~ 


BE1/tO 
BHE/tO 
BE3ItO 


240950-43 


Combinations of BEO# -BE3 # which never occur 
are those in which two or three asserted byte en- 
ables are separated by one or more negated byte 
enables. These combinations are "don't care" con- 
ditions in the decoder. A decoder can use the non- 
occurring BEO#-BE3# 
combinations to its best ad- 
vantage. 


Figure 7.6 shows an Intel486 SX microprocessor/ln- 
tel OverDrive Processor data bus interface to 16- 
and 8-bit wide memories. External byte swapping 
logic is needed on the data lines so that data is sup- 
plied to, and received from the Intel486 SX micro- 
processor/Intel OverDrive Processor on the correct 
data pins (see Table 7.4). 
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00-07 
08-015 
016-023 
024-031 


Int.104861111 
SX 
l.4icroprocessor / 
Int.1 Ov"Ori•••.• 
11Il 
Processor 


7.1.5 DYNAMIC BUS SIZING DURING CACHE 
LINE FILLS 


B88# and B816# can be driven during cache line 
fills. The Intel486 8X microprocessor/Intel 
Over- 
Drive Processor will generate enough 8- or 16-bit 
cycles to fill the cache line. This can be up to 16 
8-bit cycles. 


The external system should assume that all byte en- 
ables are active for the first cycle of a cache line fill. 
The Intel486 8X microprocessorlintel 
OverDrive 
Processor will generate proper byte enables for sub- 
sequent cycles in the line fill. Table 7.6 shows the 
appropriate AO (BLE#), A1 and BHE# for the vari- 
ous combinations of the Intel486 8X microproces- 
sor/Intel OverDrive Processor byte enables on both 
the first and subsequent cycles of the cache line fill. 
The "." 
marks all combinations of byte enables that 
will be generated by the Intel486 8X microproces- 
sor/Intel OverDrive Processor during a cache line 
fill. 


Physical 4-byte words begin at addresses that are 
multiples of four. It is possible to transfer a logical 
operand that spans more than one physical 4-byte 
word of memory or I/O at the expense of extra cy- 
cles. Examples are 4-byte operands beginning at ad- 
dresses that are not evenly divisible by 4, or 2-byte 
words split between two physical 4-byte words. 
These are referred to as unaligned transfers. 


Operand alignment and data bus size dictate when 
multiple bus cycles are required. Table 7.7 describes 
the transfer cycles generated for all combinations of 
logical operand lengths, alignment, and data bus siz- 
ing. When multiple cycles are required to transfer a 
multi-byte logical operand, the highest-order bytes 
are transferred first. For example, when the proces- 
sor does a 4-byte unaligned read beginning at loca- 
tion x11 in the 4-byte aligned space, the three high 
order bytes are read in the first bus cycle. The low 
byte is read in a subsequent bus cycle. 


Table 7.6. Generating AO,A1 and BHE# from the Intel486TM SX 
Microprocessor/Intel 
OverDrive Processor Byte Enables 


BE3# 
BE2# 
BE1# 
BEO# 
First Cache Fill Cycle 
Any Other Cycle 
AO 
A1 
BHE# 
AO 
A1 
BHE# 


1 
1 
1 
0 
0 
0 
0 
0 
0 
1 
1 
1 
0 
0 
0 
0 
0 
0 
0 
0 
1 
0 
0 
0 
0 
0 
0 
0 
0 
0 
'0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
1 
1 
0 
1 
0 
0 
0 
1 
0 
0 
1 
0 
0 
1 
0 
0 
0 
1 
0 
0 
'0 
0 
0 
1 
0 
0 
0 
1 
0 
0 
1 
0 
1 
1 
0 
0 
0 
0 
1 
1 
'0 
0 
1 
1 
0 
0 
0 
0 
1 
0 
'0 
1 
1 
1 
, 


0 
0 
0 
1 
1 
0 


Trantf.r 
Cycl•• over 
16-8;1 Dolo 8u. 


~ = 8S16- 
A••• rlod 


Tran.f.r 
Cycl•• over 
8-Bit Data Bu. 
~ = 856- 
A••• rtod 


KEY: 
b = byte transfer 
w = 2-byte transfer 
3 = 3-byte transfer 
d = 4-byte transfer 


h = high-order portion 
I = low-order portion 
m = mid-order portion 


The 
function 
of 
unaligned 
transfers 
with 
dynamic 
bus sizing is not obvious. When the external 
systems 
asserts 
BS16# 
or BSB# 
forcing 
extra cycles, 
low- 
order 
bytes or words 
are transferred 
first (opposite 
to the example 
above). When the Intel4B6 SX micro- 
processor/Intel 
OverDrive 
Processor 
requests 
a 4- 
byte read and the external 
system 
asserts 
BS 16 # , 


the lower 2 bytes are read first followed 
by the upper 
2 bytes. 


In the unaligned 
transfer 
described 
above, the proc- 
essor requested 
three bytes on the first cycle. 
If the 
external 
system 
asserted 
BS16# 
during this 3-byte 
transfer, 
the lower word is transferred 
first followed 
by the upper byte. In the final cycle the lower byte of 
the 4-byte operand 
is transferred 
as in the 32-bit ex- 
ample above. 


7.2 
Bus Functional 
Description 


The 
Intel4B6 
SX 
microprocessor/Intel 
OverDrive 
Processor 
supports 
a wide variety of bus transfers 
to 
meet the needs 
of high performance 
systems. 
Bus 
transfers 
can be single cycle or multiple 
cycle, burst 
or non-burst, 
cacheable 
or non-cacheable, 
B-, 16- or 
32-bit, and pseudo-locked. 
To support 
multiprocess- 


ing systems 
there are cache 
invalidation 
cycles 
and 
locked 
cycles. 


4-ByteOperand 
~ 
t 
t 
byte with 
byte with 
lowest 
highest 
address 
address 


This section 
begins 
with basic 
non-cacheable 
non- 


burst single cycle transfers. 
It moves 
on to multiple 
cycle 
transfers 
and 
introduces 
the 
burst 
mode. 


Cacheability 
is introduced 
in Section 
7.2.3. The re- 


maining 
sections 
describe 
locked, 
pseudo-locked, 


invalidate, 
bus hold and interrupt 
cycles. 


Bus cycles 
and 
data 
cycles 
are discussed 
in this 
section. 
A bus cycle 
is at least two clocks 
long and 
begins with ADS # active in the first clock and ready 
active in the last clock. 
Data is transferred 
to or from 
the 
Intel4B6 
SX 
microprocessor/Intel 
OverDrive 
Processor 
during a data cycle. A bus cycle contains 
one or more data cycles. 


Refer to Section 
7.2.13 for a oescription 
of the bus 
states shown 
in the timing diagrams. 


7.2.1 NON-CACHEABLE 
NON-BURST 
SINGLE 
CYCLE 


The fastest 
non-burst 
bus cycle that the Intel4B6 SX 
microprocessor/Intel 
OverDrive 
Processor 
supports 
is two clocks 
long. These 
cycles 
are called 
2-2 cy- 


cles because 
reads and writes take two cycles each. 


The first 2 refers to reads and the second 
to writes. 


• 


For example, if a wait state needs to be added to a 
write, the cycle would be called 2-3. 


Basic two clock read and write cycles are shown in 
Figure 7.7. The Intel486 SX microprocessor/Intel 
OverDrive Processor initiates a cycle by asserting 
the address status signal (ADS#) at the rising edge 
of the first clock. The ADS# output indicates that a 
valid bus cycle definition and address is available on 
the cycle definition lines and address bus. 


The non-burst ready input (RDY#) is returned by the 
external system in the second clock. RDY# indi- 
cates that the external system has presented valid 
data on the data pins in response to a read or the 
external system has accepted data in response to a 
write. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor samples RDY# at the end of the second 
clock. The cycle is complete if RDY# 
is active 
(LOW) when sampled. Note that RDY# is ignored at 
the end of the first clock of the bus cycle. 


The burst last signal (BLAST#) is asserted (LOW) 
by the Intel486 SX microprocessor/Intel OverDrive 
Processor during the second clock of the first cycle 
in all bus transfers illustrated in Figure 7.7. This indi- 
cates that each transfer is complete after a single 
cycle. The Intel486 SX microprocessor/Intel Over- 
Drive Processor asserts BLAST# in the last cycle of 
a bus transfer. 


The timing of the parity check output (PCHK#) is 
shown in Figure 7.7. The Intel486 SX microproces- 
sor/Intel 
OverDrive Processor drives the PCHK# 
output one clock after ready terminates a read cycle. 
PCHK# indicates the parity status for the data sam- 
pled at the end of the previous clock. The PCHK# 
signal can be used by the external system. The In- 
tel486 SX microprocessor/Intel 
OverDrive Proces- 
sor does nothing in response to the PCHK# output. 


7.2.1.2 Inserting Walt States 


The external system can insert wait states into the 
basic 2-2 cycle by driving RDY# inactive at the end 
of the second clock. RDY# must be driven inactive 
to insert a wait state. Figure 7.8 illustrates a simple 
non-burst, non-cacheable signal with one wait state 
added. Any number of wait states can be added to 
an 
Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor bus cycle by maintaining RDY# inactive. 


The burst ready input (BRDY#) must be driven inac- 
tive on all clock edges where RDY# is driven inac- 
tive for proper operation of these simple non-burst 
cycles. 


7.2.2 MULTIPLE AND BURST CYCLE BUS 
TRANSFERS 


Multiple cycle bus transfers can be caused by inter- 
nal requests from the Intel486 SX microprocessor/ 
Intel OverDrive Processor or by the external memory 
system. An internal request for a 128-bit pre-fetch 
must take more than one cycle. Internal requests for 
unaligned data may also require multiple bus cycles. 
A cache line fill requires multiple cycles to complete. 


An Internal request by the Intel OverDrive Proc- 
essor for a 64-bit floating point load must take 
more than one internal cycle. 


The external system can cause a multiple cycle 
transfer when it can only supply 8 or 16 bits per 
cycle. 


Only multiple cycle transfers caused by internal re- 
quests are considered in this section. Cacheable cy- 
cles and 8- and 16-bit transfers are covered in Sec- 
tions 7.2.3 and 7.2.5. 


7.2.2.1 Burst Cycles 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor can accept burst cycles for any bus re- 
quests that require more than a single data cycle. 
During burst cycles, a new data item is strobed into 
the 
Intel486 SX microprocessor/Intel 
OverDrive 
Processor every clock rather than every other clock 
as in non-burst cycles. The fastest burst cycle re- 
quires 2 clocks for the first data item with subse- 
quent data items returned every clock. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor is capable of bursting a maximum of 32 
bits during a write. Burst writes can only occur if 
BS8# or BS16# is asserted. For example, the In- 
tel486 SX microprocessor/Intel OverDrive Proces- 
sor can burst write four 8-bit operands or two 16-bit 
operands in a single burst cycle. But the Intel486 SX 
microprocessor/Intel 
OverDrive Processor cannot 
burst multiple 32-bit writes in a single burst cycle. 


Burst cycles begin with the Intel486 SX microproc- 
essor/Intel OverDrive Processor driving out an ad- 
dress and asserting ADS# in the same manner as 
non-burst cycles. The Intel486 SX microprocessor/ 
Intel OverDrive Processor indicates that it is willing 
to perform a burst cycle by holding the burst last 
signal (BLAST#) inactive in the second clock of the 
cycle. The external system indicates its willingness 
to do a burst cycle by returning the burst ready sig- 
nal (BRDY#) active. 


The addresses of the data items in a burst cycle will 
all fall within the same 16-byte aligned area (corre- 
sponding to an internallntel486 SX microprocessor/ 
Intel OverDrive Processor cache line). A 16-byte 
aligned area begins at location XXXXXXXOand ends 
at location XXXXXXXF. During a burst cycle, only 
BEO-3#, 
A2' 
and 
A3 
may 
change. 
A4-A31, 


M/IO#, 
D/C#, 
and W/R# 
will 
remain 
stable 
throughout a burst. Given the first address in a burst, 
external hardware can easily calculate the address 
of subsequent transfers in advance. An external 
memory system can be designed to quickly fill the 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor internal cache lines. 
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Burst cycles are not limited to cache line fills. Any 
multiple cycle read request by the Intel486 SX micro- 
processor/Intel 
OverDrive Processor can be con- 
verted into a burst cycle. The Intel486 SX microproc- 
essorllntel 
OverDrive Processor will only burst the 
number of bytes needed to complete a transfer. 


For example, 
the Intel OverDrive 
Processor 
will 
burst eight 
bytes for a 64-bit 
floating 
point 
non- 
cacheable 
read. 


The external system converts a multiple cycle re- 
quest into a burst cycle by returning BRDY# active 
rather than RDY# (non-burst ready) in the first cycle 
of a transfer. For cycles that cannot be bursted such 
as interrupt acknowledge and halt, BRDY# has the 
same effect as RDY#. BRDY# is ignored if both 
BRDY# and RDY# are returned in the same clock. 
Memory areas and peripheral devices that cannot 
perform bursting must terminate cycles with RDY#. 


7.2.2.2 Terminating 
Multiple and 
Burst Cycle Transfers 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor drives BLAST# inactive for all but the last 
cycle in a multiple cycle transfer. BLAST# is driven 
inactive in the first cycle to inform the external sys- 
tem that the transfer could take additional cycles. 
BLAST# is driven active in the last cycle of the 
transfer indicating that the next time BRDY# or 
RDY# is returned the transfer is complete. 


BLAST# is not valid in the first clock of a bus cycle. 
It should be sampled only in the second and subse- 
quent clocks when RDY# or BRDY# is returned. 


The number of cycles in a transfer is a function of 
several factors including the number of bytes the In- 
tel486 SX microprocessorlintel OverDrive Proces- 
sor needs to complete an internal request (1, 2, 4, 8, 
or 16), the state of the bus size inputs (BS8# and 
BS16#), 
the 
state 
of 
the 
cache 
enable 
input 
(KEN#) and alignment of the data to be transferred. 


When the Intel486 SX microprocessor/Intel 
Over- 
Drive Processor initiates a request it knows how 
many bytes will be transferred and if the data is 
aligned. The external system must indicate whether 
the data is cacheable (if the transfer is a read) and 
the width of the bus by returning the state of the 
KEN#, BS8# and BS16# inputs one clock before 
RDY# or BRDY# is returned. The Intel486 SX mi- 
croprocessorllntel 
OverDrive Processor determines 
how many cycles a transfer will take based on its 
internal information and inputs from the external sys- 
tem. 


BLAST# is not valid in the first clock of a bus cycle 
because 
the 
Intel486 
SX 
microprocessorlintel 


OverDrive Processor cannot determine the number 
of cycles a transfer will take until the external system 
returns KEN#, BS8# and BS16#. BLAST# should 
only be sampled in the second and subsequent 
clocks of a cycle when the external system returns 
RDY# or BRDY#. 


The system may terminate a burst cycle by returning 
RDY# 
instead of BRDY#. 
BLAST# 
will remain 
deasserted until the last transfer. However, any 
transfers required to complete a cache line fill will 
follow the burst order, e.g., if burst order was 4, 0, C, 
8 and RDY# was returned at after 0, the next trans- 
fers will be from C and 8. 


7.2.2.3 Non-Cacheable, Non-Burst, Multiple 


.Cycle Transfers 


Figure 7.9 illustrates a 2 cycle non-burst, non-cache- 
able multiple cycle read. This transfer is simply a 
sequence of two single cycle transfers. The Intel486 
SX microprocessor/Intel OverDrive Processor indi- 
cates to the external system that this is a multiple 
cycle transfer by driving BLAST# inactive during the 
second clock of the first cycle. The external system 
returns RDY# active indicating that it will not burst 
the data. The external system also indicates that the 
data is not cacheable by returning KEN# inactive 
one clock before it returns RDY# active. When the 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor samples RDY# active it ignores BRDY#. 


Each cycle in the transfer begins when ADS# is 
driven active and the cycle is complete when the 
external system returns RDY# active. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor indicates the last cycle of the transfer by 
driVing BLAST# active. The next RDY# returned by 
the external system terminates the transfer. 


7.2.2.4 Non-Cacheable Burst Cycles 


The external system converts a multiple cycle re- 
quest into a burst cycle by returning BRDY# active 
rather than RDY# in the first cycle of the transfer. 
This is illustrated in Figure 7.10. 


There are several features to note in the burst read. 
ADS# is only driven active during the first cycle of 
the transfer. RDY# must be driven inactive when 
BRDY# is returned active. 


BLAST# behaves exactly as it does in the non-burst 
read. BLAST# is driven inactive in the second clock 
of the first cycle of the transfer indicating more cy- 
cles to follow. In the last cycle, BLAST# is driven 
active telling the external memory system to end the 
burst after returning the next BRDY# . 
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Any memory read can become a cache fill operation. 
The external memory system can allow a read re- 
quest to fill a cache line by returning KEN# active 
one clock before ROY# or BRDY# during the first 
cycle of the transfer on the external bus. Once 
KEN# is asserted and the remaining three require- 
ments described below are met, the Intel486 SX mi- 
croprocessor/Intel OverDrive Processor will fetch an 
entire cache line regardless of the state of KEN# . 
KEN# must be returned active in the last cycle of 
the transfer for the data to be written into the inter- 
na~ cache. The Intel486 SX microprocessor/Intel 
OverDrive Processor will only convert memory reads 
or prefetches into a cache fill. 


KEN# is ignored during write or I/O cycles. Memory 
writes will only be stored in the on-chip cache if 
there is a cache hit. I/O space is never cached in 
the internal cache. 


To transform a read or a prefetch into a cache line 
fill the following conditions must be met: 


1. The KEN# pin must be asserted one clock pri- 
or to ROY# or BRDY# being returned for the 
first data cycle. 
2. The cycle must be of the type that can be inter- 
nally cached. (Locked reads, I/O reads, and in- 
terrupt acknowledge cycles are never cached). 
3. The page table entry must have the page cache 
disable bit (PCD) set to O. To cache a page 
table 
entry, the 
page directory 
must have 
PCD=O. To cache reads or prefetches when 
paging is disabled, or to cache the page direc- 
tory entry, control register 3 (CR3) must have 
PCD=O. 
4. The cache disable (CD) bit in control register 0 
(CRO)must be clear. 


External hardware can determine when the Intel486 
SX microprocessor/Intel 
OverDrive Processor has 
transformed a read or prefetch into a cache fill by 
examining 
the 
KEN#, 
M/IO#, 
D/C#, 
W/R#, 
LOCK#, and PCD pins. These pins convey to the 
system the outcome of conditions 1-3 in the above 
list. In addition, the Intel486 SX microprocessor/Intel 
OverDrive Processor drives PCD high whenever the 
CD bit in CROis set, so that external hardware can 
evaluate condition 4. 


7.2.3.1 
Byte Enables 
during a Cache 
Line Fill 


For the first cycle in the line fill, the state of the byte 
enables should be ignored. In a non-cacheable 
memory read, the byte enables indicate the bytes 
actually required by the memory or code fetch. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor expects to receive valid data on its entire 
bus (32 bits) in the first cycle of a cache line fill. Data 
should be returned with the assumption that all the 
byte enable pins are driven active. However if BS8# 
is asserted only one byte need be returned on data 
lines 00-07. 
Similarly if BS16# 
is asserted two 
bytes should be returned on 00-015. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor will generate the addresses and byte en- 
ables for all subsequent cycles in the line fill. The 
order in which data is read during a line fill depends 
on the address of the first item read. Byte ordering is 
discussed in Section 7.2.4. 


7.2.3.2 Non-Burst 
Cacheable 
Cycles 


Figure 7.11 shows a non-burst cacheable cycle. The 
cycle becomes a cache fill when the Intel486 SX 
microprocessor/Intel OverDrive Processor samples 
KEN# active at the end of the first clock. The In- 
tel486 SX microprocessor/Intel OverDrive Proces- 
sor drives BLAST# inactive in the second clock in 
response to KEN#. BLAST# is driven inactive be- 
cause a cache fill requires 3 additional cycles to 
complete. BLAST# remains inactive until the last 
transfer in the cache line fill. KEN# must be re- 
turned active in the last cycle of the transfer for the 
data to be written into the internal cache. 


Note that this cycle would be a single bus cycle if 
KEN# was not sampled active at the end of the first 
clock. The subsequent three reads would not have 
happened since a cache fill was not requested. 


The BLAST# output is invalid in the first clock of a 
cycle. BLAST# may be active during the first clock 
due to earlier inputs. Ignore BLAST# until the sec- 
ond clock. 


During the first cycle of the cache line fill the exter- 
nal system should treat the byte enables as if they 
are all active. In subsequent cycles in the burst, the 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor drives the address lines and byte enables (see 
Section 7.2.4.2 for Burst 
and Cache 
Line Fill Or- 
der). 
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7.2.3.3 
Burst Cacheable 
Cycles 


Figure 7.12 illustrates a burst mode cache fill. As in 
Figure 7.11, the transfer becomes a cache line fill 
when the external system returns KEN# active at 
the end of the first clock in the cycle. 


The external system informs the Intel486 SX micro- 
processor/Intel OverDrive Processor that it will burst 
the line in by driving BRDY# active at the end of the 
first cycle in the transfer. 


Note that during a burst cycle ADS# is only driven 
with the first address. 
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7.2.3.4 Effect 
of Changing 
KEN# 
during a 
Cache 
Line Fill 
cache line fill. Similarly, it uses the value of KEN# in 
the last cycle, before early ROY# to load the line 
just retrieved from the memory into the cache. 
KEN# is sampled every clock, it must satisfy setup 
and hold time. 
KEN# can change multiple times as long as it ar· 
rives at its final value in the clock before ROY# or 
BRDY# is returned. This is illustrated in Figure 7.13. 
Note that the timing of BLAST# follows that of 
KEN# by one clock. The Intel486 SX microproces- 
sor/Intel OverDrive Processor samples KEN# every 
clock and uses the value returned in the clock be- 
fore ready to determine if a bus cycle would be a 


KEN# can also change multiple times before a burst 
cycle as long as it arrives at its final value one clock 
before ready is returned active. 
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either RDY,*, or BRDY,*, are active. Driving BRDY,*, 
and RDY'*' inactive adds a wait state to the transfer. 
A burst cycle where two clocks are required for ev- 
ery burst item is shown in Figure 7.14. 
7.2.4.1 Adding Walt States to Burst Cycles 


Burst cycles need not return data on every clock. 
The Intel486 SX microprocessor/Intel 
OverDrive 
Processor will only strobe data into the chip when 
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7.2.4.2 Burst and Cache Line Fill Order 


The burst order used by the Intel486 SX microproc- 
essor/lntel OverDrive Processor is shown in Table 
7.7. This burst order is followed by any burst cycle 
(cache or not), cache line fill (burst or not) or code 
prefetch. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor presents each request for data in an or- 
der determined by the first address in the transfer. 
For example, if the first address was 104 the next 
three addresses in the burst will be 100, 10C and 
108. 


First 
Second 
Third 
Fourth 
Addr. 
Addr. 
Addr. 
Addr. 


0 
4 
8 
C 
4 
0 
C 
8 
8 
C 
0 
4 
C 
8 
4 
0 


An example of burst address sequencing is shown in 
Figure 7.15. 


The sequences shown in Table 7.7 accommodate 
systems with 64-bit busses as well as systems with 
32-bit data busses. The sequence applies to all 
bursts, regardless of whether the purpose of the 
burst is to fill a cache line, do a 64-bit read, or do a 
pre-fetch. If either BS8# or BS16# is returned ac- 
tive, the Intel486 SX microprocessor/Intel OverDrive 
Processor completes the transfer of the current 
32-bit word before progressing to the next 32-bit 
word. For example, a BS16# burst to address 4 has 
the following order: 4-6-0-2-C-E-8-A. 


7.2.4.3 
Interrupted 
Burst Cycles 


Some memory systems may not be able to respond 
with burst cycles in the order defined in Table 7.7. 
To support these systems the Intel486 SX micro- 
processor/Intel OverDrive Processor allows a burst 
cycle 
to 
be 
interrupted 
at 
any 
time. 
The 


Intel486 SX microprocessor/Intel OverDrive Proces- 
sor will automatically generate another normal bus 
cycle after being interrupted to complete the data 
transfer. This is called an interrupted burst cycle. 
The external system can respond to an interrupted 
burst cycle with another burst cycle. 


The external system can interrupt a burst cycle by 
returning RDY# instead of BRDY#. RDY# can be 
returned after any number of data cycles terminated 
with BRDY#. 


An example of an interrupted burst cycle is shown in 
Figure 7.16. The Intel486 SX microprocessor/Intel 
OverDrive Processor immediately drives ADS# ac- 
tive to initiate a new bus cycle after RDY# is re- 
turned active. BLAST# driven inactive one clock af- 
ter ADS# begins the second bus cycle indicating 
that the transfer is not complete. 
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KENII need not be returned active in the first data 
cycle of the second part of the transfer in Figure 
7.16. The cycle had been converted to a cache fill in 
the first part of the transfer and the Intel486 SX mi- 
croprocessor/Intel OverDrive Processor expects the 
cache fill to be completed. Note that the first half 
and second half of the transfer in Figure 7.16 are 
each two cycle burst transfers. 


The order in which the Intel486 SX microprocessor/ 
Intel OverDrive Processor requests operands during 
an interrupted burst transfer is determined by Table 
7.7. Mixing ROYII and BRDYII does not change the 
order in which operand addresses are requested by 
the 
Intel486 SX microprocessor/lntel 
OverDrive 
Processor. 


An example of the order in which the Intel486 SX 
microprocessor/Intel OverDrive Processor requests 
operands during a cycle in which the external sys- 
tem mixes ROYII and BRDYII is shown in Figure 
7.17. The Intel486 SX microprocessor/Intel 
Over- 
Drive Processor initially requests a transfer begin- 
ning at location 104. The transfer becomes a cache 
line fill when the external system returns KENII ac- 
tive. The first cycle of the cache fill transfers the 
contents of location 104 and is terminated with 
ROYII. The Intel486 SX microprocessor/Intel Over- 
Drive Processor drives out a new request (by assert- 
ing ADSII) to address 100. If the external system 
terminates the second cycle with BRDYII, the In- 
tel486 SX microprocessor/Intel 
OverDrive Proces- 
sor will next request/expect address 10C. The cor- 
rect order is determined by the first cycle in the 
transfer, which may not be the first cycle in the burst 
if the system mixes ROYII with BRDYII. 
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Figure 7.17. Interrupted 
Burst Cycle with Unobvious 
Order of Addresses 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor supports both 16- and 8-bit external bus- 
ses through the BS16# and BS8# inputs. BS16# 
and BS8# allow the external system to specify, on a 
cycle by cycle basis, whether the addressed compo- 
nent can supply 8, 16 or 32 bits. BS16# and BS8# 
can be used in burst cycles as well as non-burst 
cycles. If both BS16# and BS8# are returned active 
for any bus cycle, the Intel486 SX microprocessor/ 
Intel OverDrive Processor will respond as if only 
BS8# were active. 


The timing of BS16# and BS8# is the same as that 
of KEN#. BS16# and BS8# must be driven active 
before the first ROY# or BRDY# is driven active. 


Driving the BS16# and BS8# active can force the 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor to run additional cycles to complete what would 
have been only a single 32-bit cycle. BS8# and 
BS16# 
may change the state of BLAST# 
when 
they force subsequent cycles from the transfer. 


Figure 7.18 shows an example in which BS8# 
forces the Intel486 SX microprocessor/Intel 
Over- 
Drive Processor to run two extra cycles to complete 
a transfer. The Intel486 SX microprocessor/lntel 
OverDrive Processor issues a request for 24 bits of 
information. The external system drives BS8# ac- 
tive indicating that only eight bits of data can be sup- 
plied per cycle. The Intel486 SX microprocessor/In- 
tel OverDrive Processor issues two extra cycles to 
complete the transfer. 
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Extra cycles forced by the BS16# and BS8# should 
be viewed as independent bus cycles. BS16# and 
BS8# should be driven active for each additional 
cycle unless the addressed device has the ability to 
change the number of bytes it can return between 
cycles. The Intel486 SX microprocessor/Intel Over- 
Drive Processor will drive BLAST# inactive until the 
last cycle before the transfer is complete. 


Refer to Section 7.1.3 for the sequencing of ad- 
dresses while BS8# or BS16# are active. 


BS8# and BS16# operate during burst cycles in ex- 
actly the same manner as non-burst cycles. For ex- 
ample, a single non-cacheable read could be trans- 
ferred 
by the 
Intel486 SX microprocessor/Intel 
OverDrive Processor as four 8-bit burst data cycles. 
Similarly, a single 32-bit write could be written as 
four 8-bit burst data cycles. An example of a burst 
write is shown in Figure 7.19. Burst writes can only 
occur if BS8# or BS16# is asserted. 
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Locked cycles are generated in software for any in- 
struction that performs a read-modify-write opera- 
tion. 
During 
a 
read-modify-write 
operation 
the 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor can read and modify a variable in external memo 
ory and be assured that the variable is not accessed 
between the read and write. 


Locked cycles are automatically generated during 
certain bus transfers. The xchg (exchange) instruc- 
tion generates a locked cycle when one of its oper- 
ands is memory based. Locked cycles are generat- 
ed when a segment or page table entry is updated 
and during interrupt acknowledge cycles. Locked cy- 
cles are also generated when the LOCK instruction 
prefix is used with selected instructions. 


Locked cycles are implemented in hardware with the 
LOCK# pin. When LOCK# is active, the Intel486 SX 


microprocessor/Intel 
OverDrive Processor is per· 


forming a read-modify-write operation and the exter- 
nal bus should not be relinquished until the cycle is 
complete. Multiple reads or writes can be locked. A 
locked cycle is shown in Figure 7.20. LOCK# goes 
active with the address and bus definition pins at the 
beginning of the first read cycle and remains active 
until RDY# is returned for the last write cycle. For 
unaligned 32 bits read-modify-write operation, the 
LOCK# remains active for the entire duration of the 
multiple cycle. It will go inactive when RDY# is re- 
turned for the last write cycle. 


When LOCK# is active, the Intel486 SX microproc- 
essor/Intel OverDrive Processor will recognize ad- 
dress hold and backoff but will not recognize bus 
hold. It is left to the external system to properly arbi- 
trate a central bus when the Intel486 SX microproc- 
essor/Intel OverDrive Processor generates LOCK#. 
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Figure 7.20. Locked Bus Cycle 


The first cycle of a 64-bit floating point write is the 
only case in which both PLOCK# and BLAST# are 
asserted. Normally PLOCK# and BLAST# are the 
inverse of each other. 


Pseudo-locked cycles assure that no other master 
will be given control of the bus during operand trans- 
fers which take more than one bus cycle. 


For the Intel486 SX microprocessor/Intel OverDrive 
Processor, 
examples 
include 
64-bit 
description 
loads and cache line fills. 


For the Intel OverDrive Processor only, 64-b1t 
floating point read and write cycles are also ex- 
amples of operand transfers which take more 
than one bys cycle. 


Pseudo-locked 
transfers 
are 
indicated 
by 
the 
PLOCK# 
pin. The 
memory operands 
must be 
aligned for correct operation of a pseudo-locked cy- 
cle. 


PLOCK# need not be examined during burst reads. 
A 64-bit aligned operand can be retrieved in one 
burst (note: this is only valid in systems that do not 
interrupt bursts). 


The system must examine PLOCK# during 64-bit 
writes since the Intel486 SX microprocessor/Intel 
OverDrive Processor cannot burst write more than 
32 bits. However, burst can be used within each 32- 
bit write cycle if BS8# or BS16# is asserted. BLAST 
will be deasserted in response to BS8# or BS16# . 
A 64-bit write will be driven out as two non-burst bus 
cycles. BLAST# is asserted during both writes since 
a burst is not possible. PLOCK# is asserted during 
the first write to indicate that another write follows. 
This behavior is shown in Figure 7.21. 


During all of the cycles where PLOCK# is asserted, 
HOLD is not acknowledged until the cycle com- 
pletes. This results in a large HOLD latency, espe- 
cially when BS8# or BS16# is asserted. To reduce 
the HOLD latency during these cycles, windows are 
available between transfers to allow HOLD to be ac- 
knowledged during non-cacheable code prefetches. 
PLOCK# will be asserted since BLAST# is negat- 
ed, but it is ignored and HOLD is recognized during 
the prefetch. 


PLOCK# can change several times during a cycle 
settling to its final value in the clock ready is re- 
turned. 


Invalidate cycles are needed to keep the Intel486 
SX microprocessor/Intel OverDrive Processor inter- 
nal cache contents consistent with external memory. 
The Intel486 SX microprocessor/Intel 
OverDrive 
Processor contains a mechanism for listening to 
writes by other devices to external memory. When 
the 
Intel486 SX microprocessor/Intel 
OverDrive 
Processor finds a write to a Section of external 
memory contained in its internal cache, the Intel486 
SX microprocessor/Intel OverDrive Processor's in- 
ternal copy is invalidated. 
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Invalidations use two pins, address hold request 
(AHOLD) and valid external 
address (EADS#). 


There are two steps in an invalidation cycle. First, 
the external system asserts the AHOLD input forcing 
the 
Intel486 
SX microprocessor/Intel 
OverDrive 
Processor to immediately relinquish its address bus. 
Next, the external system asserts EADS# indicating 
that a valid address is on the Intel486 SX microproc- 
essor/Intel OverDrive Processor address bus. Fig- 
ure 7-22 shows the fastest possible invalidation cy- 
cle. The Intel486 SX microprocessor/Intel OverDrive 
Processor recognizes AHOLD on one CLK edge and 
floats the address bus in response. To allow the ad- 
dress bus to float and avoid contention, EADS# and 
the invalidation address should not be driven until 
the following CLK edge. The Intel486 SX microproc- 
essor/Intel OverDrive Processor reads the address 
over its address lines. If the Intel486 SX microproc- 
essor/Intel OverDrive Processor finds this address 
in its internal cache, the cache entry is invalidated. 
Note that 
the 
Intel486 SX microprocessor/Intel 
OverDrive Processor address bus is input! output un- 
like the Intel386 microprocessor's bus, which is out- 
put only. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor immediately relinquishes its address bus 
in the next clock upon assertion of AHOLD. For ex- 
ample, the bus could be 3 wait states into a read 
cycle. If AHOLD is activated, the Intel486 SX micro- 
processor/Intel OverDrive Processor will immediate- 


Iy float its address bus before ready is returned ter- 
minating the bus cycle. 


When AHOLD is asserted only the address bus is 
floated, the data bus can remain active. Data can be 
returned for a previously specified bus cycle during 
address hold (see Figures 7.22, 7.23). 


EADS# is normally asserted when an external mas- 
ter drives an address onto the bus. AHOLD need not 
be driven for EADS# to generate an internal invali- 
date. If EADS# alone is asserted while the Intel486 
SX microprocessor/Intel 
OverDrive Processor is 
driving the address bus, it is possible that the invali- 
dation address will come from the Intel486 SX mi- 
croprocessor/Intel OverDrive Processor itself. 


Note that it is also possible to run an invalidation 
cycle by asserting EADS# when HOLD or BUFF# is 
asserted. 


Running an invalidate cycle prevents the Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
cache from satisfying other internal requests, so in- 
validations should be run only when necessary. The 
fastest possible invalidate cycle is shown in Figure 
7.22, while a more realistic invalidation cycle is 
shown in 7.23. Both of the examples take one clock 
of cache access from the Intel486 SX microproces- 
sor/Intel OverDrive Processor. 
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Figure 7.22. Fast Internal Cache Invalidation Cycle 
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Figure 7.23. Typical Internal Cache Invalidation Cycle 
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7.2.8.1 
Rate of Invalidate 
Cycles 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor can accept one invalidate per clock ex- 
cept in the last clock of a line fill. One invalidate per 
clock is possible as long as EADS# is negated in 
ONE or BOTH of the following cases: 
1. In the clock ROY# or BRDY# is returned for 
the last time. 
2. In the clock following ROY# or BRDY# being 
returned for the last time. 


This definition allows two system designs. Simple 
designs can restrict invalidates to one every other 
clock. The simple design need not track bus activity. 
Alternatively, systems can request one invalidate 
per clock provided that the bus is monitored. 


7.2.8.2 
Running 
Invalidate 
Cycles 
Concurrently 
with Line Fills 


Precautions are necessary to avoid caching stale 
data in the Intel486 SX microprocessor/lntel Over- 
Drive Processor cache in a system with a second 
level cache. An example of a system with a second 
level cache is shown in Figure 7.24. An external de- 
vice can be writing to main memory over the system 
bus while the 
Intel486 SX microprocessor/Intel 
OverDrive Processor is retrieving data from the sec- 
ond level cache. The Intel486 SX microprocessor/ 
Intel OverDrive Processor will need to invalidate a 
line in its internal cache if the external device is writ- 
ing to a main memory address also contained in the 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor cache. 
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Figure 7.24. System 
with Second 
Level Cache 


A potential problem exists if the external device is 
writing to an address in external memory, and at the 
same time the Intel486 SX microprocessor/Intel 
OverDrive Processor is reading data from the same 
address in the second level cache. The system must 
force an invalidation cycle to invalidate the data that 
the 
Intel486 SX microprocessor/Intel 
OverDrive 
Processor has requested during the line fill. 


If the system asserts EADS# before the first data in 
the line fill is returned to the Intel486 SX microproc- 
essor/Intel OverDrive Processor, the system must 
return data consistent with the new data in the exter- 
nal memory upon resumption of the line fill after the 
invalidation cycle. This is illustrated by the asserted 
EADS# signal labeled 1 in Figure 7.25. 
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NOTES: 
1. Data returned 
must be consistent 
if its address 
equals 
the invalidation 
address 
in this clock 
2. Data returned 
will not be cached 
if its address 
equals 
the invalidation 
address 
in this clock 


If the system 
asserts 
EADS # at the same 
time or 
after 
the first data in the line fill is returned 
(in the 
same 
clock 
that 
the first 
RDY # 
or BRDY # 
is re- 
turned 
or any subsequent 
clock 
in the line fill) the 
data will be read into the Intel486 
SX microproces- 
sor/Intel 
OverDrive 
Processor 
input buffers but it will 
not be stored 
in the on-chip 
cache. This is illustrated 
by asserted 
EADS# 
signal labeled 
2 in Figure 7.25. 
The stale data will be used to satisfy the request that 
initiated 
the cache fill cycle. 


The 
Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
provides 
a bus hold, 
hold 
acknowledge 
protocol 
using the bus hold request 
(HOLD) and bus 
hold acknowledge 
(HLDA) pins. Asserting 
the HOLD 
input indicates 
that another 
bus master desires 
con- 
trol 
of the 
Intel486 
SX microprocessor/Intel 
Over- 
Drive Processor 
bus. The Intel486 
SX microproces- 


sor/lntel 
OverDrive 
Processor 
will respond 
by float- 
ing its bus and driving HLDA active when the current 
bus cycle, or sequence 
of locked cycles is complete. 
An example 
of a HOLD/HLDA 
transaction 
is shown 
in Figure 7.26a. Unlike the Intel386 
microprocessor, 
the 
Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
can respond 
to HOLD by floating 
its bus 
and asserting 
HLDA while RESET is asserted. 


Note that HOLD will be recognized 
during un-aligned 
writes 
(less than or equal to 32-bits) 
with BLAST # 
being active for each write. For greater than 32-bit or 
un-aligned 
write, 
HOLD# 
recognition 
is prevented 
by PLOCK # 
getting 
asserted. 
However, 
HOLD 
is 
recognized 
during 
non-cachable, 
non-burstable 
code prefetches 
even though 
PLOCK # is active. 


For cacheable 
and 
nonbursted 
or bursted 
cycles, 


HOLD is acknowledged 
during backoff 
only if HOLD 
and BOFF # are asserted 
during an active bus cycle 
(after ADS# 
asserted) 
and before the first RDY# 
or 
BRDY # has been returned 
(see Figure 7.26b). The 
order in which HOLD and BOFF # go active is unim- 
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portant 
(SO long as both are active 
prior to the first 
ROY II /BROY II 
returned 
by 
the 
system). 
Figure 
7.26b shows the case where 
HOLD is asserted 
first; 
HOLD 
could 
be 
asserted 
simultaneously 
or 
after 
BOFF II and still be acknowledged. 


The pins floated 
during bus hold are: BEOII-BE311, 
PCO, 
PWT, 
W/RII, 
O/CII, 
M/IOII, 
LOCK II , 
PLOCK II , 
ADS II , 
BLAST II , 
00-031, 
A2-A31, 
OPO-OP3. 


The 
Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
generates 
interrupt 
acknowledge 
cycles 
in response 
to maskable 
interrupt 
requests 
generat- 
ed on the interrupt 
request 
input (INTR) pin. Interrupt 
acknowledge 
cycles have a unique cycle type gener- 
ated on the cycle type pins. 
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An example 
of an interrupt 
acknowledge 
transaction 


is shown 
in Figure 
7.27. Interrupt 
acknowledge 
cy- 
cles 
are generated 
in locked 
pairs. 
Data 
returned 
during the first cycle is ignored. 
The interrupt 
vector 


is returned 
during the second 
cycle 
on the lower 8 
bits of the data bus. The Intel486 
SX microproces- 


sor/Intel 
OverDrive 
Processor 
has 256 possible 
in- 


terrupt 
vectors. 


\---.---~ 


The state 
of A2 distinguishes 
the first and second 


interrupt 
acknowledge 
cycles. 
The 
byte 
address 


driven during the first interrupt 
acknowledge 
cycle is 


4 (A31-A3 
low, A2 high, 
BE3 # -BE1 
# 
high, 
and 


BEO# 
low). The address 
driven 
during 
the second 


interrupt 
acknowledge 
cycle 
is 
0 
(A31-A2 
low, 


BE3#-BE1 
# high, BEO# 
low). 
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Each of the interrupt acknowledge cycles are termi- 
nated when the external system returns ROY# or 
BRDY#. Wait states can be added by withholding 
ROY# or BRDY#. The Intel486 SX microproces- 
sor/Intel OverDrive Processor automatically gener- 
ates four idle clocks between the first and second 
cycles to allow for 8259A recovery time. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor provides four special bus cycles to indi- 
cate that certain instructions have been executed, or 
certain conditions have occurred internally. The spe- 
cial bus cycles in Table 7.8 are defined when the bus 
cycle definition pins are in the following state: 
M/IO# =0, D/C# =0 and W/R# = 1. During these 
cycles the address bus is driven low while the data 
bus is undefined. 


Two of the special cycles indicate halt or shutdown. 
Another special cycle is generated when the In- 
tel486 SX microprocessor/lntel 
OverDrive Proces- 
sor executes an INVD (invalidate data cache) in- 
struction and could be used to flush an external 
cache. The Write Back cycle is generated when the 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor executes the WBINVD (write-back invalidate 
data cache) instruction and could be used to syn- 
chronize an external write-back cache. 


The external hardware must acknowledge these 
special bus cycles by returning ROY# or BRDY#. 


Table 7.8. Special Bus Cycle Encoding 


BE3# 
BE2# 
BE1# 
BEO# 
Special 


Bus Cycle 


1 
1 
1 
0 
Shutdown 
1 
1 
0 
1 
Flush 
1 
0 
1 
1 
Halt 
0 
1 
1 
1 
Write Back 


7.2.11.1 Halt Indication Cycle 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor halts as a result of executing a HALT in- 
struction. Signaling its entrance into the halt state, a 
halt indication cycle is performed. The halt indication 
cycle is identified by the bus definition signals in spe- 
cial bus cycle state and a byte address of 2. BEO# 
and BE2# are the only signals distinguishing halt 
indication from shutdown indication, which drives an 
address of o. During the halt cycle undefined data is 
driven on 00-031. 
The halt indication cycle must be 
acknowledged by READY# asserted. 
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Figure 7.28. Restarted Read Cycle 
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A 
halted 
Intel486 
SX 
microprocessor/Intel 
Over- 
Drive 
Processor 
resumes 
execution 
when 
INTR 
(if 
interrupts 
are enabled) 
or NMI or RESET 
is assert- 
ed. 


7.2.11.2 
Shutdown 
Indication 
Cycle 


The 
Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
shuts 
down 
as a result 
of a protection 
fault while attempting 
to process 
a double 
fault. Sig- 
naling its entrance 
into the shutdown 
state, 
a shut- 
down 
indication 
cycle 
is performed. 
The 
shutdown 
indication 
cycle is identified 
by the bus definition 
sig- 
nals in special bus cycle state and a byte address 
of 
O. 


In a multi-master 
system 
another 
bus master 
may 
require the use of the bus to enable 
the Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
to com- 
plete its current 
bus request. 
In this situation 
the In- 
tel486 
SX microprocessor/Intel 
OverDrive 
Proces- 
sor will need to restart 
its bus cycle after the other 
bus master 
has completed 
its bus transaction. 


A bus cycle may be restarted 
if the external 
system 
asserts 
the backoff 
(BOFF#) 
input. The Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
samples 
the BOFF# 
pin every clock. The Intel486 
SX micro- 
processor/Intel 
OverDrive 
Processor 
will immediate- 
ly (in the 
next 
clock) 
float 
its address, 
data 
and 
status 
pins 
when 
BOFF # 
is asserted 
(see 
Fig- 


ure 7.28). Any bus cycle in progress 
when BOFF# 
is 
asserted 
is aborted 
and 
any data 
returned 
to the 
processor 
is ignored. 
The same 
pins are floated 
in 
response 
to BOFF # as are floated 
in response 
to 
HOLD. 
HLDA 
is 
not 
generated 
in 
response 
to 
BOFF #. 
BOFF # has higher 
priority 
than RDY # or 
BRDY #. If either 
RDY # or BRDY # are returned 
in 
the same clock 
as BOFF #, BOFF # takes effect. 


The device 
assertihg 
BOFF # is free to run any cy- 
cles it wants while the Intel486 
SX microprocessor/ 
Intel OverDrive 
Processor 
bus is in its high imped- 
ance state. If backoff 
is requested 
after the Intel486 
SX microprocessor/Intel 
OverDrive 
Processor 
has 
started 
a cycle, 
the 
new 
master 
should 
wait 
for 
memory 
to return 
RDY # or BRDY # before 
assum- 
ing control 
of the bus. Waiting 
for ready provides 
a 
handshake 
to 
insure 
that 
the 
memory 
system 
is 
ready to accept 
a new cycle. 
If the bus is idle when 
BOFF # 
is asserted, 
the 
new 
master 
can 
start 
its 
cycle two clocks 
after issuing 
BOFF # . 


The external 
memory 
can view BOFF # in the same 
manner as BLAST # . Asserting 
BOFF # tells the ex- 


ternal 
memory 
system 
that the current 
cycle 
is the 
last cycle in a transfer. 


The bus remains 
in the high impedance 
state 
until 
BOFF # is negated. 
Upon negation, 
the Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
restarts 
its bus cycle 
by driving 
out the address 
and status 
and asserting 
ADS #. The bus cycle then continues 
as usual. 
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Asserting BOFF# during a burst, BS8# or BS16# 
cycle will force the Intel486 SX microprocessor/Intel 
OverDrive Processor to ignore data returned for that 
cycle only. Data from previous cycles will still be val- 
id. For example, if BOFF# is asserted on the third 
BRDY# of a burst, the Intel486 SX microprocessor/ 
Intel OverDrive Processor assumes the data re- 
turned with the first and second BRDY# 's is correct 
and restarts the burst beginning with the third item. 
The same rule applies to transfers broken into multi- 
ple cycle by BS8# or BS16#. 


has begun even-though the cycle was aborted. 
There are two possible solutions to this problem. 
The first is to have all devices recognize this condi- 
tion and ignore ADS# until ready comes back. The 
second approach is to use a "two clock" backoff: in 
the first clock AHOLD is asserted, and in the second 
clock BOFF# 
is asserted. This guarantees that 
ADS# will not be floating low. This is only necessary 
in systems where BOFF# may be asserted in the 
same clock as ADS# . 


Asserting BOFF# in the same clock as ADS# will 
cause the Intel486 SX microprocessor/Intel 
Over- 
Drive Processor to float its bus in the next clock and 
leave ADS# floating low. Since ADS# 
is floating 
low, a peripheral may think that a new bus cycle 


A bus state diagram is shown in Figure 7.30. A de- 
scription of the signals used in the diagram is given 
in Table 7.9. 


(RDY# 
ASSERTED + (BRDY# 
• BLAST#)ASSERTED) 
• 
(HOLD 
+ AHOLD + NO REQUEST) 
• 
BOFF# 
NEGATED 


REQUEST PENDING 
• 
(RDY# 
ASSERTED + (BRDY# 
• BLAST#)ASSERTED) 
• 
HOLD NEGATED 
• 
AHOLD NEGATED. 
BOFF# 
NEGATED· 


BOFF# 
NEGATE/ 


BOFF# 
~~. 'v~ 


ASSERTED 
./ 
~..,'v¢' 


/BOFF# 
ASSERTED 


AHOLD NEGATED. 
BOFF# 
NEGATED. 
(HOLD 
NEGATED·) 


• HOLD is only factored 
into this state transition 
jf Tb was entered 
while a 


non-cocheoble, 
non-bursted, 
code prefetch 
was in 
progrns. 
Otherwise. 
ignore 
HOLD. 


Figure 7.30. Bus State Diagram 


Table 7.9. Bus State Description 


State 
Means 


Ti 
Bus is idle. Address and status signals may be driven to undefined values, or 
the bus may be floated to a high impedance state. 


T1 
First clock cycle of a bus cycle. Valid address and status are driven and 
ADS# is asserted. 


T2 
Second and subsequent clock cycles of a bus cycle. Data is driven if the 
cycle is a write, or data is expected if the cycle is a read. ROY# and BRDY# 
are sampled. 


T1b 
First clock cycle of a restarted bus cycle. Valid address and status are driven 
and ADS# is asserted. 


Tb 
Second and subsequent clock cycles of an aborted bus cycle. 
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The Intel OverDrive Processor provides two 0p- 
tions for reporting floating point errors. The sim- 
plest method is to raise interrupt 16 whenever an 
unmasked floating point error occurs. This option 
may be enabled by setting the NE bit In control 
register 0 (CRO). 


The Intel OverDrive Processor also provides the 
option of allowing external hardware to deter- 
mine how floating point errors are reported. This 
option is necessary for compatibility with the er- 
ror reporting scheme used in DOS based sys- 
tems. The NE bit must be cleared in CROto en- 
able user-defined error reporting. User-deflned 
error reporting is the default condition because 
the NE bit is cleared on reset. 


Two pins, floating point error (FERR#) and ig- 
nore numeric error (IGNNE#), are provided to 
direct the actions of hardware if user-deflned er- 
ror reporting is used. The Intel OverDrive Proces- 
sor asserts the FERR# output to indicate that a 
floating point error has occurred. FERR# corre- 
sponds to the ERROR# pin on the Inte1387Math 
CoProcessor. However, there is a difference In 
the behavior of the two. 


In some cases FERR# is asserted when the 
next floating point instruction is encountered, and 
in other cases it is asserted before the next float- 
ing point instruction is encountered depending 
upon the execution state of the instruction caus- 
ing the exception. 


The following class of floating point exceptions 
drive FERR# at the time the exception occurs 
O.e.,before encountering the next floating point 
instruction). 
1. The stack fault, invalid operation, and denor- 
mal exceptions on all transcendental instruc- 
tions, integer arithmetic instructions, FSQRT, 
FSEAlE, 
FPREM(1), FXTRACT, FBlD, and 
FBSTP. 
2. Any exceptions on store instructions (includ- 
ing integer store instructions). 


The following class of floating point exceptions 
drive FERR# only after encountering the next 
floating point instruction. 
1. Exceptions other than on all transcendental 
instructions, integer arithmetic instructions, 
FSQRT, 
FSCALE, 
FPREM(1}, 
FXTRACT, 
FBlD, and FBSTP. 
2. Any exception on all basic arithmetic, load, 
compare, and control instructions (i.e., all 0th- 
er instructions). 


For both sets of exceptions above, the 387 Math 
CoProcessor asserts ERROR# when the error 
occurs and does not walt for the next floating 
point instruction to be encountered. 


IGNNE# is an input to the Intel OverDrive Proc- 
essor When the NE bit in CRO is cleared, and 
IGNNE# is asserted, the Intel OverDrive Proces- 
sor will ignore a user floating point error and con- 
tinue executing floating point instructions. When 
IGNNE# is negated, the IGNNE# is an input to 
the Intel OverDrive Processor will freeze on float· 
ing point instructions which get errors (except for 
the 
control 
instructions 
FNClEX, 
FNINIT, 
FNSAVE, 
FNSTENV, 
FNSTCW, 
FNSTSW, 
FNSTSW AX, FNENI, FNDISI and FNSETPM). 
IGNNE# may be asynchronous to the Intel Over- 
Drive Processor clock. 


In systems with user-defined error reporting, the 
FERR# pin is connected to the interrupt control- 
ler. When an unmasked floating point error oc- 
curs, an interrupt is raised. If IGNNE# is high at 
the time of this interrupt, the Intel OverDrive 
Processor will freeze (disallowing execution of a 
subsequent floating point instruction) until the in- 
terrupt 
handler 
is 
invoked. 
By 
driving 
the 
IGNNE# pin low (when clearing the Interrupt re- 
quest), the interrupt handler can allow execution 
of a floating point instruction, within the interrupt 
handler, before the error condition Is cleared (by 
FNClEX, FNINIT, FNSAVE or FNSTENV). If exe· 
cution of a non-control floating point instruction, 
within the floating point interrupt handler, Is not 
needed, the IGNNE# pin can be tied HIGH. 


7.2.15 FLOAnNG POINT ERROR HANDLING 
IN AT COMPAnBLE SYSTE 


The Intel OverDrive Processor provides apeclal 
features to allow the implementation of an AT 
compatible 
numerics error reporting scheme. 
These features DO NOT replace the external cir- 
cuit logic is stlII required that decodes the OUT 
FO Instruction and latches the FERR# signal. 
What follows is a description of the use of these 
Intel OverDrive Processor features. 


The features provided by the Intel OverDrive 
Processor are the NE bit in the Machine Status 
Register, the IGNNE# pin, and the FERR"# pin. 


The NE bit determines the action taken by the 
Intel OverDrive Processor when a numerics error 
is detected. When set this bit signals that non- 
COS compatil;>le error handling will be imple- 
mented. In this mode the Intel OverDrive Proces>- 
sor takes a software exception (16) if a numerics 
error is detected. 


If the NE bit Is reset the Intel OverDrive Proces- 
sor uses the IGNNE# pin to allow an external 
circuit to control the time at which non-control 
numerics instructions are allowed to execute. 
Note that floating point control instructions such 
as FNINIT and FNSAVE can be executed during 
a floating point error condition regardless of the 
state of IGNNE#. 


To process a floating point error in the OOSenvi- 
ronment the following sequence must take place: 
1. The error is detected by the Intel OverDrive 
Processor which activates the FEAR# pin. 
2. FERR# is latched so that it can be cleared by 
the OUT FOinstruction. 
3. The latched FERR# signal activates an inter- 
rupt at the interrupt contrOller. This interrupt is 
usually handled on IAQ13. 


4. The Interrupt Service Aoutine (ISA) handles 
the error and then clears the interrupt by eXe- 
cuting an OUT instruction to portFO. The ad- 
dress FO is decoded externally to clear the 
FERR# latch. The IGNNE# Signalis alsoacti- 
vated by the decoder outptIt. 


5. Usually the ISR then executes an FNINIT in- 
struction or other control instruction before re- 
starting 
theptogram. 
FNINIT 
clears 
the 
FERR# output 


Figure 7.31 illustrates the circuit required to per- 
form this function. Note that this circuit has not 
been tested. It is included as an example of the 
required error handling logic. 


Note that the IGNNE" 
input allows non-control 
• 
instructions to be executed prior to the time the 
FERR#- signal is reset by the Intel OverDrive 
Processor. This function is lrnplementad to allow 
exact compatibility with the AT implementation. 
Most programs reinltlalize the floating point unit 
before continuing after an error is detected. The 
floating point unit can be reinltlalized using one of 
the 
follOWing 
four 
instructions: 
FCLEX, 
FIN1T.FSAVE, FSTENV. 
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Testing in the 
Intel486 SX microprocessor/Intel 
OverDrive Processor can be divided into two catego- 
ries: Built-in Self Test (BIST) and external testing. 
The BIST tests the non-random logic, control ROM 
(CROM), translation lookaside buffer (TLB) and on- 
chip cache memory. External tests can be run on the 
TLB and the on-chip cache. The Intel486 SX micro- 
processor/Intel OverDrive Processor also has a test 
mode in which all outputs are tristated. 


The BIST is initiated by asserting AHOLD (address 
hold) on the falling edge of RESET. AHOLD is a 
synchronous signal only. It should be asserted in the 
clock prior to RESET going from High to Low to start 
BIST. FLUSH# must also be asserted (driven low) 
prior to the falling edge of RESET to start BIST. 
FLUSH# must be deasserted (driven high) during 
BIST. A20M# must be deasserted (driven high) dur- 
ing the falling edge of RESET to start BIST. The 
BIST takes approximately 220 clocks, or approxi- 
mately 54 milliseconds with a 20 MHz Intel486 SX 
microprocessor/lntel487 
SX 
Math 
CoProcessor. 


The BIST takes approximately 600 thousand bus 
clocks, or approximately 24 milliseconds, with a 
25 MHz OverDrive Processor. No bus cycles will be 
run by the Intel486 SX microprocessor/Intel Over- 
Drive Processor until the BIST is concluded. Note 
that for the Intel486 SX microprocessor/Intel Over- 
Drive Processor the RESET must be active for 15 
clocks with or without BIST being enabled for warm 
resets. 


The results of BIST is storedjn 
the EAX register. 
The Intel486 SX microprocessor/Intel 
OverDrive 
Processor has successfully passed the BIST if the 
contents of the EAX register are zero. If the results 
in EAX are not zero then the BIST has detected a 
flaw in the Intel486 SX microprocessor/Intel Over- 
Drive Processor. The Intel486 SX microprocessor/ 
Intel OverDrive Processor performs reset and begins 
normal operation at the completion of the BIST. 


The non-random logic, control ROM, on-chip cache 
and translation lookaside buffer (TLB) are tested 
during the BIST. 


The cache portion of the BIST verifies that the 
cache is functional and that it is possible to read and 
write to the cache. The BIST manipulates test regis- 
ters TR3, TR4 and TR5 while testing the cache. 
These test registers are described in Section 8.2. 


The cache testing algorithm writes a value to each 
cache entry, reads the value back, and checks that 
the correct value was read back. The algorithm may 
be repeated more than once for each of the 512 
cache entries using different constants. 


The TLB portion of the BIST verifies that the TLB is • 
functional and that it is possible to read and write to 
the TLB. The BIST manipulates test registers TR6 
and TR7 while testing the TLB. TR6 and TR7 are 
described in Section 8.3. 


The on-chip cache testability hooks are designed to 
be accessible during the BIST and for assembly lan- 
guage testing of the cache. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor contains a cache fill buffer and a cache 
read buffer. For testability writes, data must be writ- 
ten to the cache fill buffer before it can be written to 
a location in the cache. Data must be read from a 
cache location into the cache read buffer before the 
microprocessor can access the data. The cache fill 
and cache read buffer are both 128 bits wide. 


8.2.1 CACHE 
TESTING 
REGISTERS 
TR3, TR4 
AND TR5 


Figure 8.1 shows the three cache testing registers: 
the Cache Data Test Register (TR3), the Cache 
Status Test Register (TR4) and the Cache Control 
Test Register (TR5). External access to these regis- 
ters is provided through MOV reg,TREG and MOV 
TREG, reg instructions. 
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The cache fill buffer and the cache read buffer can 
only be accessed through TR3. Data to be written to 
the cache fill buffer must first be written to TR3. Data 
read from the cache read buffer must be loaded into 
TR3. 


TR3 is 32 bits wide while the cache fill and read 
buffers are 128 bits wide. 32 bits of data must be 
written to TR3 four times to fill the cache fill buffer. 
32 bits of data must be read from TR3 four times to 
empty the cache read buffer. The entry select bits in 
TR5 determine which 32 bits of data TR3 will access 
in the buffers. 


TR4 handles tag, LRU and valid bit information dur- 
ing cache tests. TR4 must be loaded with a tag and 
a valid bit before a write to the cache. After a read 
from a cache entry, TR4 contains the tag and valid 
bit from that entry, and the LRU bits and four valid 
bits from the accessed set. 


TR5 specifies which testability operation will be per- 
formed and the set and entry within the set which 
will be accessed. 


The seven bit set select field determines which of 
the 128 sets will be accessed. 


The functionality of the two entry select bits depend 
on the state of the control bits. When the fill or read 
buffers are being accessed, the entry select bits 
point to the 32-bit location in the buffer being ac- 
cessed. When a cache location is specified, the en- 
try select bits point to one of the four entries in a set. 
Refer to Table 8.1. 


Five testability functions can be performed on the 
cache. The two control bits in TR5 specify the oper- 
ation to be executed. The five operations are: 
1. Write cache fill buffer 
2. Perform a cache testability write 
3. Perform a cache testability read 
4. Read the cache read buffer 
5. Perform a cache flush 


Table 8.1 shows the encoding 
of the two control 
bits 
in TR5 for the cache 
testability 
functions. 
Table 
8.1 
also shows 
the functionality 
of the entry and set se- 
lect bits for each control 
operation. 


The cache tests attempt 
to use as much of the nor- 
mal operating 
circuitry 
as possible. 
Therefore 
when 
cache tests are being performed, 
the cache must be 
disabled 
(the 
CD and 
NW 
bits 
in control 
register 
must be set to 1 to disable 
the cache. 
See Section 
5). 


A testability 
write to the cache is a two step process. 


First the cache 
fill buffer 
must be loaded 
with 
128 
bits of data and TR4 loaded 
with the tag and valid 
bit. Next the contents 
of the fill buffer are written to a 
cache location. 
Sample 
assembly 
code to do a write 
is given in Figure 8.2. 


Table 8.1. Cache Control 
Bit Encoding 
and Effect of 
Control 
Bits on Entry Select and Set Select Functionality 


Control 
Bits 
Operation 
Entry Select Bits 
Set Select Bits 
Bit 1 
Bit 0 
Function 


0 
0 
Enable 
{ 
Fill Buffer Write 
Select 32-bit location 
in fill/read 
Read Buffer Read 
buffer 
- 


0 
1 
Perform Cache Write 
Select an entry in set. 
Select a set to write to 


1 
0 
Perform Cache Read 
Select an entry in set. 
Select a set to read from 


1 
1 
Perform Flush Cache 
- 
- • 
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Sample 
Assembly 
Code 


An example 
assembly 
language 
sequence 
to perform 
a cache write is: 


eax. 
ebx. 
ecx. 
edx 
contain 
the 
cache 
line 
to write 
edi 
contains 
the tag 
information 
to load 
eRO already 
says to enable 
reads/write 
to TR5 


fill 
the 
cache 
buffer 
mov 
esi,O 
mov 
tr5,esi 
mov 
tr3,eax 
mov 
esi,4 
mov 
tr5,esi 
mov 
tr3,ebx 
mov-esi,8 
mov 
tr5,esi 
mov 
tr3,ecx 
mov 
esi,Och 
mov 
tr5,esi 
mov 
tr3,edx 


set up command 
load 
to TR5 
load 
data 
into 
cache 
fill buffer 


mov 
esi,l 
mov 
tr5,esi 


mov 
esi,2 
mov 
tr5,esi 


read 
the data 
from 
the read 
buffer 


mov 
esi,O 
mov 
tr5,esi 
mov 
eax,tr3 
mov 
esi,4 
mov 
tr5,esi 
mov 
ebx,tr3 
mov 
esi,8 
mov 
tr5,esi 
mov 
ecx,tr3 
mov 
esi,Och 
mov 
tr5,esi 
mov 
edx,tr3 


read 
the 
status 
from TR4 


mov 
edi,tr4 
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Loading the fill buffer is accomplished by first writing 
to the entry select bits in TR5 and setting the control 
bits in TR5 to 00. The entry select bits identify one of 
four 32-bit locations in the cache fill buffer to put 
32 bits of data. Following the write to TR5, TR3 is 
written with 32 bits of data which are immediately 
placed in the cache fill buffer. Writing to TR3 initiates 
the write to the cache fill buffer. The cache fill buffer 
is loaded with 128 bits of data by writing to TR5 and 
TR3 four times using a different entry select location 
each time. 


TR4 must be loaded with the 21-bit tag and valid bit 
(bit 10 in TR4) before the contents of the fill buffer 
are written to a cache location. 


The contents of the cache fill buffer are written to a 
cache location by writing TR5 with a control field of 
01 along with the set select and entry select fields. 
The set select and entry select field indicate the lo- 
cation in the cache to be written. The normal cache 
LRU update circuitry updates the internal LRU bits 
for the selected set. 


Note that a cache testability write can only be done 
when the cache is disabled for replaces (the CD bit 
is control register 0 is reset to 1). Also note that care 
must be taken when directly writing to entries in the 
cache. If the entry is set to overlap an area of mem- 
ory that is being used in external memory, that 
cache entry could inadvertently be used instead of 
the external memory. Of course, this is exactly the 
type of operation that one would desire if the cache 
were to be used as a high speed RAM. Also note 
that a memory reference (or any external bus cycle) 
should not occur in between the move to TR4 and 
the move to TR5, in order to avoid having the value 
in TR4 change due to the memory reference. 


8.2.3 CACHE 
TESTABILITY 
READ 


A cache testability read is a two step process. First 
the contents of the cache location are read into the 
cache read buffer. Next the data is examined by 
reading it out of the read buffer. Sample assembly 
code to do a testability read is given in Figure 8.2. 


Reading the contents of a cache location into the 
cache read buffer is initiated by writing TR5 with the 
control bits set to 10 and the desired seven-bit set 
select and two-bit entry select. In response to the 
write to TR5, TR4 is loaded with the 21-bit tag field 
and the single valid bit from the cache entry read. 
TR4 is also loaded with the three LRU bits and four 
valid bits corresponding to the cache set that was 
accessed. The cache read buffer is filled with the 
128-bit value which was found in the data array at 
the specified location. 


The contents of the read buffer are examined by 
performing four reads of TR3. Before reading TR3 
the entry select bits in TR5 must loaded to indicate 


which of the four 32·bit words in the read buffer to 
transfer into TR3 and the control bits in TR5 must be 
loaded with 00. The register read of TR3 will initiate 
the transfer of the 32-bit value from the read buffer 
to the specified general purpose register. 


Note that it is very important that the entire 128-bit 
quantity from the read buffer and also the informa- 
tion from TR4 be read before any memory refer- 
ences are allowed to occur. If memory operations 
are allowed to happen, the contents of the read buff- 
er will be corrupted. This is because the testability 
operations use hardware that is used in normal 
memory accesses for the Intel486 SX microproces- 
sor/Intel OverDrive Processor whether the cache is 
enabled or not. . 


8.2.4 FLUSH 
CACHE 


The control bits in TR5 must be written with 11 to 
flush the cache. None of the other bits in TR5 have 
any meaning when 11 is written to the control bits. 
Flushing the cache will reset the LRU bits and the 
valid bits to 0, but will not change the cache tag or 
data arrays. 


When the cache is flushed by writing to TR5 the 
special bus cycle indicating a cache flush to the ex- 
ternal system is not run (see Section 7.2.11, Special 
Bus Cycles). The cache should be flushed with the 
instruction INVD (Invalidate Data Cache) instruction 
or the WBINVD (Write-back and Invalidate Data 
Cache) instruction. 


8.3 Translation 
Lookaslde 
Buffer 
(TLB) Testing 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor TLB testability hooks are similar to those 
in the Intel386 microprocessor. The testability hooks 
have been enhanced to provide added test features 
and to include new features in the Intel486 SX mi- 
croprocessor/Intel 
OverDrive Processor. The TLB 
testability hooks are designed to be accessible dur- 
ing the BIST and for assembly language testing of 
the TLB. 


8.3.1 TRANSLATION 
LOOKASIDE 
BUFFER 
ORGANIZATION 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor TLB is 4-way set associative and has 
space for 32 entries. The TLB is logically split into 
three blocks shown in Figure 8.3. 


The data block is physically split into four arrays, 
each with space for eight entries. An entry in the 
data block is 22 bits wide containing a 20-bit physi- 
cal address and two bits for the page attributes. The 
page attributes are the PCD (page cache disable) bit 
and the PWT (page write-through) bit. Refer to Sec- 
tion 4.5.4 for a discussion of the PCD and PWT bits. 
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The tag block is also split into four arrays, one for 
each of the data arrays. A tag entry is 21 bits wide 
containing a 17-bit linear address and four protec- 
tion bits. The protection bits are valid M, user/su- 
pervisor (U/S), read/write (R/W) and dirty (D). 


TLB is the same as used by the on-chip cache. For a 
description of this algorithm refer to Section 5.5. 


The third block contains eight three bit quantities 
used in the pseudo least recently used (LRU) re- 
placement algorithm. These bits are called the LRU 
bits. The LRU replacement algorithm used in the 


The two TLB test registers are shown in Figure 8.4. 
TR6 is the command test register and TR7 is the 
data test register. External access to these registers 
is provided through 
MOV reg,TREG and 
MOV 
TREG,reg instructions. 
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TR6 Protection 
Bit 
TR6 Protection 
Bit # 
Meaning on 
Meaning 
on 
(B) 
(B#) 
TLB Write Operation 
TLB Lookup 
Operation 


0 
0 
Undefined 
Miss any TLB TAG Bit B 
0 
1 
Write 0 to TLB TAG Bit B 
Match TLB TAG Bit B if 0 
1 
0 
Write 1 to TLB TAG Bit B 
Match TLB TAG Bit B if 1 
1 
1 
Undefined 
Match any TLB TAG Bit B 


TR6 contains the tag information and control infor- 
mation used in a TLB test. Loading TR6 with tag and 
control information initiates a TLB write or lookup 
test. 


TR6 contains three bit fields, a 20-bit linear address 
(bits 12-31), seven bits for the TLB tag protection 
bits (bits 5-11) and one bit (bit 0) to define the type 
of operation to be performed on the TLB. 


The 20-bit linear address forms the tag information 
used in the TLB access. The lower three bits of the 
linear address select which of the eight sets are ac- 
cessed. The upper 17 bits of the linear address form 
the tag stored in the tag array. 


The seven TLB tag protection bits are described be- 
low. 


V: 
0,0#: 


U,U#: 


The valid bit for this TLB entry 
The dirty bit for/from the TLB entry 
The user/supervisor bit for/from 
the TLB 
entry 
The read/write bit for/from the TLB entry 


Two bits are used to represent the 0, U/S and R/W 
bits in the TLB tag to permit the option of a forced 
miss or hit during a TLB lookup operation. The 
forced miss or hit will occur regardless of the state 
of the actual bit in the TLB. The meaning of these 
pairs of bits is given in Table 8.2. 


The operation bit in TR6 determines if the TLB test 
operation will be a write or a lookup. The function of 
the operation bit is given in Table 8.3. 


TR6 
TLB Operation 
BltO 
to Be Performed 


0 
TLB Write 
1 
TLB Lookup 


TR7 contains the information stored or read from the 
data block durin.ga TLB test operation. Before a TLB 


test write, TR7 contains the physical address and 
the page attribute bits to be stored in the entry. After 
a TLB test lookup hit, TR7 contains the physical ad- 
dress, page attributes, LRU bits and entry location 
from the access. 


TR7 contains a 20-bit physical address (bits 12-31), 
two bits for PCO(bit 11) and PWT (bit 10) and three 
bits for the LRU bits (bits 7-9). The LRU bits in TR7 
are only used during a TLB lookup test. The func- 
tionality of TR7 bit 4 differs for TLB writes and look- 
ups. The encoding of bit 4 is defined in Tables 8.4 
and 8.5. Finally TR7 contains two bits (bits 2-3) to 
specify a TLB replacement pointer or the location of 
a TLB hit. 


TR7 
Replacement 
Pointer 
Bit 4 
Used on TLB Write 


0 
Pseudo-LRU Replacement Pointer 
1 
Data Test Register Bits 3:2 


TR7 
Meaning 
after TLB 
Bit 4 
Lookup Operation 


0 
TLB Lookup Resulted in a Miss 
1 
TLB Lookup Resulted in a Hit 


A replacement pointer is used during a TLB write. 
The pointer indicates which of the four entries in an 
accessed set is to be written. The replacement 
pointer can be specified to be the internal LRU bits 
or bits 2-3 in TR7. The source of the replacement 
pointer is specified by TR7 bit 4. The encoding of bit 
4 during a write is given by Table 8.4. 


Note that both testability writes and lookups affect 
the state of the internal LRU bits regardless of the 
replacement pointer used. All TLB write operations 
(testability or normal operation) cause the written 
entry to become the most recently used. For exam- 
ple, during a testability write with the replacement 
pointer specified by TR7 bits 2-3, the indicated en- 
try is written and that entry becomes the most re- 
cently used as specified by the internal LRU bits. 
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There are two TLB testing operations: write entries 
into the TLB, and perform TLB lookups. One major 
enhancement over TLB testing in the Intel386 micro- 
processor is that paging need not be disabled while 
executing testability writes or lookups. 


Note that any time one TLB set contains the same 
linear address in more than one of its entries, look- 
ing up that linear address will give unpredictable re- 
sults. Therefore a single linear address should not 
be written to one TLB set more than once. 


To perform a TLB write TR7 must be loaded fol- 
lowed by a TR6 load. The register operations must 
be performed in this order since the TLB oparation is 
triggered by the write to TR6. 


TR7 is loaded with a 20-bit physical address and 
values for PCD and PWT to be written to the data 
portion of the TLB. In addition, bit 4 of TR7 must be 
loaded to indicate whether to use TR7 bits 3-2 or the 
internal LRU bits as the replacement pointer on the 
TLB write operation. Note that the LRU bits in TR7 
are not used in a write test. 


TR6 must be written to initiate the TLB write opera- 
tion. Bit 0 in TR6 must be reset to zero to indicate a 
TLB write. The 20-bit linear address and the seven 
page protection bits must also be written in TR6 to 
specify the tag portion of the TLB entry. Note that 
the three least significant bits of the linear address 
specify which of the eight sets in the data block will 
be loaded with the physical address data. Thus only 
17 of the linear address bits are stored in the tag 
array. 


To perform a TLB lookup it is only necessary to write 
the proper tags and control information into TR6. Bit 
o in TR6 must be set to 1 to indicate a TLB lookup. 
TR6 must be loaded with a 20·bit linear address and 
the seven protection bits. To force misses and 
matches of the individual protection bits on TLB 
lookups, set the seven protection bits as specified in 
Table 8.2. 


A TLB lookup operation is initiated by the write to 
TR6. TR7 will indicate the result of the lookup opera- 
tion following the write to TR6. The hit/miss indica- 
tion can be found in TR7 bit 4 (see Table 8.5). 


TR7 will contain the following information if bit 4 indi- 
cated that the lookup test resulted in a hit. Bits 2-3 
will indicate in which set the match occurred. The 22 
most significant bits in TR7 will contain the physical 
address and page attributes contained in the entry. 
Bits 9-7 will contain the LRU bits associated with 
the accessed set. The state of the LRU bits is previ- 
ous to their being updated for the current lookup. 


If bit 4 in TR7 indicated that the lookup test resulted 
in a miss the remaining bits in TR7 are undefined. 


Again it should be noted that a TLB testability lookup 
operation affects the state of the LRU bits. The LRU 
bits will be updated if a hit occurred. The entry which 
was hit will become the most recently used. 


8.4 Tri-State 
Output Test Mode 


The Intel486 SX microprocessor/Intel 
OverDrive 


Processor provides the ability to float all its outputs 
and bidirectional pins. This includes all pins floated 
during bus hold as well as pins which are never float- 
ed in normal operation of the chip (HLDA, BREQ, 
FERR# and PCHK#). When the Intel486 SX micro- 
processor/Intel 
OverDrive Processor is in the tri- 


state output test mode external testing can be used 
to test board connections. 


The tri-state test mode is invoked if FLUSH# is 
sampled active at the falling edge of 
RESET. 


FLUSH# is an asynchronous signal. When driven 
asynchronously, FLUSH# should be asserted for 2 
clocks before and 2 clocks after RESET is deassert- 
ed. If FLUSH# is driven synchronously, the tri-state 
output test mode is initiated by driving FLUSH# so 
that it is sampled active in the clock prior to RESET 
going low and ensuring that specified setup and hold 
times are met. The outputs are guaranteed to tri- 
state no later than 10 clocks after RESET goes low 
(see Figure 6.4). The Intel486 SX microprocessor/ 
Intel OverDrive Processor remains in the tri-state 
test mode until the next RESET. 
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8.5 Intel486™ SX CPU 
Microprocessor 
Boundary Scan 
(JTAG) 


The Intel486 
SX CPU (PQFP Version 
Only) provides 
additional 
testability 
features 
compatible 
with 
the 
IEEE 
Standard 
Test 
Access 
Port 
and 
Boundary 
Scan Architecture 
(IEEE Std. 1149.1). The test logic 
provided 
allows 
for 
testing 
to 
insure 
that 
compo- 
nents 
function 
correctly, 
that 
interconnections 
be- 
tween various components 
are correct, 
and that var- 
ious 
components 
interact 
correctly 
on the 
printed 
circuit 
board. 


The boundary 
scan test logic consists 
of a boundary 
scan 
register 
and support 
logic 
that 
are accessed 
through 
a test access 
port (TAP). The TAP provides 
a simple 
serial 
interface 
that 
makes 
it possible 
to 
test all signal traces 
with only a few probes. 


The TAP can be controlled 
via a bus master. 
The 
bus master 
can be either automatic 
test equipment 
or a component 
(PLO) that interfaces 
to the four-pin 
test bus. 


The boundary 
scan test logic contains 
the following 
elements: 


- 
Test access 
port (TAP), consisting 
of input pins 
TMS, TCK, and TOI; and output 
pin TOO. 


- 
TAP controller, 
which interprets 
the inputs on the 
test 
mode 
select 
(TMS) 
line 
and 
performs 
the 
corresponding 
operation. 
The 
operations 
per- 
formed 
by the TAP include controlling 
the instruc- 
tion and data registers 
within the component. 


- 
Instruction 
register 
(IR), which accepts 
instruction 
codes 
shifted 
into the test logic on the test data 
input (TOI) pin. The instruction 
codes are used to 
select the specific 
test operation 
to be performed 
or the test data register 
to be accessed. 


- 
Test 
data 
registers: 
The 
Intel486 
SX CPU con- 
tains 
three 
test 
data 
registers: 
Bypass 
register 
(BPR), 
Oevice 
Identification 
register 
(010), 
and 
Boundary 
Scan register 
(BSR). 


The instruction 
and test data registers 
are separate 
shift-register 
paths connected 
in parallel 
and have a 
common 
serial data input and a common 
serial data 
output 
connected 
to the TAP signals, TOI and TOO, 
respectively. 


The Intel486 
SX CPU contains 
the two required 
test 
- 


data 
registers; 
bypass 
register 
and boundary 
scan 
register. 
In addition, 
they also have a device 
identifi- 
cation 
register. 


Each test data register 
is serially 
connected 
to TOI 
and TOO, with TOI connected 
to the most significant 
bit and TOO connected 
to the least significant 
bit of 
the test data register. 


Oata is shifted one stage (bit position 
within the reg- 
ister) on each rising edge of the test clock 
(TCK). 


In addition 
the Intel486 
SX CPU contains 
a runbist 
register 
to support 
the RUNBIST 
boundary 
scan in- 
struction. 


8.5.2.1 
Bypass Register 


The Bypass 
Register 
is a one-bit 
shift 
register 
that 
provides 
the minimal 
length 
path between 
TOI and 
TOO. This path can be selected 
when no test opera- 
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tion is being performed 
by the component 
to allow 
rapid movement 
of test data to and from other com- 
ponents 
on the board. 
While the bypass 
register 
is 
selected 
data is transferred 
from TOI to TOO without 
inversion. 


8.5.2.2 Boundary 
Scan Register 


The Boundary 
Scan Register 
is a single shift register 
path 
containing 
the 
boundary 
scan 
cells 
that 
are 
connected 
to all input and output pins of the Intel486 
SX CPU. Figure 
8.5 shows 
the 
logical 
structure 
of 
the boundary 
scan register. 
While output cells deter- 
mine 
the 
value 
of the 
signal 
driven 
on the 
corre- 
sponding 
pin, input cells only capture 
data; they do 
not affect the normal operation 
of the device. 
Oata is 
transferred 
without 
inversion 
from 
TOI 
to 
TOO 
through 
the boundary 
scan register 
during scanning. 
The boundary 
scan register 
can be operated 
by the 
EXT EST and 
SAMPLE 
instructions. 
The 
boundary 
scan register 
order is described 
in Section 
8.5.5. 


8.5.2.3 Device 
Identification 
Register 


The Oevice Identification 
Register 
contains 
the man- 
ufacturer's 
identification 
code, 
part 
number 
code, 
and version 
code in the format 
shown 
in Figure 8.6. 


Table 
8.6 lists the codes 
corresponding 
to the 
In- 
tel486 
SX CPU. 


8.5.2.4 Runblst 
Register 


The 
Runbist 
Register 
is a one 
bit register 
used to 
report the results of the Intel486 
SX CPU BIST when 
it is initiated 
by the RUNBIST 
instruction. 
This regis- 
ter is loaded 
with a "1" 
prior to invoking 
the 
BIST 
and is loaded 
with "0" 
upon successful 
completion. 


SYSTEto4 
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VERSION 
PART NUIolBER 
IolANUrACTURER 
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IDENTITY 


Component 
Device 10 (Boundary 
Scan) 
Code/Step 


Intel486 SX 
(cA) 
00427013h 


(B) 
20282013h 
(cB) 
80282013h 


The Instruction Register (IR) allows instructions to 
be serially shifted into the device. The instruction 
selects the particular test to be performed, the test 
data register to be accessed, or both. The instruc· 
tion register is four (4) bits wide. The most significant 
bit is connected to TDI and the least significant bit is 
connected to TOO. There are no parity bits associat· 
ed with the Instruction register. Upon entering the 


Capture·IR TAP controller state, the Instruction reg· 
ister is loaded with the default instruction "0001", 
SAMPLE/PRELOAD. Instructions are shifted into 
the instruction register on the rising edge of TCK 
while the TAP controller is in the SHIFT·IR state. 


8.5.3.1 Intel486™ 
SX CPU Boundary 
Scan 
Instruction 
Set 


The Intel486 SX CPU supports all three mandatory 
boundary scan instructions (BYPASS, SAMPLE/ 
PRELOAD, and EXTEST) along with two optional in· 
structions (IDCODE and RUNBIST). Table 8.7 lists 
the Inte/486 SX CPU boundary scan instruction 
codes. The instructions listed as PRIVATE cause 
TOO to become enabled in the Shift·DR state and 
cause "0" to be shifted out of TOO on the rising 
edge of TCK. Execution of the PRIVATE instructions 
will not cause hazardous operation of the Intel486 
SX CPU. 


EXTEST 
The instruction code is "0000". 
The 
EXTEST instruction allows testing of cir- 
cuitry external to the component pack- 
age, typically board interconnects. 
It 
does so by driving the values loaded 
into the Intel486 SX CPU's boundary 
scan register out on the output pins cor- 
responding to each boundary scan cell 
and capturing the values on Intel486 SX 
CPU input pins to be loaded into their 
corresponding boundary scan register 
locations. I/O pins are selected as input 
or output, depending on the value load- 
ed into their control setting locations in 
the 
boundary 
scan 
register. 
Values 
shifted into input latches in the bounda- 
ry scan register are never used by the 
internal logic of the Intel486 SX CPU. 


NOTE: 


After 
using the 
EXTEST instruction, 
the Intel486 SX CPU must be reset 
before 
normal 
(non-boundary 
scan) 
use. 
SAMPLE/ 
The instruction code is "0001". 
The 


PRELOAD SAMPLE/PRELOAD has two functions 
that it performs. When the TAP control- 
ler is in the Capture-DR state, the SAM- 
PLE/PRELOAD 
instruction 
allows 
a 


"snap-shot" of the normal operation of 
the component without interfering with 
that normal operation. The instruction 
causes boundary scan register cells 
associated with outputs to sample the 
value being driven by the Intel486 SX 
CPU. It causes the cells associated with 
inputs to sample the value being driven 
into the Intel486 SX CPU. On both out- 
puts and inputs the sampling occurs on 
the rising edge of TCK. When the TAP 
controller is in the Update-DR state, the 
SAMPLE/PRELOAD 
instruction 
pre- 
loads data to the device pins to be driv- 
en to the board by executing the EX- 
TEST instruction. Data is preloaded to 
the pins from the boundary scan regis- 
ter on the falling edge of TCK. 


IDCODE 
The instruction code is "0010". The 10- 
CODE instruction selects the device 
identification register to be connected 
to TDI and TOO, allowing the device 
identification code to be shifted out of 
the device on TOO. Note that the de- 
vice identification register is not altered 
by data being shifted in on TO!. 
. 


BYPASS 
The instruction code is "1111". The BY- 
PASS instruction selects the bypass 
register to be connected to TDI and 
TOO, effectively bypassing the test log- 
ic on the Intel486 SX CPU by reducing 
the shift length of the device to one bit. 
Note that an open circuit fault in the 
board level test data path will cause the 
bypass register to be selected following 2 
an instruction scan cycle due to the 
pull-up resistor on the TDI input. This 
has been done to prevent any unwant- 
ed interference with the proper opera- 
tion of the system logic. 


RUNBIST 
The instruction code is "1000". 
The 


RUNBIST instruction selects the one (1) 
bit runbist register, loads a value of "1" 
into the runbist register, and connects it 
to TOO. It also initiates the built-in self 
test (BIST) feature of the Intel486 SX 
CPU, which is able to detect approxi- 
mately 60% of the stuck-at faults on the 
Intel486 SX CPU. The Intel486 SX CPU 
ac/dc specifications for Vcc and CLK 
must be met and RESET must have 
been asserted at least once prior to ex- 
ecuting the RUNBIST boundary scan in- 
struction. After loading the RUNBIST in- 
struction code in the instruction register, 
the TAP controller must be placed in 
the Run-Test/Idle state. BIST begins on 
the first rising edge of TCK after enter- 
ing the Run-Test/Idle state. The TAP 
controller must remain in the Run-Test/ 
Idle state until BIST is completed. It 


Instruction 
Instruction 
Instruction 
Instruction 
Code 
Name 
Code 
Name 


0000 
EXTEST 
1000 
RUNBIST 


0001 
SAMPLE 
1001 
PRIVATE 


0010 
IDCODE 
1010 
PRIVATE 


0011 
PRIVATE 
1011 
PRIVATE 


0100 
PRIVATE 
1100 
PRIVATE 


0101 
PRIVATE 
1101 
PRIVATE 


0110 
PRIVATE 
1110 
PRIVATE 


0111 
PRIVATE 
1111 
BYPASS 


requires 1.2 million clock (ClK) cycles 
to complete BIST and report the result 
to the runbist register. After completing 
the 1.2 million clock (ClK) cycles, the 
value in the runbist register should be 
shifted out on TOO during the Shift-DR 
state. A value of "0" being shifted out 
on TOO indicates BIST successfully 
completed. A value of "1" indicates a 
failure occurred. After executing the 
RUNBIST instruction, the Intel486 SX 
CPU must be reset prior to normal oper- 
ation. 


The TAP controller is a synchronous, finite state ma- 
chine. It controls the sequence of operations of the 
test logic. The TAP controller changes state only in 
response to the following events: 
1. a rising edge of TCK 
2. power-up. 
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The value of the test mode state (TMS) input signal 
at a rising edge of TCK controls the sequence of the 
state changes. The state diagram for the TAP con- 
troller is shown in Figure 8.7. Test designers must 
consider the operation of the state machine in order 
to design the correct sequence of values to drive on 
TMS. 


8.5.4.1 Test-Loglc-Reset 
State 


In this state, the test logic is disabled so that normal 
operation of the device can continue unhindered. 
This is achieved by initializing the instruction register 
such that the IOCOOEinstruction is loaded. No mat- 
ter what the original state of the controller, the con- 
troller enters Test-logic-Reset state when the TMS 
input is held high (1) for at least five rising edges of 
TCK. The controller remains in this state while TMS 
is high. The TAP controller is also forced to enter 
this state at power-up. 


A controller state between scan operations. Once in 
this state, the controller remains in this state as long 
as TMS is held low. In devices supporting the RUN- 
BIST instruction, the BIST is performed during this 
state and the result is reported in the runbist regis· 
ter. For instruction not causing functions to execute 
during this state, no activity occurs in the test logic. 
The instruction register and all test data registers 
retain their previous state. When TMS is high and a 
rising edge is applied to TCK, the controller moves 
to the Select-DR state. 


8.5.4.3 Select-DR-Scan 
State 


This is a temporary controller state. The test data 
register selected by the current instruction retains its 
previous state. If TMS is held low and a rising edge 
is applied to TCK when in this state, the controller 
moves into the Capture·DR state, and a scan se- 
quence for the selected test data register is initiated. 
If TMS is held high and a rising edge is applied to 
TCK, the controller moves to the Select-IR-Scan 
state. 


8.5.4.4 Capture-DR 
State 


In this state, the boundary scan register captures 
input pin data if the current instruction is EXTEST or 
SAMPLE/PRELOAD. The other test data registers, 
which do not have parallel input, are not changed. 


The instruction does not change in this state. 


When the TAP controller is in this state and a rising 
edge is applied to TCK, the controller enters the 
Exit1-DR state if TMS is high or the Shift-DR state if 
TMS is low. 


In this controller state, the test data register con- 
nected between TDI and TOO as a result of the cur- 
rent instruction, shifts data one stage toward its seri- 
al output on each rising edge of TCK. 


The instruction does not change in this state. 


When the TAP controller is in this state and a rising 
edge is applied to TCK, the controller enters the 
Exit1-DR state if TMS is high or remains in the Shift- 
DR state if TMS is low. 


This is a temporary state. While in this state, if TMS 
is held high, a rising edge applied to TCK causes the 


controller to enter the Update-DR state, which termi- 
nates the scanning process. If TMS is held low and a 
rising edge is applied to TCK, the controller enters 
the Pause-DR state. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


8.5.4.7 Pause-DR 
State 


The pause state allows the test controller to tempo- 
rarily halt the shifting of data through the test data 
register in the serial path between TDI and TOO. An 
example of using this state could be to allow a tester 
to reload its pin memory from disk during application 
of a long test sequence. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


The controller remains in this state as long as TMS 
is low. When TMS goes high and a rising edge is 
applied to TCK, the controller moves to the Exit2-DR 
state. 


This is a temporary state. While in this state, if TMS 
is held high, a rising edge applied to TCK causes the 
controller to enter the Update-DR state, which termi- 
nates the scanning process. If TMS is held low and a 
rising edge is applied to TCK, the controller enters 
the Shift-DR state. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


8.5.4.9 Update-DR 
State 


The boundary scan register is provided with a 
latched parallel output to prevent changes at the 
parallel output while data is shifted in response to 
the EXTEST and SAMPLE/PRELOAD instructions. 
When the TAP controller is in this state and the 
boundary scan register is selected, data is latched 
onto the parallel output of this register from the shift· 
register path on the 'falling edge of TCK. The data 
held at the latched parallel output does not change 
other than in this state. 


All shift-register stages in test data register selected 
by the current instruction retains its previous value 
during this state. The instruction does not change in 
this state. 
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This is a temporary controller state. The test data 
register selected by the current instruction retains its 
previous state. If TMS is held low and a rising edge 
is applied to TCK when in this state, the controller 
moves into the Capture-IR state, and a scan se- 
quence for the instruction register is initiated. If TMS 
is held high and a rising edge is applied to TCK, the 
controller moves to the Test-Logic-Reset state. 


In this controller state the shift register contained in 
the instruction register loads the fixed value "0001" 
on the rising edge of TCK. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. When 
the controller is in this state and a rising edge is 
applied to TCK, the controller enters the Exit1-IR 
state if TMS is held high, or the Shift-IR state if TMS 
is held low. 


In this state the shift register contained in the in- 
struction register is connected between TOI and 
TOO and shifts data one stage towards its serial out- 
put on each rising edge of TCK. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


When the controller is in this state and a rising edge 
is applied to TCK, the controller enters the Exit1-IR 
state if TMS is held high, or remains in the Shift-IR 
state if TMS is held low. 


This is a temporary state. While in this state, if TMS 
is held high, a rising edge applied to TCK causes the 
controller to enter the Update-IR state, which termi- 
nates the scanning process. If TMS is held low and a 
rising edge is applied to TCK, the controller enters 
the Pause-IR state. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


The pause state allows the test controller to tempo- 
rarily halt the shifting of data through the instruction 
register. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


The controller remains in this state as long as TMS 
is low. When TMS goes high and a rising edge is 
applied to TCK, the controller moves to the Exit2-IR 
state. 


This is a temporary state. While in this state, if TMS 
is held high, a rising edge applied to TCK causes the 
controller to enter the Update-IR state, which termi- 
nates the scanning process. If TMS is held low and a 
rising edge is applied to TCK, the controller enters 
the Shift-IR state. 


The test data register selected by the current in- 
struction retains its previous value during this state. 
The instruction does not change in this state. 


The instruction shifted into the instruction register is 
latched onto the parallel output from the shift-regis- 
ter path on the falling edge of TCK. Once the new 
instruction has been latched, it becomes the current 
instruction. 


Test data registers selected by the current instruc- 
tion retain the previous value. 


The boundary scan register contain"sa cell for each 
pin, as well as cells for control of I/O and tri-state 
pins. 


The following is the bit order of the Intel486 SX CPU 
boundary scan register: (from left to right and top to 
bottom) 
TOI -+ 
WRCTL ABUSCTL BUSCTL MISCCTL 
AOS# 
BLAST# 
PLOCK# 
LOCK# 
PCHK# 
BROY# BOFF# BS16# BS8# ROY# KEN# 
HOLD AHOLO CLK HLOA WR# BREQ BEO# 
BE1# 
BE2# 
BE3# 
MIO# 
OC# 
PWT PCO 
EAOS# A20M# 
RESET FLUSH# 
INTR NMI 
Reserved Reserved 031 030029028027026 
025024 
OP3 023 022 021 020019018017 
016 OP2 015 014 013 012 011 010 09 08 
OP1 07 06 05 04 03 02 01 DOOPOA31 A30 
A29 A28 A27 A26 A25 A24 A23 A22 A21 A20 
A19 A18 A17 A16 A15 A14 A13 A12 A11 A10 
A9 A8 A7 A6 UP# A5 A4 A3 A2 -+ TOO 


"Reserved" corresponds to no connect "NC" sig- 
nals on the Intel486 SX CPU. 


All the 'CTL cells are control cells that are used to 
select the direction of bidirectional pins or tristate 
output pins. If "1" 
is loaded into the control cell 


('CTL), the associated pin(s) are tristated or'select- 
ed as input. The following lists the control cells and 
their corresponding pins. 
1. WRCTL controls the 031-00 
and OP3-0PO pins. 


2. ABUSCTL controls the A31-A2 pins. 
3. BUSCTL controls the AOS#, BLAST#, PLOCK#, 
LOCK#, 
WR#, 
BEO#. BE1#, 
BE2#. 
BE3#. 
MIO#, OC#, PWT, and PCO pins. 
4. MISCCTL controls the PCHK#, HLOA,and BREQ 
pins. 


The TAP controller is automatically initialized when a 
device is powered up. In addition, the TAP controller 
can be initialized by applying a high signal level on • 
the TMS input for five TCK periods. 


8.5.7 BOUNDARY 
SCAN 
DESCRIPTION 
LANGUAGE 
(BSDL) 
FILES 


Available 
through 
Intel. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor provides several features which simplify 
the debugging process. The three categories of on- 
chip debugging aids are: 
1. the code execution breakpoint opcode (OCCH), 
2. the single-step capability provided by the TF bit in 
the flag register, and 
3. the code and data breakpoint capability provided 
by the Debug Registers DRO-3, DR6, and DR7. 


A single-byte-opcode breakpoint instruction is avail- 
able for use by software debuggers. The breakpoint 
opcode is OCCH,and generates an exception 3 trap 
when executed. In typical use, a debugger program 
can "plant" the breakpoint instruction at all desired 
code execution breakpoints. The single-byte break- 
point opcode is an alias for the two-byte general 
software interrupt instruction, INT n, where n=3. 
The only difference between INT 3 (OCCh)and INT n 
is that INT 3 is never IOPL-sensitive but INT n is 
IOPL-sensitive in Protected Mode and Virtual 8086 
Mode. 


9.2 Single-Step 
Trap 


If the single-step flag (TF, bit 8) in the EFLAG regis- 
ter is found to be set at the end of an instruction, a 
single-step exception occurs. The single-step ex- 
ception is auto vectored to exception number 1. Pre- 
cisely, exception 1 occurs as a trap after the instruc- 
tion following the instruction which set TF. In typical 
practice, a debugger sets the TF bit of a flag register 
image on the debugger's stack. It then typically 
transfers control to the user program and loads the 
flag image with a signal instruction, the IRET instruc- 
tion. The single-step trap occurs after executing one 
instruction of the user program. 


Since the exception 1 occurs as a trap (that is, it 
occurs after the instruction has already executed), 
the CS:EIP pushed onto the debugger's stack points 
to the next unexecuted instruction of the program 
being debugged. An exception 1 handler, merely by 
ending with an IRET instruction, can therefore effi- 
ciently support single-stepping through a user pro- 
gram. 


9.3 
Debug Registers 


The Debug Registers are an advanced debugging 
feature of the Intel486 SX microprocessor/lntel 
OverDrive Processor. They allow data access break- 
points as well as code execution breakpoints. Since 
the breakpoints are indicated by on-chip registers, 
an instruction execution break-point can be placed 
in ROM code or in code shared by several tasks, 
neither of which can be supported by the INT3 
breakpoint opcode. 


The Intel486 SX microprocessor/Intel 
OverDrive 


Processor contains six Debug Registers, providing 
the ability to specify up to four distinct breakpoints 
addresses, breakpoint control options, and read 
breakpoint status. Initially after reset, breakpoints 
are in the disabled state. Therefore, no break- 
points will occur unless the debug registers are pro- 
grammed. Breakpoints set up in the Debug Regis- 
ters are autovectored to exception number 1. 


9.3.1 LINEAR 
ADDRESS 
BREAKPOINT 
REGISTERS 
(DRO-DR3) 


Up to four breakpoint addresses can be specified by 
writing into Debug Registers DRO-DR3, shown in 
Figure 9.1. The breakpoint addresses specified are 
32-bit linear addresses. Intel486 SX microproces- 
sor/Intel OverDrive Processor hardware continuous- 
ly compares the linear breakpoint addresses in 
DRO-DR3 with the linear addresses generated by 
executing software (a linear address is the result of 
computing the effective address and adding the 
32-bit segment base address). Note that if paging is 
not enabled the linear address equals the physical 
address. If paging is enabled, the linear address is 
translated to a physical 32-bit address by the on- 
chip paging unit. Regardless of whether paging is 
enabled or not, however, the breakpoint registers 
hold linear addresses. 


9.3.2 DEBUG 
CONTROL 
REGISTER 
(DR7) 


A Debug Control Register, DR7 shown in Figure 9.1, 
allows several debug control functions such as en- 
abling the breakpoints and setting up other control 
options for the breakpoints. The fields within the De- 
bug Control Register, DR7, are as follows: 


LENi (breakpoint length specification bits) 


A 2-bit LEN field exists for each of the four break- 
points. LEN specifies the length of the associated 
breakpoint field. The choices for data breakpoints 
are: 1 byte, 2 bytes, and 4 bytes. Instruction execu- 
tion breakpoints must have a length of 1 (LENi 
00). Encoding of the LENi field is as follows: 
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Usage of Least 
LENI 
Breakpoint 
Significant Bits In 
Encoding 
Field Width 
Breakpoint Address 
Register I, (I=0- 3) 


00 
1 byte 
All 32-bits used to 
specify a single-byte 
breakpoint field. 


01 
2 bytes 
A1-A31 used to 
specify a two-byte, 
word-aligned 
breakpoint field. AOin 
Breakpoint Address 
Register is not used. 


10 
Undefined- 
do not use 
this encoding 
11 
4 bytes 
A2-A31 used to 
specify a four-byte, 
dword-aligned 
breakpoint field. AO 
and A1 in Breakpoint 
Address Register are 
not used. 


BREAKPOINT 
0 LINEAR 
AOORESS 


BREAKPOINT 
1 LINEAR 
AOORESS 


BREAKPOINT 
2 LINEAR 
ADDRESS 


BREAKPOINT 
3 LINEAR 
ADDRESS 


NOTE: 
f2J indicates 
Intel 
reserved: 
Dc not 
define; 
SEE SECTION 2.3.10 


The LENi field controls the size of breakpoint field i 
by controlling whether all low-order linear address 
bits in the breakpoint address register are used to 
detect the breakpoint event. Therefore, all break- 
point fields are aligned; 2-byte breakpoint fields be- 
gin on Word boundaries, and 4·byte breakpoint 
fields begin on Dword boundaries. 


The following is an example of various size break- 
point fields. Assume the breakpoint linear address in 
DR2 is 00000005H. In that situation, the following 


0 


ORO 


DR1 


DR2 


DR3 


DR4 


DR5 


B 
B 
B 
B 
DR6 
3 
2 
1 
0 


G 
L 
G 
L 
G 
L 
G 
L 
DR] 
3 
3 
2 
2 
1 
1 
0 
0 


0 


240950-A2 


• 


illustration indicates the region of the breakpoint 
field for lengths of 1, 2, or 4 bytes. 


DR2= 00000005H; 
LEN2 = OOB 
1 


31 


I 
Ib~fld21 
0I:::~: 


DR2=00000005H; 
LEN2 = 01B 


31 
I 
0 


I 
00000008H 


00000004H 


OOOOOOOOH 


+- bkpt fld2 -+ 


I 


DR2= 0000OO05H; LEN2= 
11B 
31 
I 
I 
I 


0 


I 
I 
00000008H 
+- 
bkpt fld2 
-+ 
00000004H 


I 
I 
OOOOOOOOH 


A 2-bit RW field exists for each of the four break- 
points. The 2-bit RW field specifies the type of usage 
which must occur in order to activate the associated 
breakpoint. 


RW 
Usage 
Encoding 
Causing 
Breakpoint 


00 
Instruction execution only 
01 
Data writes only 
10 
Undefined-do 
not use this encoding 
11 
Data reads and writes only 


RW encoding 00 is used to set up an instruction 
execution breakpoint. RW encodings 01 or 11 are 
used to set up write-only or read/write data break- 
points. 


Note that instruction 
execution 
breakpoints 
are 
taken 
as 
faults 
(Le., before the instruction exe- 
cutes), but data 
breakpoints 
are taken 
as traps 


(Le., after the data transfer takes place). 
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Using LENi and RWi to Set Data Breakpoint i 


A data breakpoint can be set up by writing the linear 
address into DRi (i = 0-3). 
For data breakpoints, 


RWi can = 01 (write-only) or 11 (write/read). LEN 
can = 00,01, or 11. 


If a data access entirely or partly falls within the data 
breakpoint field, the data breakpoint condition has 
occurred, and if the breakpoint is enabled, an excep- 
tion 1 trap will occur. 


Using LENi and RWi to Set Instruction Execution 
Breakpoint i 


An instruction execution breakpoint can be set up by 
writing address of the beginning of the instruction 
(including prefixes if any) into DRi (i = 0-3). 
RWi 


must = 00 and LEN must = 00 for instruction exe- 
cution breakpoints. 


If the instruction beginning at the breakpoint address 
is about to be executed, the instruction execution 
breakpoint condition has occurred, and if the break- 
point is enabled, an exception 1 fault will occur be- 
fore the instruction is executed. 


Note that an instruction execution breakpoint ad- 
dress must be equal to the beginning 
byte address 


of an instruction (including prefixes) in order for the 
instruction execution breakpoint to occur. 


GO (Global Debug Register access detect) 


The Debug Registers can only be accessed in Real 
Mode or at privilege level 0 in Protected Mode. The 
GO bit, when set, provides extra protection against 
any Debug Register access even in Real Mode or at 
privilege level 0 in Protected Mode. This additional 
protection feature is provided to guarantee that a 
software debugger can have full control over the De- 
bug Register resources when required. The GO bit, 
when set, causes an exception 1 fault if an instruc- 
tion attempts to read or write any Debug Register. 
The GO bit is then automatically cleared when the 
exception 1 handler is invoked, allowing the excep- 
tion 1 handler free access to the debug registers. 


GE and LE (Exact data breakpoint match, global and 
local) 


The breakpoint mechanism of the Intel486 SX mi- 
croprocessor/Intel OverDrive Processor differs from 
that of the Intel386 microprocessor. The Intel486 SX 
microprocessor/Intel 
OverDrive Processor always 


does exact data breakpoint matching, regardless of 
GE/LE bit settings. Any data breakpoint trap will be 
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reported exactly after completion of the instruction 
that caused the operand transfer. Exact reporting is 
provided by forcing the Intel486 SX microprocessor/ 
Intel OverDrive Processor execution unit to wait for 
completion of data operand transfers before begin- 
ning execution of the next instruction. 


When the Intel486 SX microprocessor/Intel 
Over- 
Drive Processor performs a task switch, the lE bit is 
cleared. Thus, the lE bit supports fast task switching 
out of tasks, that have enabled the exact data break- 
point match for their task-local breakpoints. The lE 
bit is cleared by the Intel486 SX microprocessor/ 
Intel OverDrive Processor during a task switch, to 
avoid having exact data breakpoint match enabled 
in the new task. Note that exact data breakpoint 
match must be re-enabled under software control. 


The Intel486 SX microprocessor/Intel 
OverDrive 
Processor GE bit is unaffected during a task switch. 
The GE bit supports exact data breakpoint match 
that is to remain enabled during all tasks executing 
in the system. 


Note that Instruction 
execution 
breakpoints are al- 
ways reported exactly. 


Gi and li (breakpoint enable, global and local) 


If either Gi or li is set then the associated breakpoint 
(as defined by the linear address in DRi, the length 
in lENi and the usage criteria in RWi) is enabled. If 
either Gi or li is set, and the Intel486 SX microproc- 
essor/Intel 
OverDrive Processor detects the ith 
breakpoint condition, then the exception 1 handler is 
invoked. 


When the Intel486 SX microprocessor/Intel 
Over- 
Drive Processor performs a task switch to a new 
Task State Segment (TSS), all li bits are cleared. 
Thus, the li bits support fast task switching out of 
tasks that use some task-local breakpoint registers. 
The li bits are cleared by the Intel486 SX microproc- 
essor/Intel 
OverDrive 
Processor 
during 
a 
task 
switch, to avoid spurious exceptions in the new task. 
Note that the breakpoints must be re-enabled under 
software control. 


All 
Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor Gi bits are unaffected during a task 
switch. The Gi bits support breakpoints that are ac- 
tive in all tasks executing in the system. 


9.3.3 DEBUG 
STATUS 
REGISTER 
(DR6) 


A Debug Status Register, DR6 shown in Figure 9.1, 
allows the exception 1 handler to easily determine 


why it was invoked. Note the exception 1 handler 
can be invoked as a result of one of several events: 
1) DROBreakpoint fault/trap. 
2) DR1 Breakpoint fault/trap. 
3) DR2 Breakpoint fault/trap. 
4) DR3 Breakpoint fault/trap. 
5) Single-step (TF) trap. 
6) Task switch trap. 
7) Fault due to attempted debug register access 
when GD=1. 


The Debug Status Register contains single-bit flags 
for each of the possible events invoking exception 1. 
Note below that ooine of these events are faults (ex- 
ception taken before the instruction is executed), • 
while other events are traps (exception taken after 
the debug events occurred). 


The flags in DR6 are set by the hardware but never 
cleared by hardware. Exception 1 handler software 
should clear DR6 before returning to the user pro- 
gram to avoid future confusion in identifying the 
source of exception 1. 


The fields within the Debug Status Register, DR6, 
are as follows: 


Bi (debug fault/trap due to breakpoint 0-3) 


Four breakpoint indicator flags, BO-B3, correspond 
one-to-one with the breakpoint registers in DRO- 
DR3. A flag Bi is set when the condition described 
by DRi, lENi, and RWi occurs. 


If Gi or li is set, and if the ith breakpoint is detected, 
the 
Intel486 SX microprocessor/Intel 
OverDrive 
Processor will invoke the exception 1 handler. The 
exception is handled as a fault if an instruction 
execution breakpoint occurred, or as a trap if a data 
breakpoint occurred. 


IMPORTANT 
NOTE: A flag Bi is set whenever the 
hardware detects a match condition on enabled 
breakpoint i. Whenever a match is detected on at 
least one enabled 
breakpoint i, the hardware imme- 
diately sets all Bi bits corresponding to breakpoint 
conditions matching at that instant, whether enabled 
or not. Therefore, the exception 1 handler may see 
that multiple Bi bits are set, but only set Bi bits corre- 
sponding to enabled 
breakpoints (li or Gi set) are 
true indications of why the exception 1 handler was 
invoked. 


BD (debug fault due to attempted register access 
when GD bit set) 


This bit is set if the exception 1 handler was invoked 
due to an instruction attempting to read or write to 
the debug registers when GD bit was set. If such an 
event occurs, then the GD bit is automatically 
cleared when the exception 1 handler is invoked, 
allowing handler access to the debug registers. 


BS (debug trap due to single-step) 


This bit is set if the exception 1 handler was invoked 
due to the TF bit in the flag register being set (for 
single-stepping). 


BT (debug trap due to task switch) 


This bit is set if the exception 1 handler was invoked 
due to a task switch occurring to a task having an 


Intel486 SX microprocessor/Intel OverDrive Proces- 
sor TSS with the T bit set. Note the task switch into 
the new task occurs normally, but before the first 
instruction of the task is executed, the exception 1 
handler is invoked. With respect to the task switch 
operation, the operation is considered to be a trap. 


9.3.4 USE OF RESUME 
FLAG (RF) IN FLAG 


REGISTER 


The Resume Flag (RF) in the flag word can sup- 
press an instruction execution breakpoint when the 
exception 1 handler returns to a user program at a 
user address which is also an instruction execution 
breakpoint. 


This section describes the Intel486 SX microproces- 
sor/Intel 
OverDrive Processor instruction set. Ta- 
bles 10.1 through 10.3 list all instructions along with 
instruction encoding diagrams and clock counts. 
Further details of the instruction encoding are then 
provided in Section 
10.2, which completely de- 
scribes the encoding structure and the definition of 
all fields occurring within the Intel486 SX microproc- 
essor/lntel487 SX Math CoProcessor instructions. 


10.1 
Intel486™ 
SX Microprocessorl 
Intel487™ 
SX Math CoProcessor 
Instruction 
Encoding and Clock 
Count Summary 


In general, instructions will execute faster on the 
Intel OverDrive Processor than on the Intel486 SX 
microprocessor/lntel487 
SX 
Math 
CoProcessor. 


The following section describes how to calculate 
elapsed time for an instruction for the Intel486 SX 
microprocessor/lntel487 
SX 
Math 
CoProcessor. 


See Section 12.3, Instruction Set Summary, for infor- 
mation on calculating elapsed time for instructions 
executed on the Intel OverDrive Processor. 


To calculate elapsed time for an instruction execut- 
ed on the Intel486 SX microprocessor/lntel487 SX 
Math CoProcessor, multiply the instruction clock 
count, as listed in Tables 10.1 through 10.3, by the 
Intel486 SX microprocessor/lntel487 
SX Math Co- 
Processor clock period (e.g., 50 ns for a 20 MHz 
Intel486 SX microprocessor/lntel487 
SX Math Co- 
Processor). 


For more detailed information on the encodings of 
instructions, refer to Section 10.2 Instruction Encod- 


ings. Section 10.2 explains the general structure of 
instruction encodings, and defines exactly the en- 
codings of all fields contained within the instruction. 


The Intel486 SX microprocessor/lntel487 
SX Math 
CoProcessor instruction clock count tables give 
clock counts assuming data and instruction access- 
es hit in the cache. A separate penalty column de- 
fines clocks to add if a data access misses in the 
cache. The combined instruction and data cache hit 
rate is over 90%. 


A cache miss will force the Intel486 SX microproces- 
sor/Intel 
OverDrive Processor/lntel487 
SX Math 
CoProcessor to run an external bus cycle. The 2 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor/lntel487 SX Math CoProcessor 32-bit burst bus 
is defined as r-b-w. 


Where: 
r = The number of clocks in the first cycle of a 
burst read or the number of clocks per data 
cycle in a non-burst read. 
b = The number of clocks for the second and sub- 
sequent cycles in a burst read. 


w = The number of clocks for a write. 


The fastest bus the Intel486 SX microprocessor/ 
Intel OverDrive Processor/lntel487 
SX Math Co- 
Processor can support is 2- 1- 2 assuming 0 wait 
states. The clock counts in the cache miss penalty 
column assume a 2-1 - 2 bus. For slower busses 
add r- 2 clocks to the cache miss penalty for the 
first dword accessed. Other factors also affect in- 
struction clock counts. 


Instruction 
Clock Count Assumptions 
1. The external bus is available for reads or writes at 
all times. Else add clocks to reads until the bus is 
available. 


2. Accesses are aligned. Add three clocks to each 
misaligned access. 


3. Cache fills complete before subsequent accesses 
to the same line. If a read misses the cache dur- 
ing a cache fill due to a previous read or pre-fetch, 
the read must wait for the cache fill to complete. If 
a read or write accesses a cache line still being 
filled, it must wait for the fill to complete. 
4. If an effective address is calculated, the base reg- 
ister is not the destination register of the preced- 
ing instruction. If the base register is the destina- 
tion register of the preceding instruction add 1 to 
the clock counts shown. Back-to-back PUSH and 
POP instructions are not affected by this rule. 
5. An effective address calculation uses one base 
register and does not use an index register. How- 
ever, if the effective address calculation uses an 
index register, 1 clock may be added to the clock 
count shown. 


6. The target of a jump is in the cache. If not, add r 
clocks for accessing the destination instruction of 
a jump. If the destination instruction is not com- 
pletely contained in the first dword read, add a 
maximum of 3b clocks. If the destination instruc- 
tion is not completely contained in the first 16 byte 
burst, add a maximum of another r+ 3b clocks. 


7. If no write buffer delay, w clocks are added only 
in the case in which all write buffers are full. Typi- 
cally, this case rarely occurs. 


8. Displacement and immediate not used together. 
If displacement and immediate used together, 1 
clock may be added to the clock count shown. 
9. No invalidate cycles. Add a delay of 1 clock for 
each invalidate cycle if the invalidate cycle con- 
tends for the internal cache/external bus when 
the Intel486 SX microprocessor/Intel OverDrive 
Processor/lntel487 
SX 
Math 
CoProcessor 
needs to use it. 


10. Page translation hits in TLB. A TLB miss will add 
13,21 or 28 clocks to the instruction depending 
on whether the Accessed and/or Dirty bit in nei- 
ther, one or both of the page entries needs to be 
set in memory. This assumes that neither page 
entry is in the data cache and a page fault does 
not occur on the address translation. 


11. No exceptions are detected during instruction 
execution. Refer to Interrupt Clock Counts Table 
for extra clocks if an interrupt is detected. 
12. Instructions that read multiple consecutive data 
items (Le. task switch, POPA, etc.) and miss the 
cache are assumed to start the first access on a 
16-byte boundary. If not, an extra cache line fill 
may be necessary which may add up to (r+3b) 
clocks to the cache miss penalty. 
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Table 10.1. Intel486TM SX Microprocessorlintel487™ 
SX Math 


CoProcessor Integer Clock Count Summary 


INSTRUCTION 
FORMAT 
Cache Hit 
Penalty If 
Not •• 
CachaMI •• 


INTEGER OPERATIONS 


MOY ~ Move: 


reg110reg2 
1000100W 
11 
reg1 reg21 
1 


reg2 10reg1 
1000101w 
11 
reg1 reg21 
1 


memory to reg 
1000101w 
mod 
reg 
rIm 1 
1 
2 


reg to memory 
1000100w 
mod 
reg 
rIm I 
1 


Immediate to reg 
1100011 
w 
11000 
reg Iimmediate data 
1 


or 
I 
1011w reg I immediate data 
1 


Immediate to Memory 
I 1100011 
w I mod 000 
Idisplacement 
1 
rim 
immediate 


Memory to Accumulator 
I 1010000w 
I full displacement 
1 
2 


Accumulator 
to Memory 
I 1010001w 
I full displacement 
1 


MOYSX/MOYZX 
= Move with Slgn/Zero 
Extenalon 


reg2 to reg1 
I 00001111 
I 1011 
z11 
w 
111 
reg1 reg21 
3 


memory to reg 
I 00001111 
I 1011 
z11 
w I mod 
reg 
r/ml 
3 
2 


z 
Instruction 


0 
MOVZX 


1 
MOVSX 


PUSH ~ Puah 


reg 
11111111 
111 
110 
reg 1 
4 


or 
01010 
reg 1 
1 


memory 
11111111 
I mod 110 
r/ml 
4 
1 
1 


immediate 
011010s0 
I immediate data 
1 


PUSHA ~ Puah All 
01100000 
I 
11 


~P 
~ Pop 


1 
111 
reg 1 
reg 
10001111 
000 
4 
1 


or 
101011 
reg 1 
1 
2 


memory 
I 10001111 
1mod 000 
r/ml 
5 
2 
1 


POPA ~ Pop All 
1 01100001 
1 
9 
7/15 
16/32 


XCHG ~ Exchange 


reg1 with reg2 
1000011w 
111 
reg1 reg21 
3 
2 


Accumulator 
with reg 
10010 
reg I 
3 
2 


Memory with reg 
1000011w 
1mod 
reg 
r/ml 
5 
2 


NOP ~ No Operation 
10010000 
1 
1 


LEA ~ Load EA to Reglater 
10001101 
I mod 
reg 
rIm I 
no index register 
1 


with index register 
2 


• 


Table 10.1. Intel486TM SX Microprocessor/intel487™ 
SX Math 
CoProcessor Integer Clock Count Summary (Continued) 


INSTRUCTION 
FORMAT 
Cache Hit 
Penelty If 
Not •• 
eacheMI •• 


INTEGER OPERATIONS (Continued) 


Inetructlon 
TTT 


ADD ~ Add 
000 
ADC = Add with CarTy 
010 
AND ~ Logical AND 
100 


OR ~ Logical OR 
001 


SUB ~ Subtract 
101 
SBB ~ Subtract with Borrow 
011 


XOR = Logical Exclusive OR 
110 


reg1 to reg2 
I OOTTTOOw 
11 
reg1 reg21 
1 


reg2toreg1 
I 00TTT01w 
11 
reg1 reg21 
1 


memory to register 
I 00TTT01w 
mod 
reg 
rlml 
2 
2 


register to memory 
I OOTTTOOw 
mod 
reg 
r/ml 
3 
6/2 
U/L 


immediate to register 
I 100000sw 
11 
TTT 
reg I immediate register 
1 


immediate to accumulator 
I 00TTT10w 
immediate data 
1 


immediate to memory 
I 100000sw 
mod TTT 
rIm I immediate data 
3 
6/2 
UlL 


Instruction 
TTT 


INC = Increment 
000 
DEe ~ Decrement 
001 


reg 
I 1111111w 
111 
TTT 
reg I 
1 


or 
101 TTT 
reg I 
1 


memory 
I 1111111w 
I mod TTT 
r/ml 
3 
6/2 
U/L 


Instruction 
TTT 


NOT = Logical Complement 
010 
NEG = Negate 
011 


reg 
I 1111011 
w 
111 
TTT 
reg I 
1 


memory 
I 1111011 
w I mod TTT 
rim 1 
3 
6/2 
U/L 


CMP ~ Compare 


reg1 with reg2 
0011100w 
111 
reg1 reg21 
1 


reg2 with reg1 
0011101 
w 
111 
reg1 reg21 
1 


memory with register 
0011100w 
I mod 
reg 
rim I 
2 
2 


register with memory 
0011101 
w I mod 
reg 
rlml 
2 
2 


immediate with register 
100000sw 
111 
111 
reg I immediate data 
1 


immediate with acc. 
0011110w 
I immediate data 
1 


immediate with memory 
100000sw 
I mod 111 
rim I immediate data 
2 
2 


TEST ~ Logical Compare 


reg1 and reg2 
1000010w 
11 
reg1 reg21 
1 


memory and register 
1000010w 
mod 
reg 
rim I 
2 
2 


immediate and register 
1111011 
w 
11 
000 
reg I immediate data 
1 


immediate and acc. 
1010100w 
immediate data 
1 


immediate and memory 
1111011 
w 
mod 000 
rIm I immediate data 
2 
2 


intel~ 


Table 
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CoProcessor 
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INSTRUCTION 
FORMAT 
ClcheHIt 
Penillylf 
Not •• 
ClcheMI 
•• 


INTEGER OPERATIONS 
(Continued) 


MUL ~ Multiply (un"lIned) 


ace. with register 
I 1 1 1 1 011 
w 
111 
100 
reg 1 


Multiplier-Byte 
13/18 
MN/MX.3 


Word 
13/26 
MNIMX.3 


Dword 
13/42 
MNIMX.3 


ace. with memory 
I 1111011 
w 
1mod 
100 
rIm I 


Multiplier-Byte 
13/18 
1 
MN/MX,3 


Word 
13/26 
1 
MN/MX,3 


Dword 
13/42 
1 
MN/MX,3 


IMUL ~ Integer Multiply ("lined) 


ace. with register 
I 1111011 
w 
111 
101 
reg 1 


Multiplier-Byte 
13/18 
MN/MX,3 


Word 
13/26 
MN/MX,3 


Dword 
13/42 
MN/MX,3 


ace. with memory 
1 1111011 
w 
1mod 
101 
r/ml 


Multiplier-Byte 
13/18 
MN/MX,3 


Word 
13/26 
MN/MX,3 


Dword 
13/42 
MN/MX,3 


reg1 with reg2 
1 00001111 
I 10101111 
111 
reg1 
r8Q2! 


Multiplier-Byte 
13/18 
MN/MX.3 


Word 
13/26 
MN/MX,3 


Dword 
13/42 
MN/MX,3 


register 
with memory 
1 00001111 
I 10101111 
1mod 
reg 
r/ml 


Multiplier-Byte 
13/18 
1 
MN/MX,3 


Word 
13/26 
1 
MN/MX,3 


Dword 
13/42 
1 
MN/MX,3 


reg1 with imm. to reg2 
1 0110105 
1 
111 
regl 
reg21 immediate 
data 


Multiplier-Byte 
13/18 
MN/MX,3 


Word 
13/26 
MN/MX,3 


Dword 
13/42 
MN/MX,3 


memo with imm. to reg. 
1 0110t051 
I mod 
reg 
rIm I immediate 
data 


Multiplier-Byte 
13/18 
2 
MN/MX,3 


Word 
13/26 
2 
MN/MX,3 


Dword 
13/42 
2 
MN/MX,3 
DIY ~ Divide (unsigned) 


ace. by register 
1 1111011 
w 
111 
110 
reg 1 


Divisor-Byte 
16 


Word 
24 
Dword 
40 


ace. by memory 
I 1111011 
w 
1mod 
110 
rIm I 


( 


Divisor-Byte 
16 
Word 
24 
Dword 
40 


IDlY ~ Integer Divide ("lined) 


ace. by register 
1 1111011 
w 
111 
111 
reg 1 


Divisor-Byte 
19 
Word 
27 
Dword 
43 


• 


Table 
10.1. Intel486TM 
SX 
Microprocessor/intel487™ 
SX 
Math 


CoProcessor 
Integer 
Clock 
Count 
Summary 
(Continued) 


INSTRUCTION 
FORMAT 
,CacheHll 


Penally If 
Notea 
Cache MI•• 


INTEGER OPERATIONS (Continued) 


ace. by memory 
1 1111011 
w 
1mod 111 
rim I 


Divisor-Byte 
20 


Word 
28 


Dword 
44 


CBW - 
Convert 
Byte to Word 
1 10011000 
I 
3 


CWO - 
Convart 
Word to Dword 
1 10011001 
I 
3 


Inatructlon 
TTT 


ROl - 
Rotate left 
000 


ROR = Rotate Right 
001 


RCl = Rotate through Carry left 
010 


RCR - 
Rotate through Carry Right 
011 


SHl/SAl 
~ Shift logicall Arithmetic left 
100 


SHR = Shift logical Right 
101 


SAR = Shift Arithmetic Right 
111 


Not Through 
Carry (ROl, ROR, SAl, SAR, SHl, and SHR) 


reg by 1 
I 1101000w 
111 
TTT 
reg 1 
3 


memory by 1 
I 1101000w 
Imod TTT 
r/ml 
4 
6 


reg by Cl 
1 1101001 
w 
111 
TTT 
reg 1 
3 


memorybyCl 
1 1101001 
w I mod TTT 
r/ml 
4 
6 


reg by immediate count 
1 1100000w 
111 
TTT 
reg I immediate 8-bit data 
2 


mem by immediate 
count 
1 1100000w 
I mod TTT 
rIm I immediate 8·bit data 
4 
6 


Through 
Carry (RCL and RCR) 


reg by 1 
1101000w 
111 
TTT 
reg 1 
3 


memory by 1 
1101000w 
1mod TTT 
r/ml 
4 
6 


reg by Cl 
1101001 
w 
111 
TTT 
reg 1 
8/30 
MN/MX,4 


memorybyCl 
1101001 
w 
1mod TTT 
r/ml 
9/31 
MN/MX,5 


reg by immediate count 
1100000w 
111 
TTT 
reg I immediate 8-bit data 
8/30 
MN/MX,4 


mem by immediate count 
1100000w 
I mod TTT 
rIm I immediate 8-bit data 
9/31 
MN/MX,5 


Instruction 
TTT 


SHlD = Shift left Double 
100 


SHRD = Shift Right Double 
101 


register with immediate 
00001111 
10TTT100 
11 
reg2 reg1 1imm 8-bit data 
2 


memory by immediate 
00001111 
10TTT100 
mod 
reg 
rIm I imm 8-bit data 
3 
6 


register by CL 
00001111 
10TTT101 
11 
reg2 reg11 
3 


memorybyCl 
00001111 
10TTT101 
mod 
reg 
rim 1 
4 
5 


BSWAP = Byte Swap 
00001111 
11001 
reg 
1 


XAOO - 
Exchange and Add 


reg1. reg2 
1 00001111 
11100000W 
111 
reg2 reg11 
3 


memory, reg 
1 00001111 
11100000W 
1mod 
reg 
rim I 
4 
6/2 
U/l 


CMPXCHG ~ Compare and Exchange 


reg1, reg2 
I 00001111 
11011000W 
111 
reg2 reg1 I 
6 


memory, reg 
I 00001111 
11011000W 
1mod 
reg 
r/ml 
7/10 
2 
6 


infel~ 


Table 10.1. Intel486™ SX Microprocessorlintel487™ 
SX Math 
CoProcessor Integer Clock Count Summary (Continued) 


INSTRUCTION 
FORMAT 
Cache Hit 
Penalty 
If 
Nol •• 
cache 
Mlaa 


CONTROL 
TRANSFER 
(within 
segment) 


NOTE: Times are jump taken/not 
taken 


Jccc 
= Jump on ccc 


8·bit displacement 
I 
0111 
tttn 
I 
8-bitdisp. 
I 
3/1 
TINT, 
23 


full displacement 
I 00001111 
I 
1000lttn 
I full displacement 
3/1 
TINT, 23 


NOTE: Times are jump taken/not 
taken 


SETcccc 
~ Set By1e on cccc (TImes are cccc truelfalee) 


reg 
I 00001111 
I 
1001tttn 
111 
000 
reg I 
4/3 


memory 
I 
00001111 
I 
1001tltn 
I mod 
000 
r/ml 
3/4 


Mnemonic 
Condition 
tttn 
ceee 
0 
Overflow 
0000 


NO 
No Overflow 
0001 


B/NAE 
BelowlNot 
Above or Equal 
0010 


NB/AE 
Not Belowl Above or Equal 
0011 
EIZ 
EquallZero 
0100 


NE/NZ 
Not Equal/Not 
Zero 
0101 


BEINA 
Below or Equal/Not 
Above 
0110 


NBE/A 
Not Below or Equall Above 
0111 


S 
Sign 
1000 


NS 
Not Sign 
1001 


PIPE 
Parity/Parity 
Even 
1010 
NP/PO 
Not Parily/Parily 
Odd 
1011 


UNGE 
Less Than/Not 
Greater or Equal 
1100 


NL/GE 
Not Less Than/Greater 
or Equal 
1101 


LEING 
Less Than or Equal/Greater 
Than 
1110 


NLE/G 
Not Less Than or Equal/Greater 
Than 
1111 


LOOP ~ LOOP CX TImes 
I 
11100010 
I 
8-bitdisp. 
I 
7/6 
UNL,23 


LooPZlLooPE 
~ Loop with 
I 
11100001 
I 
8-b~ disp. 
I 
9/6 
LINL,23 


Zero/Equal 


LooPNZ/LooPNE 
= Loop while 
I 
11100000 
I 
8-bitdisp. 
I 
9/6 
LINL,23 


Not Zero 


JCXZ 
- 
Jump on CX Zero 
I 
11100011 
I 
8-bitdisp. 
I 
8/5 
TINT, 23 


JECXZ 
~ Jump on ECX Zero 
I 
11100011 
I 
8-bitdisp. 
I 
8/5 
TINT, 23 


(Address Size Prefix Differentiates 
JCXZ for JECXZ) 


JMP = Unconditional 
Jump (within 
segmant) 


Short 
I 11101011 
I 
8-bitdisp. 
I 
3 
7,23 


Direct 
I 11101001 
I full displacement 
3 
7,23 


Register Indirect 
I 11111111 
111 
100 
reg I 
5 
7,23 


Memory Indirect 
I 
11111111 
I mod 
100 
r/ml 
5 
5 
7 


CALL 
~ Call (within 
segment) 


Direct 
I 
11101000 
I full displacement 
3 
7,23 


Register Indirect 
I 11111111 
111 
010 
reg I 
5 
7,23 


Memory Indirect 
I 11111111 
I mod 
010 
rIm I 
5 
5 
7 


RET ~ Return 
from CALL (within 
segment) 
I 11000011 
I 
5 
5 


Adding Immediate to SP 
I 
11000010 
I 
16-bit disp. 
I 
5 
5 


• 


intel~ 


INSTRUCTION 
FORMAT 
Cache Hit 
Penally If 
Notee 
CacheMlee 


CONTROL TRANSFER (within segment) (Continued) 


ENTER - 
Enter Procedure 
I 
11001000 
I,6-bitdisp.,8·bitlevel 


Level = 0 
14 


Level - 
1 
17 


Levei (L) > 1 
17+3L 
8 


LEAVE = Leave Procedure 
I 
11001001 
I 
5 
1 


MULTIPLE·SEGMENT INSTRUCTIONS 


MOV = Move 


reg. to segment reg. 
I 
10001110 
111 
sreg3 
reg I 
3/9 
0/3 
RV/P.9 


memory to segment reg. 
I 
10001110 
I mod sreg3 
r/ml 
3/9 
2/5 
RV/P.9 


segment reg. to reg. 
I 
10001100 
111 
sreg3 
reg I 
3 


segment reg. to memory 
I 
10001100 
I mod sreg3 
r/ml 
3 


PUSH - 
Push 


segment reg. 
I 000sreg211 
01 
3 


(ES, CS, 55, or DS) 


segment reg. (FS or GS) 
I 00001111 
110 
sreg30 0 0 I 
3 


POP ~ Pop 


segment reg. 
1000sreg2111 
3/9 
2/5 
RV/P,9 


(ES, 55, or DS) 


segment reg. (FS or GS) 
00001111 
10 
Sreg30011 
3/9 
2/5 
RV/P,9 


LOS - 
Load Pointer to OS 
11000101 
mod 
reg 
r/ml 
6/12 
7/10 
RV/P, 9 


LES = Loed Pointer to ES 
11000100 
mod 
reg 
r/ml 
6/12 
7/10 
RV/P, 9 


LFS = Load Polnterlo 
FS 
00001111 
10110100 
I mod 
reg 
rim I 
6/12 
7/10 
RV/P, 9 


LGS - 
Load Pointer to GS 
00001111 
10110101 
I mod 
reg 
r/ml 
6/12 
7/10 
RV/P, 9 


LSS = Load Pointer to SS 
00001111 
10110010 
I mod 
reg 
rim I 
6/12 
7/10 
RV/P,9 


CALL = Can 


Direct intersegment 
I 
10011010 
I unsigned full offset, selector 
18 
2 
R,7,22 


to same level 
20 
3 
P,9 


thru Gate to same level 
35 
6 
P,9 


to inner level, no parameters 
69 
17 
P,9 


to inner level, x parameter (d) words 
77+4X 
17+n 
P,11,9 


toTSS 
37+TS 
3 
P,10,9 


thru Task Gate· 
38+TS 
3 
P,10,9 


Indirect intersegment 
I 
11111111 
I mod 011 
r/ml 
17 
8 
R,7 


to same level 
20 
10 
P,9 


thru Gate to same level 
35 
13 
P,9 


to inner level. no parameters 
69 
24 
P,9 


to inner level, x parameter (d) words 
77+4X 
24+n 
P,11,9 


toTSS 
37+TS 
10 
P,10,9 
thru Task Gate 
38+TS 
10 
P,10,9 


RET - 
Return from CALL 


intersegment 
I 
11001011 
I 
13 
8 
R,7 


to same level 
17 
9 
P,9 


to outer level 
35 
12 
P,9 


intersegment 
adding 
I 
11001010 
I 
16·bit disp. 
I 
imm.toSP 
14 
8 
R,7 


to same level 
18 
9 
P,9 
to outer level 
36 
12 
P,9 
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INSTRUCTION 
FORMAT 
Cache Hit 
Penally II 
Notea 


Cache Mid 


MULTIPLE-SEGMENT 
INSTRUCTIONS 
(Continued) 


JMP = Unconditional 
Jump 


Direct intersegment 
1 11101010 
I unsigned 
full offset. 
selector 
17 
2 
A, 7, 22 


to same level 
19 
3 
P,9 


Ihru Call Gate 10 same level 
32 
6 
P,9 


IhruTSS 
42+TS 
3 
P,10,9 


Ihru Task Gate 
43+TS 
3 
P,10,9 


Indirect 
intersegment 
I 11111111 
1mod 101 
r/ml 
13 
9 
A,7,9 


to same level 
18 
10 
P,9 


thru Call Gale 10 same level 
31 
13 
P,9 


thru TSS 
41+TS 
10 
P,10,9 


thru Task Gale 
42+TS 
10 
P,10,9 


BIT MANIPULATION 


BT ~ Teelbll 


register, 
immediate 
1 00001111 
1 10111010 
111 
100 
reg I imm. 8-bit data 
3 


memory 
I immediate 
1 00001111 
1 10111010 
1mod 100 
rIm I imm. 8-bit data 
3 
1 


1 


\ 


110100011 
111 
reg1 I 
reg1, reg2 
00001111 
reg2 
3 


memory, reg 
1 00001111 
1 10100011 
Imod 
reg 
r/ml 
8 
2 


Inltructlon 
TIT 


BTS ~ Test Bit and Set 
101 


BTA ~ Tesl Bil and Aesel 
110 
BTC ~ Test Bit and Compliment 
111 


register, 
immediate 
1 00001111 
1 10111010 
1,1 
TTT 
reg I imm. 8-bit data 
6 


memory, 
immediate 
1 00001)11 
1 10111010 
Imod TTT 
rIm I imm. 8-bit data 
8 
2/0 
U/L 


reg1, reg2 
100001111 
1 10TTT011 
111 
reg2 reg11 
6 


memory, 
reg 
1 00001111 
1 10TTT011 
Imod 
reg 
r/ml 
13 
3/1 
U/L 


BSF = seen Bit Forwlrd 


reg1, reg2 
1 00001111 
1 10111100 
111 
reg2 reg11 
6/42 
MN/MX,12 


memory, 
reg 
I 00001111 
1 10111100 
1mod 
reg 
rim 1 
7/43 
2 
MN/MX,13 


BSR ~ seen BII Revar •• 


reg1, reg2 
I 00001111 
1 10111101 
111 
reg2 reg11 
6/103 
MN/MX,14 


memory, 
reg 
I 00001111 
I 10111101 
1mod 
reg 
rim 1 
71104 
1 
MN/MX,15 


STRING INSTRUCTIONS 


CMPS - 
Compo •• Byle Word 
I 1010011 
w I 
8 
6 
16 


LODS ~ Load Byle/Word 
I 1010110w 
I 
5 
2 


to ALl AX/EAX 


MOVS ~ Move Byle/Word 
1 1010010w 
1 
7 
2 
16 


SCAS = seen Byle/Word 
1 1010111 
w 
1 
6 
2 


STOS ~ Store Byle/Word 
1 1010101w 
1 
5 
from ALl AX/EX 


XLA T = Tranalata 
String 
I 11010111 
I 
4 
2 


• 


Table 10.1. Intel486™ SX Mlcroprocessor/lntel487™ 
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INSTRUCTION 
FORMAT 
Cache Hit 
Penaltyll 
Notes 
CacheMI •• 


REPEATED STRING INSTRUCTIONS 


Repeated byCount in CX or ECX (C~ Count in CX or ECX) 


REPE CMPS ~ Compare String 
I 11110011 
I 1010011 
w I 


(Find Non-Match) 
C~O 
5 
C>O 
7+7c 
16,17 


REPNE CMPS = Compare Siring 
I 11110010 
I 1010011 
w I 


(Find Match) 
C~O 
5 


C>O 
7+7c 
16,17 


REP LODS ~ Load Siring 
I 11110010 
I 1010110w 
I 


C~O 
5 


C>O 
7+4c 
16,18 


REP MOVS - 
Move String 
I 
11110010 
I 1010010w 
I 


C~O 
5 
C~l 
13 
1 
16 
C>l 
12+3c 
16,19 


REPE SCAS - 
seen String 
I 11110011 
I 1010111 
w I 


(Find Non-ALl AX/EAX) 
C-O 
5 
C>O 
7+50 
20 


REPNE SCAS - 
seen Siring 
I 11110010 
I 1010111 
w I 


(Find ALl AX/EAX) 
C~O 
5 


C>O 
7+50 
20 


REP STOS - 
Store Siring 
I 11110010 
I 1010101w 
I 


C~O 
5 


C>O 
7+4c 


FLAG CONTROL 


CLC - 
Clear Carry FIeg 
I 
11111000 
I 
2 


STC - 
set carry Rag 
11111001 
I 
2 


CMC ~ Complement 
carry FIeg 
11110101 
I 
2 


CLD ~ Cle•• DIrection FIeg 
11111100 
I 
2 


STD - 
sat DIrection FIeg 
11111101 
I 
2 


CLI - 
Clear Interrupt 
11111010 
I 
5 
Enable Rag 


STI ~ sat Interrupt 
11111011 
I 
5 
Enable Flag 


LAHF ~ Load AH Into Rag 
I 
10011111 
I 
3 


SAHF ~ Store AH Into Raga 
I 10011110 
I 
2 


PUSHF - 
Puah Flega 
I 10011100 
I 
4/3 
RVIP 


POPF - 
Pop F1_ 
I 10011101 
I 
9/6 
RV/P 


DECIMAL ARITHMETIC 


AAA ~ ASCII AdJuatlor 
Acid 
I 00110111 
I 
3 


AAS = ASCII Adjuatlor 
I 00111111 
I 
3 


SUbtract 


AAM - 
ASCII AdJualfor 
I 
11010100 
I 00001010 
I 
15 
Multiply 
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INSTRUCTION 
FORMAT 
Cache Hit 
Penalty If 
Not •• 
CacheMI •• 


DECIMAL ARITHMETIC (Continued) 


AAD - 
ASCII AdJuatlor 
, 
11010101 
, 
00001010 
, 
14 


Divide 


DAA ~ Decimal AdJuat for Add 
, 00100111 
, 
2 


DAS = Decimal Adjuat for Subtract 
, 00101111 
, 
2 


PROCESSOR CONTROL INSTRUCTIONS 


HLT = Halt 
I 11110100 
I 
4 


MOV ~ Move To and From Control/Debug/Teat 
Reglat 
ro 


eRO from register 
I 00001111 
00100010 
11 
000 
regI 
17 
2 


CR2/CR3 from register 
00001111 
00100010 
11 
eee 
regI 
4 


Reg from CRO-3 
00001111 
00100000 
11 
eee 
regI 
4 


DRO-3 from register 
00001111 
00100011 
11 
eee 
reg' 
10 


DR6-7 from register 
00001111 
00100011 
11 
eee 
regI 
10 


Register from DR6-7 
00001111 
00100001 
11 
eee 
regI 
9 


Register from ORO-3 
00001111 
00100001 
11 
eee 
regI 
9 


TR3 from register 
00001111 
00100110 
11 
011 
regI 
4 


TR4-7 from register 
00001111 
00100110 
11 
eee 
regI 
4 


Register from TR3 
00001111 
00100100 
11 
011 
regI 
3 


Register from TR4-7 
00001111 
00100100 
11 
eee 
regI 
4 


CLTS - 
Clear Taak Switched Flag 
I 00001111 
00000110 
7 
2 


INVD = Invalidate Data Cache 
I 00001111 
00001000 
4 


WBINVD - 
Wrlte·Back and Invalidate I 00001111 
I 00001001 
I 
5 


OataCache 


INVLPG - 
Invalidate TLB Entry 


INVLPG memory 
I 00001111 
I 00000001 
I mod 111 
r/ml 
12/11 
HINH 


PREFIX BYTES 


Addre •• Size Prefix 
, 01100111 
, 
1 


LOCK ~ Bua Lock Prefix 
I 11110000 
I 
1 


Operand Size Prefix 
I 01100110 
I 
1 


Segment Override Prefix 


CS: 
00101110 
1 


OS: 
00111110 
1 


ES: 
00100110 
, 


FS: 
01100100 
1 


GS: 
01100101 
1 


55: 
I 00110110 
1 


• 
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INSTRUCTION 
FORMAT 
CacheHII 
Penally If 
Notea 
CacheMloa 


PROTECTION 
CONTROL 


ARPL ~ AdJuat Requeated 
Privilege Level 


From register 
1 01100011 
111 
reg1 reg21 
9 


From memory 
1 01100011 
1mod 
reg 
rim I 
9 


LAR ~ Load Acceaa Rlghta 


From register 
1 00001111 
1 00000010 
111 
reg1 reg21 
11 
3 


From memory 
1 00001111 
1 00000010 
Imod 
reg 
r/ml 
11 
5 


LGOT = Load Globa' Oeacrlptor 


Table 
register 
1 00001111 
1 00000001 
ImodOl0 
r/ml 
12 
5 


LIOT ~ Load Interrupt 
Daacrlptor 


Tableregister 
I 0000.1111 
1 00000001 
1mod 011 
rim I 
12 
5 


LLOT ~ Load Local Deacrlptor 


Table 
register 
from reg. 
1 00001111 
I 00000000 
111 
010 
reg1 
11 
3 


Table register from memo 
1 00001111 
1 00000000 
Imod010 
r/ml 
11 
6 


LMSW = Load Machine Statua Word 


From register 
1 00001111 
1 00000001 
111 
110 
reg! 
13 


From memory 
1 00001111 
1 00000001 
1mod 
110 
rim I 
13 
1 


LSL ~ Load Segment 
Umll 


From register 
I 00001111 
1 00000011 
111 
regl reg21 
10 
3 


1 
! 


i 


1mod 
r/ml 
From memory 
00001111 
00000011 
reg 
10 
6 


LTR = Load Taok Regleter 


From Register 
1 00001111 
1 00000000 
111 
011 
regI 
20 


From Memory 
I 00001111 
1 00000000 
1mod 011 
rim 1 
20 


SGOT ~ Store Global Deacrlptor 
Table 


1 00001111 
I 00000001 
Imod 000 
r/ml 
10 


SlOT = Store Interrupt Daacrlptor 
Table 


1 00001111 
1 00000001 
1mod 001 
r/ml 
10 


SLOT = Store Local Daacrlptor 
Table 


To register 
I 00001111 
1 00000000 
111 
000 
reg1 
2 


To memory 
1 00001111 
I 00000000 
1mod 000 
r/ml 
3 


SMSW ~ Store Meehlne Statua Word 


To register 
I 00001111 
1 00000001 
111 
100 
regI 
2 


To memory 
1 00001111 
I 00000001 
1mod 
100 
rim 1 
3 


" 
STR ~ Store Taak Reglater 


roregister 
I 00001111 
1 00000000 
111 
001 
reg! 
2 


To memory 
1 00001111 
I 00000000 
1mod 001 
rim 1 
3 


VERR ~ Verlly Reed Acoaaa 


Register 
1 00001111 
1 00000000 
111 
100 
r/ml 
11 
3 


Memory 
I 00001111 
1 00000000 
1mod 
100 
rim I 
11 
7 


VERW = Verlly Wrlte Accaaa 


roregister 
I 00001111 
1 00000000 
111 
101 
reg1 
11 
3 


To memory 
I 00001111 
1 00000000 
Imod 
101 
r/ml 
11 
7 


Table 
10.1.lnteI486TM 
SX Microprocessorlintel487™ 
SX Math 
CoProcessor 
Integer 
Clock Count Summary 
(Continued) 
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Table 10.1. Intel486TM SX Mlcroprocessor/lntel487™ 
SX Math 
CoProcessor Integer Clock Count Summary (Continued) 


INSTRUCTION 
FORMAT 
Cache Hit 
Pe",,1ty If 
Note. 
Cache Mias 


INTERRUPT 
INSTRUCTIONS 


INT n = Interrupt 
Type n 
I 11001101 
I 
type 
I 
INT+4/0 
RV/P.21 


INT 3 = Interrupt 
Type 3 
I 
11001100 
I 
INT+O 
21 


INTO = Interrupt 
4 " 
I 11001110 
I 
Overflow 
Aag sat 


Taken 
INT+2 
21 


NotTaken 
3 
21 


BOUND = Interrupt 
511 Detect 
I 01100010 
I mod 
reg 
r/ml 


V.lue 
Out Range 


Ifin range 
7 
7 
21 


If out of range 
INT+24 
7 
21 


IRET = Interrupt 
Return 
I 
11001111 
I 


Real ModelVirtual 
Mode 
15 
8 


Protected Mode 


To same 
level 
20 
11 
9 


To outer level 
36 
19 
9 


To nested task (EFLAGS.NT 
- 
1) 
TS+32 
4 
9,10 


Extemallntarrupt 
INT+11 
21 


NMI - 
Non-Mukable 
Interrupt 
INT+3 
21 


Page Fault 
INT+24 
21 


VM86 Exception. 


ell 
INT+8 
21 


STI 
INT+8 
21 


INTn 
INT+9 


PUSHF 
INT+9 
21 
POPF 
INT+8 
21 


IRET 
INT+9 


IN 


Fixed Port 
INT+50 
21 


Variable Port 
INT+51 
21 


OUT 


Fixed Port 
INT+50 
21 


Variable Port 
INT+51 
21 


INS 
INT+50 
21 


OUTS 
INT+50 
21 


REP INS 
INT+51 
21 


REPOUTS 
INT+51 
21 


Table 10.1(a).Task Switch Clock Counts Table 


Value forTS 
Method 
Cache 
Miss 
Hit 
Penalty 


VM/lnte1486TMOXCPU-lntel486 SX CPU-lnteI487TM SX MCP/286 TSS To 
162 
55 
Intel486 OXCPU-lntel486 SX CPU-lntel487 SX MCPTSS 
VMllntel486 OXCPU-lntel486 SX CPU-lntel487 SX MCP/286 TSS To 286 TSS 
143 
31 
VMllntel486 OXCPU-lntel486 SX CPU-lntel487 SX MCP/286 TSS To VM TSS 
140 
37 


• 
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Table 
10.1(b). Interrupt 
Clock Counts Table 


Method 
Value for INT 


Cache Hit 
Miss Penalty 
Notes 


Real Mode 
26 
2 


Protected 
Mode 
Interrupt/Trap 
gate, same level 
44 
6 
9 
Interrupt/Trap 
gate, different 
level 
71 
17 
9 
Task Gate 
37 + TS 
3 
9, 10 


Virtual Mode 
Interrupt/Trap 
gate, different 
level 
82 
17 
Task gate 
37 + TS 
3 
10 


Abbreviations 
16/32 
UlL 
MN/MX 
L/NL 
RV/P 
R 
P 
TINT 
H/NH 


Definition 
16/32 bit modes 
unlocked/locked 
minimum/maximum 
loop/no loop 
real and virtual mode/protected mode 
real mode 
protected mode 
taken/not taken 
hit/no hit 


NOTES: 
1. Assuming that the operand address and stack address fall in different cache sets. 
2. Always locked. no cache hit case. 
3. Clocks = 10 + max(log2(lm!),n) 
m = multiplier value (min clocks for m= 0) 
n = 3/5 for ±m 
4. Clocks = (quotient(count/operand length)} "7 + 9 
= 8 if count,;; 
operand length (8/16/32) 
5. Clocks = (quotient(count/operand length)I"7+9 
= 9 if count,;; 
operand length (8/16/32) 
6. Equal/not equal cases (penalty is the same regardless of lock). 
7. Assuming that addresses for memory read (for indirection), stack push/pop, and branch fall in different cache sets. 
8. Penalty for cache miss: add 6 clocks for every 16 bytes copied to new stack frame. 
9. Add 11 clocks for each unaccessed descriptor load. 
10. Refer to task switch clock counts table for value of TS. 
11. Add 4 extra clocks to the cache miss penalty for each 16 bytes. 
For notes 12-13: (b = 0-3. non-zero byte number); 


(i = 0-1. non-zero nibble number); 
(n = 0-3. non bit number in nibble); 
12. Clocks = 8+4 
(b+1) 
+ 3(i+1) 
+ 3(n+1) 
= 6 if second operand = 0 
13. Clocks = 9+4(b+ 
1) + 3(i+ 1) + 3(n+ 1) 
= 7 if second operand = 0 
For notes 14-15: (n = bit position 0-31) 
14. Clocks = 7 + 3(32-n) 
6 if second operand = 0 
15. Clocks = 8 + 3(32-n) 
7 if second operand = 0 
16. Assuming that the two string addresses fall in different cache sets. 
17. Cache miss penalty: add 6 clocks for every 16 bytes compared. Entire penalty on first compare. 
18. Cache miss penalty: add 2 clocks for every 16 bytes of data. Entire penalty on first load. 
19. Cache miss penalty: add 4 clocks for every 16 bytes moved. 


(1 clock for the first operation and 3 for the second) 
20. Cache miss penalty: add 4 clocks for every 16 bytes scanned. 


(2 clocks each for first and second operations) 
21. Refer to interrupt clock counts table for value of INT 
22. Clock count includes one clock for using both displacement and immediate. 
23. Refer to assumption 6 in the case of a cache miss. 


Table 10.2.lnteI486™ SX Mlcroprocessor/lntel487™ 
SX Math CoProcessor 
I/O Instructions 
Clock Count Summary 


Real 
Protected 
Protected 
Vlrlual86 
INSTRUCTION 
FORMAT 
Mode 
Mode 
Notel 
Mode 
(CPL,;;IOPL) 
(CPL>IOPL) 
Mode 


I/O INSTRUCTIONS 


IN = Inputlrom: 


Fixed Port 
1111001 
Ow I 
port number 
1 
14 
9 
29 
27 


VartablePort 
I 111011 
Ow I 
14 
8 
28 
27 


OUT = Output 
to: 


Fixed Port 
11110011 
wi 
port number 
1 
16 
11 
31 
29 


Vartable Port 
11110111 
wi 
16 
10 
30 
29 


INS = Input Byte/Word 
1 011011 
Ow 1 
17 
10 
32 
30 


from 
DXPorl 


OUTS = Output 
Byte/Word 
1 0110111 
w 1 
17 
10 
32 
30 
1 


to DX Pori 


REP INS = Input String 
I 11110010 
10110110wl 
16+8c 
10+8c 
30+8c 
29+8c 
2 


REP OUTS = Output 
String 
1 1111001010110111W! 
17+5c 
11+5c 
31+5c 
30+5c 
3 


NOTES: 
1. Two clock 
cache 
miss penalty 
in all cases. 


2. c = count 
in CX or ECX. 


3. Cache 
miss penalty 
in all modes: 
Add 2 clocks 
for every 
16 bytes. 
Entire penalty 
on second 
operation. • 
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Table 
10.3. Intel487TM 
SX Math CoProceuor 
Floating 
Point 
Clock 
Count 
SUmmary 


CMlleHlt 
c-. 
Execullon 
"""'11 
Avt(Lower 
INSTRUCTION 
FORMAT 
Avto.- 
CMlle_ 
Nolea 


=~l 
flInoe.·. 
Upper ••••••• 1 


DATATRANSFER 


FLD - _LOlld 
to ST(OI 


32-bitmemcxy 
111011 
001!mod 
000 
rIm! 
a+b/dLtp. 
I 
3 
2 


64-bit memo<)' 
111011 
101!mod 
000 
rIm! 
a+b/dLtp. 
I 
3 
3 


8().b~memcxy 
111011 
0111mod 
101 
rlml 
a+b/dlsp. 
I 
6 
4 


ST(I) 
111011 
oOlh 
1000 
ST(QI 
4 


FlLD - Integer Load to ST(O) 


16-bitmemcxy 
h 
1011 
111!modOOO 
rImI 
a+b/dLtp. 
I 
14.5(13-16) 
2 
4 


32·M memory 
111011 
0111modOOO 
r/ml 
t+b/dlsp. 
I 
11.5(9-12) 
2 
4(2-4) 


64-bit memory 
hl011 
l111mod 
101 
rImI 
s-l-b/dLtp. 
I 
16.6(10-18) 
3 
7.8(2-8) 


F18LD= BCDLoad to 8T(0) 
111011 
l111mod 
100 
r/ml 
a+b/dLtp. 
I 
75(70-103) 
4 
7.7(2-B} 


FST - Store Reallrom ST(O) 


32-bitmemcxy 
111011 
oOllmod 
010 
rImI 
a+b/dLtp. 
I 
7 
1 


64-bitmemcxy 
hl0l1 
10 I! mod 010 
rImI 
s-l-b/dLtp. I 
8 
2 


STQ) 
111011 
101111010 
ST(QI 
3 


FSTP - Store Rea' from ST(Ollllld Pop 


32-bit memcxy 
111011 
011!modOll 
rIm! 
s-Hl/dLtp. I 
7 
1 


64-bit memcxy 
111011 
1011mod 
011 
rIm I 
a+b/dLtp. 
I 
8 
2 


8().b~memory 
111011 
0111mod 
111 
rImI 
a+b/dLtp. 
I 
8 


ST(I} 
111011 
101111001 
sTml 
3 


FIST - Store InletS, from ST(O) 


16-bitmemcxy 
111011 
111!mod 
010 
rIm 1 
a+b/dLtp. 
1 
33.4(29-34) 


32·bit memory 
111011 
011!mod 
010 
r/ml 
a+b/dLtp 
I 
32.4(28-34) 


FlSTP - Store Integer from ST(Oland Pop 


16-bit memory 
111011 
l111mod 
011 
r/ml 
a+b/dLtp. 
I 
33.4(28-34) 


32-brtmemcxy 
hl011 
0111modOl1 
rIm! 
a+b/dLtp. 
I 
33.4(29-34) 


64-bit memory 
111011 
l111mod 
111 
rIm! 
a+b/dLtp. 
I 
33.4(28-34) 


F18STP- Store BCDfrom 
hl011 
l111mod 
110 
rIm! 
s-l-bl dLtp. I 
175(172-178) 
ST(O)lllld Pop 


FXCH - Exc:tlange ST(O)lllld ST(I) 
hl011 
oOlh 
1001 
ST(l)1 
4 


COMPARISONINSTRUCTIONS 


FCOM - Compare ST(O)with Real 


32-bit memory 
111011 
OOO!mod 
010 
rImI 
a+b/dlsp. 
I 
4 
2 
1 


64-bit memory 
hl011 
100lmod 
010 
rImI 
s-l-b/disp. 
I 
4 
3 
1 


ST(Q 
111011 
000111010 
ST(l)1 
4 
1 


FCOMP - Compare ST(O)willi Raallllld Pop 


32-bltmemcxy 
111011 
0001 mod 011 
rIm! 
a+b/dLtp. 
I 
4 
2 
1 


64-bit memcxy 
hl011 
100lmod 
011 
rIm! 
a+b/dLtp. 
I 
4 
3 
1 


ST(I) 
111011 
000111011 
ST(l)1 
4 
1 
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•••Table 10.3. Intel48."... SX Math COProcessor Floating Potnt Clock Count Summary (Continued) 


CacIleltlt 
e-rr.nt 


••••• 
If 
EUOUlIon 


IH8TRUCTlON 
FORMAT 
Avt(LoWer 
CllGhe •••• 
AVII(L_ 
Notes 
~ 
... 
:..-) 
Upper-"-e) 


COMPARISONINSTRUCTIONS(continUed) 


FCOMPP • COItIJleN81'(0) WIth 
1" 
0 11 
110h,0, 
10011 
5 
1 
81'(1) lIfld Pop Twloe 


FlCOM - Compare $T(O)wllhInte.-r 


16-bI1l1Klm01Y 
111 011 
110lmod 
010 
rIm 1 
s+bIdlsP. I 
18(16-20) 
2 
1 


32-bl1metllOlY 
!t,011 
010lmod 
010 
rIm 1 
s+b/dl8p. 
1 
16.5(15-17) 
2 
1 


FICOMP - Com,.... 
81'(0) WIth~ 


1$-1)j111Klm01Y 
!t,011 
110lmod 
011 
rIm 1 
s+b/dlsP. 
I 
18(16-20) 
2 
1 


32-bilmemory 
1" 
011 
ololmod 
011 
rlml 
s+b/dlsP. 
I 
16.5(15-17) 
2 
1 


FTST - Compere ST(O)with 0.0 
!t lOll 
001j,,10 
01001 
4 
1 


FUCOM- UlIonlered-.... 
1'1011 
101111100 
$T(l)1 
4 
I 
81'(0)WIth81'(1) 


F\lCOMP - u_-.... 
111011 
1011,1101 
8T(QI 
4 
1 
ST(O)WIth81'(1)lIfld Pop 


F\lCOMPP - ~-.... 
1,,011 
10,1,,101 
10011 
5 
I 
81'(0) WIthST(I)lIfld Pop Twloe 


FXAM- ExemIne STtO) 
111011 
00111110 
01011 
8 


CONSTANTS 


FLOZ - Load +0.0 lido 81'(0) 
111011 
OcOl11HO 
11101 
4 


FLOI - Load + 1.0 Into ST(O) 
I11011 
00111110 
1 Ocool 
4 


FLOPt - Load 11 Into ST(O) 
1" 
011 
00111110 
10111 
6 
2 


FLOL2T= Load k>l2(10) Into ST(O) 
1,,011 
0011,1,0 
1001 I 
8 
2 


Fl.Dl.ft = LoecIk>l2(e) Into 81'(0) 
!t 1011 
0011,110 
10101 
8 
2 


FLOLGi = LoecI1og1Ol2)lido ST(O) 
111011 
00Hll10 
11001 
8 
2 


FLDLN2 = LoadklU2l 
lIdoST(O) 
h lOll 
0011,110 
11011 
8 
2 


ARITlfMETIC 


FAOO = Add R4NlI WIthSTtO) 


ST(O)••.••ST(O)+ 32-bl1metllOlY 
!t,011 
oooimod 
000 
rIm I 
s+b/dlsP. I 
10(8-20) 
2 
7(5-17) 


81'(Oc) 
••••• 81'(0) + 64-bit metllOlY 
111011 
1001 mod 000 
rlml 
s+b/dlsP· 
I 
10(8-20) 
3 
7(5-17) 


ST(d) - 
8T(0) + 81'(1) 
!t 101 I 
dOOIt 
1000 
sTml 
10(8-20) 
7(6-17) 


FAOOP· 
Add •••• WlthST(O)lIfld 
1,,0ft 
1 I 01,1 
OOcO 
81'(1)1 
10(8-20) 
7(5-17) 
Pop(ST(I) - 
ST(O)+ 81'(1) 


FSU8 = Subtnct 
•••• frOm81'(0) 


ST(Oc)- ST(Oc)- 32-bl1rnernory 
111011 
OOOlmod 
100 
rImI 
s+b/dlsP. 
I 
10(8-201 
2 
r(5-17) 


81'(0)- 
81'(01- 64-bit1lKlmOlY 
laol1 
100lmod 
100 
rim I 
&oi-bldlsP· I 
10(6-20) 
3 
7(5-17) 


ST(d) - 
8T(0) - ST(l) 
111011 
doollil01 
81'(1)1 
10(8-20) 
7(5-17) 


FSU8f' = Subtnct 
•••• frOm ST(O) 
1,1011 
110111101 
8rIDI 
10(8-20) 
7(5-17) 
lIfldPop (STII)_ 
$T(O)- ST(I) 


•• 
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Table 
10.3. Intel487TM 
SX Math CoProcessor 
FIoaUng 
Point 
Clock 
Count 
Summary 
(Continued) 


C8ClleHil 
c:oncun-t 


PeneIty" 
Ell-.tlon 


INSTRUCTION 
FORMAT 
AVO~ 
C8Clle •••• 
AVO(Lower 
Nola 
u:,~) 
=.-.) 


ARITHMETIC(Continued) 


FlIU8fl = Subtract •••• ~ 
(Subtract STIO)f.- re8I) 


ST(O)+- 32-bit memooy - ST(O) 
111011 
0001_ 
101 
rIm 1 
a-i-b/diop. 
I 
10(8-20) 
2 
7(5-17) 


ST(O)+- 64-bi1memooy- ST(O) 
111011 
1001_ 
101 
rIm I 
a-i-b/diop. 
I 
10(8-20) 
3 
715-17) 


STld) +- ST(i) - ST(O) 
111011 
dOOIt 
1100 
ST(i)1 
10(8-20) 
715-17) 


fSU8RP 
= Subtract reel ~ 
111011 
110111100 
ST(QI 
10(8-20) 
7(5-17) 
_ 
Pop (8T(I)"" 
ST(I) - ST(O) 


FIIUL • 1IullIply •••• w11IlST(O) 


ST(O)+- ST(O)X 32-bi1memooy 
!t 1011 
0001_ 
001 
rIm I 
a-i-bl dIop. I 
11 
2 
8 


STIO)•.•• ST(O)X 64-bi1memooy 
111011 
1001_ 
001 
rImI 
a-i-b/diep. 
I 
14 
3 
11 


ST(d) +- STIO)X ST(I) 
!tl0ll 
dOOtl1001 
ST(I)\ 
18 
13 


FIIUU' • MultiplyST(O)w11Il5T(I) 
111011 
110111001 
ST(QI 
16 
13 
_ 
Pop (ST(I)+- 8TIO) x ST(I) 


FOIY• IltvIde ST(O)by AeeI 


STIO)+- STIO)/32-bit memooy 
111 011 
0001_ 
110 
rIm 1 
a-i-bIdiop. I 
73 
2 
70 
3 


ST(O)•.•• STIOj/64-bi1memory 
!t 1011 
1001_ 
100 
"m I 
a-i-bl diep. 
I 
73 
3 
70 
3 


STld) +- ST(O)/ST(i) 
111011 
dooll1111 
ST(I)I 
73 
70 
3 


FOIVP - IltvIde ST(O)by 8T(I)_ 
!t 1011 
110111111 
5T(i)1 
73 
70 
3 
Pop (ST(I)+- ST(O)/ST(I) 


FOIYR- 1ltvIde•••• ~ 
(Aee118T(0)) 


STIO)•.•• 32-bil memooy/STIO) 
111011 
0001_ 
111 
rImI 
a-i-bIdiep. 
1 
73 
2 
70 
3 


STIO)+- 64-b~ memooy/STIO) 
111011 
1001_ 
111 
rImI 
a-i-b/diep. 
I 
73 
3 
70 
3 


STld) +- ST(i)/ST(O) 
111011 
dooll1110 
5T(i)1 
73 
70 
3 


FOIVRP • 1ltvIde •••• ___ 
111011 
110j,1110 
5T(I)I 
73 
70 
3 
Pop (ST(I)+- 8T(l)/8T(O) 


FIAOO· 
Add ~to 
STIO) 


STIO)+- 5T(O) + 16ob1tmemory 
111011 
1101 m<llI 000 
rlnll 
a-i-bIdiep. 
1 
2.4(20-35) 
2 
7(5-17) 


8T(O)+- ST(O)+ 32-bit memooy 
111011 
0101_ 
000 
rImI 
a-i-bl dIop. 
! 
22.5(19-32) 
2 
7(5-17) 


FISU8 - llubtnIcl ~ 
f.- ST(D) 


ST(O)+- ST(O)- 16ob1tmemooy 
111011 
1101_ 
100 
rIm! 
a-i-bI dIop. 
1 
24(20-36) 
2 
715-17) 


5T(O)•.•• STIO)- 32·bl!memooy 
111011 
0101_ 
100 
rIm! 
a-i-b/diep. 
1 
22.5(19-32) 
2 
7(5-17) 


FISUBR = Integer Subtract ReverMd 


5T(O)+- 16ob1tmetnOIY- ST(O) 
!t 1 011 
1101_ 
101 
rIm 1 
a-i-bIdiop. 
1 
24120-35) 
2· 
7(5-17) 


ST(O)+- 32-bitmemooy - 51(0) 
111011 
0101_ 
101 
r/ml 
a-i-bI dIop. I 
22.5(19-32) 
2 
7(5-17) 


F111UL- IIullIply Integer w11IlST(O) 


ST(O)+- ST(O)x 16-b1tmemooy 
111011 
1101_ 
001 
rIm I 
a-i-bIdiop. 
1 
25123-27) 
2 
8 


5TIO)••• ST(O)x 32-bit memooy 
!t 1011 
0101_ 
001 
rIm I 
a-i-b/dl8p. 
! 
23.5(22-2.4) 
2 
8 


F1D1Y• ~ 
IltvIde 


ST(O)+- STIO)II6-b1t memooy 
111011 
1101_ 
110 
rIm! 
a-i-bldlop. 
! 
87(85-89) 
2 
70 
3 


Sl(D) +- STIO)/32-bi1 memooy 
111011 
0101_ 
110 
rim I 
a-i-bIdiop. 
1 
85.6(84-86) 
2 
70 
3 


inteiaP 


Table 10.3.lntel487T11SX Math CoProce8aor floating 
PoInt Clock Count Summary (Continued) 


ClIche HIt 
e-t 
......,N 
Ex-uon 


INSTRUCTION 
FORMAT 
A"II("- 
ClIche ••• 
A"S(~ 
IIol8a 
•••••••• 
Range ••• 


Upper ••••• 
) 
Upper ••••• 
) 


ARlTHlIETIC (Continued) 


FlDlYR = ~~ 
R.-..d 


ST(OI+-16-bi1 memory/ST(O) 
1" 
011 
11 01 mod 111 
rImI 
tH-b/dilp. 
I 
87(85-69) 
2 
70 
3 


ST(O)+- 32-bil memory/ST(O) 
j,1011 
010lmod 
111 
rIm I 
tH-b/dlep. 
I 
85.5(84-86) 
2 
70 
3 


FSQRT = IIquere Root 
1,1011 
001h,11 
10101 
85.5(83-87) 
70 


FlICALE = llc8le ST(O)by ST(ll 
j,1011 
00lhl11 
11011 
31(30-321 
2 


FXTRACT= ExtnIct compoo••nta 
1,101, 
00111111 
01001 
19(18-20) 
4(2-4) 


or8T(0) 


FPREM = Partial Reminder 
11 1011 
00111111 
10001 
84(70-1381 
2(2-8) 


FPREM1 = httlel 
RIIt1IIndet(IEEEI 
111011 
00111111 
01011 
94.5(72-167) 
5.5(2-18) 


FRNDINT= Round 8T(0) 10 Integer 
111011 
00111111 
11001 
29.1(21-30) 
7.4(2-8) 


FASS = AbeoIute v8lUeof ST(DI 
j, 1011 
00111110 
00011 
3 


FCHS = CIlentIe IIgn Of8T(0) 
1,1011 
00111110 
00001 
6 


TFlANSCENDENTAL 


FCOS = e-OfST(O) 
1,,01, 
001hll1 
11111 
241(193-279) 
2 
6,7 


FPTAN = hrlIeI 
tIIngent OfST(O) 
j, 1011 
oOlhlll 
00101 
244(200-273) 
70 
8.7 


FPATAN = hrlIeI M:IJlnglInt 
111011 
001hll1 
00111 
299{218-3031 
5(2-17) 
6 


FSlN = _OfST(O) 
1,,01, 
00,1,,11 
11 101 
241(193-279) 
2 
6,7 


FSlNCOS = ___ 
orST(O) 
111011 
001!tl11 
10111 
291(243-329) 
2 
8.7 


F2XMl = :zBTCOI - 
1 
1,101, 
0011,111 
00001 
242(140-279) 
2 
6 


FYL2X= ST(l) x !oga(ST(011 
j, 1011 
0011,1,1 
00011 
311(198-329) 
13 
6 


FYUXPl - ST(ll x IoI2(8T(OI + 1.0) 1, 1 0 1 1 
0011,111 
10011 
313(171-328) 
13 
6 


PFIOCESSORCONTIlOL 


FlNIT = lnIIIdze FPU 
111011 
01111110 
00111 
17 
4 


FSTSW AX - m-_ 
word 
b 1011 
11111110 
00001 
3 
5 
Into AX 


FSTSW = m-_word 
111011 
1011mod 
111 
rImI 
tH-b/dlep. 
I 
3 
5 
Into memory 


FLDCW - LolId-,""Word 
j, 1 01 I 
0011 mod 101 
rImI 
tH-b/dillp. 
I 
4 
2 


FSTCW - StON -'"" 
word 
j, 101 I 
OOllmod 
111 
rImI 
e-l-b/dlep. I 
3 
5 


FCLEX = Cleat exoepU- 
111 011 
011j,110 
00101 
7 
4 


FSTENV = m-erMIOl_ 
j, 101 I 
oOllmod 
110 
rImI 
tH-b/dilp. 
I 
Real and V_ mode8 16-bi1Address 
67 
4 
Real and Vir1uaImodes 32-bil Address 
67 
4 
Protec1ed mode 16-bl1Address 
58 
4 
Protected mode 32-bllAddreu 
58 
4 


FUl£NY - LolId _lIMIlt 
1,1011 
OOdmod 
100 
rIm I 
tH-b/dlep. 
I 
Real and Virtualmodes 16-b1tAddress 
44 
2 
Real and V_ mode8 32-bll Address 
44 
2 
Protected mode 16-bl1Address 
34 
2 
Protec1ed mode 32-b11Address 
34 
2 


• 
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Table 10.3. Intel487™ SX Math CoProcessor Floating Point Clock Count Summary (Continued) 


~ 


e-Illt 
Concun'ent 
Execution 
Pen8IlyIf 
Ave (I.oWlIr 
No•• 
FORMAT 
A"Il(Lower 
e- 
•••• 
IfIS'mUCTlON 
RMge ••• 
RMge ••• 
Upper Rtlnge) 
Upper ,...) 


PfIOCESSOA CONTROL(Continued) 
FSAVE- 
___ 
h,0" 
1011mod 
110 
rImI 
s-l-b/diap. 
I 
Real and VII1lJaJmodes 16-ll1tAddress 
154 
4 
Real and Virtualmodes 32-bi1Address 
154 
4 
Ptotected mode 16-b1tAddress 
143 
4 
Protected mode 32-bi1Address 
143 
4 
FflSTOA - R.tore 
__ 
1,1011 
1011mod 
100 
rImI 
&-iobl 
I 
Real and Virtualmodes 16-b1tAddress 
131 
23 
Real and VII1IJaImodes 32-bi1Address 
131 
27 
Protecled mode 16-b1tAddress 
120 
23 
Ptotected mode 32-bi1Address 
120 
27 


FINCSTP - lncr_t 
~ 
PoInter 
11 1 0 1 1 
00,1,11, 
01111 
3 


FDECSTP - o-t~ 
PoInterI'1 
011 
0011,111 
01101 
3 


FFREE - Free ST(1l 
1" 
011 
101h 
1000 
ST(I)I 
3 


FNOP - No oper_ 
1" 
011 
0011,101 
00001 
3 


WAIT- Welt until FPU I'elIdy 
I 10011011 
I 
(Minimum/Maximum) 
1/3 


NOTES: 
1. If operand is 0 clock counts = 27. 
2. If operand Is 0 clock counts = 28. 
3. If CW.PC ioolC8tes 24 bit precision then subtract 38 clocks. 
II CW.PC indicates 53 bit precision then subtract 11 clocks. 


4 If there is a numeric elTor pending from a previous instruction add 17 clocks. 
5: If there is a numeric error pending from a previous Instruction add 18 clocks. 
6. The INT pin is polled several times while this instruction is executing to assure short interrupt latency. 
7. If ABS(operand) 
is greater than 'l'f14then add n clocks. Where n = (operand/(7I'/4». 


10.2.1 OVERVIEW 


All instruction 
encodings 
are subsets 
of the general 
instruction 
format 
shown 
in Figure 10.1. Instructions 
consist 
of one or two primary opcode 
bytes, possibly 
an address 
specifier 
consisting 
of the 
"mod 
rIm" 
byte and "scaled 
index" 
byte, a displacement 
if re- 
quired, and an immediate 
data field if required. 


Within 
the primary 
opcode 
or opcodes, 
smaller 
en- 
coding 
fields 
may be defined. 
These 
fields vary ac- 
cording 
to the class 
of operation. 
The field.s def~ne 
such 
information 
as direction 
of the operation, 
size 
of the displacements, 
register 
encoding, 
or sign ex- 
tension. 


Almost 
all 
instructions 
referring 
to 
an 
operand 
in 
memory 
have 
an addressing 
mode 
byte 
following 
the primary 
opcode 
byte(s). 
This byte, the mod rIm 
byte, specifies 
the address 
mode to be used. Certain 
encodings 
of the 
mod rIm 
byte indicate 
a second 


addressing 
byte, the scale-index-base 
byte, follows 
the 
mod rIm 
byte 
to fully 
specify 
the 
addressing 
mode. 


Addressing 
modes 
can include 
a displacement 
im- 
mediately 
following 
the mod rim byte, or scaled 
in- 
dex byte. If a displacement 
is present, 
the possible 
sizes are 8, 16 or 32 bits. 


If the 
instruction 
specifies 
an immediate 
operand, 
the 
immediate 
operand 
follows 
any 
displacement 
bytes. The immediate 
operand, 
if specified, 
is always 
the last field of the instruction. 


Figure 
10.1 illustrates 
several 
of the fields 
that can 
appear 
in an instruction, 
such as the mod field and 
the rim field, but the Figure does not show all fields. 
Several 
smaller 
fields also appear 
in certain 
instruc- 
tions, 
sometimes 
within 
the 
opcode 
bytes 
them- 
selves. Table 
10.4 is a complete 
list of all fields 
ap- 
pearing 
in 
the 
Intel486 
SX 
microprocessor/lntel 
OverDrive 
Processor 
instruction 
set. Further 
ahead, 
following 
Table 
10.4, are detailed 
tables 
for each 
field. 
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ITTTTTTTT 
ITTTTTTTT 
I mod TTT rim I ss index base Id32116181 none data32 116181 none 
:z 
0 7 
~\7 
6 5 3 2 0 }\7 
6 5 3 2 0J\ 
} \ 
} 


T 
T 
------ 


register and address 
mode specifier 


opcode 
(one or two bytes) 
(T represents an 
opcode bit.) 


"mod rim" 
byte 
immediate 
data 
(4, 2, 1 bytes 
or none) 


address 
displacement 
(4, 2, 1 bytes 
or none) 


Table 10.4. Fields within Intel486TM SX Microprocessorl 
Intel OverDrive Processor Instructions 


Field Name 
Description 
Number of Bits 


w 
Specifies if Data is Byte or Full Size (Full Size is either 16 or 32 Bits 
1 
d 
Specifies Direction of Data Operation 
1 
s 
Specifies if an Immediate Data Field Must be Sign-Extended 
. 1 
reg 
General Register Specifier 
3 
mod rim 
Address Mode Specifier (Effective Address can be a General Register) 
2 for mod; 
3 for rim 
ss 
Scale Factor for Scaled Index Address Mode 
2 
index 
General Register to be used as Index Register 
3 
base 
General Register to be used as Base Register 
3 
sreg2 
Segment Register Specifier for CS, SS, DS, ES 
2 
sreg3 
Segment Register Specifier for CS, SS, DS, ES, FS, GS 
3 
mn 
For Conditional Instructions, Specifies a Condition Asserted 
or a Condition Negated 
4 


NOTE: 
Tables 
10.1-10.3 
show encoding 
of individual 
instructions. 


10.2.2 32-BIT EXTENSIONS OF THE 
INSTRUCTION SET 


With the Intel486 SX microprocessorlintel 
Over- 
Drive Processor, the 8086/80186/80286 
instruction 
set is extended in two orthogonal directions: 32-bit 
forms of all 16-bit instructions are added to support 
the 32-bit data types, and 32-bit addressing modes 
are made available for all instructions referencing 
memory. This orthogonal instruction set extension is 
accomplished having a Default (D) bit in the code 
segment descriptor, and by having 2 prefixes to the 
instruction set. 


Whether the instruction defaults to operations of 
16 bits or 32 bits depends on the setting of the D bit 
in the code segment descriptor, which gives the de- 
fault length (either 32 bits or 16 bits) for both oper- 
ands and effective addresses when executing that 
code segment. In the Real Address Mode or Virtual 
8086 
Mode, no code 
segment 
descriptors 
are 
used, but a D value of 0 is assumed internally by 
the 
Intel486 
SX microprocessorllntel 
OverDrive 


Processor when operating in those modes (for 16-bit 
default 
sizes compatible 
with the 8086/801861 
80286). 


Two prefixes, the Operand Size Prefix and the Effec- 
tive Address Size Prefix, allow overriding individually 
the Default selection of operand size and effective 
address size. These prefixes may precede any op- 
code bytes and affect only the instruction they pre- 
cede. If necessary, one or both of the prefixes may 
be placed before the opcode bytes. The presence of 
the Operand Size Prefix and the Effective Address 
Prefix will toggle the operand size or the effective 
address size, respectively, to the value "opposite" 
from the Default setting. For example, if the default 
operand size is for 32-bit data operations, then pres- 
ence of the Operand Size Prefix toggles the instruc- 
tion to 16-bit data operation. As another example, if 
the default effective address size is 16 bits, pres- 
ence of the Effective Address Size prefix toggles the 
instruction to use 32-bit effective address computa- 
tions. 


• 


These 32-bit extensions are available in all Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor 
modes, including the Real Address Mode or the Vir- 
tual 8086 Mode. In these modes the default is al- 
ways 16 bits, so prefixes are needed to specify 
32-bit operands or addresses. For instructions with 
more than one prefix, the order of prefixes is unim- 
portant. 


Unless specified otherwise, instructions with 8-bit 
and 16-bit operands do not affect the contents of 
the high-order bits of the extended registers. 


10.2.3 ENCODING OF INTEGER 
INSTRUCTION FIELDS 


Within the instruction are several fields indicating 
register selection, addressing mode and so on. The 
exact encodings of these fields are defined immedi- 
ately ahead. 


10.2.3.1 Encoding of Operand Length (w) Field 


For any given instruction performing a data opera- 
tion, the instruction is executing as a 32-bit operation 
or a 16-bit operation. Within the constraints of the 
operation size, the w field encodes the operand size 
as either one byte or the full operation size, as 
shown in the table below. 


Operand Size 
Operand Size 
wFleld 
During 16·81t 
During 32·81t 
Data Operations 
Data Operations 


0 
8 Bits 
8 Bits 
1 
16 Bits 
32 Bits 


10.2.3.2 Encoding of the General 
Register (reg) Field 


The general register is specified by the reg field, 
which may appear in the primary opcode bytes, or as 
the reg field of the "mod r/m" byte, or as the rIm 
field of the "mod r/m" byte. 


intel~ 


Encoding of reg Field When w Field 
Is not Present In Instruction 


Register Selected 
Register Selected 
reg Field 
During 16·81t 
During 32·81t 
Data Operations 
Data Operations 


000 
AX 
EAX 
001 
CX 
ECX 
010 
OX 
EOX 
011 
BX 
EBX 
100 
SP 
ESP 
101 
BP 
EBP 
110 
SI 
ESI 
111 
01 
EOI 


Encoding of reg Field When w Field 
Is Present In Instruction 


Register Specified by reg Field 
During 16·81tData Operations: 


Function of w Field 
reg 
(whenw 
= 0) 
(whenw 
= 1) 


000 
AL 
AX 
001 
CL 
CX 
010 
OL 
OX 
011 
BL 
BX 
100 
AH 
SP 
101 
CH 
BP 
110 
OH 
SI 
111 
BH 
01 


Register Specified by reg Field 
During 32·81t Data Operations 


Function of w Field 
reg 
(whenw 
= 0) 
(when w = 1) 


000 
AL 
EAX 
001 
CL 
ECX 
010 
OL 
EOX 
011 
BL 
EBX 
100 
AH 
ESP 
101 
CH 
EBP 
110 
OH 
ESI 
111 
BH 
EOI 


10.2.3.3 Encoding of the segment 
Register (sreg) Field 


The sreg field in certain instructions is a 2-bit field 
allowing one of the four 80286 segment registers to 
be specified. The sreg field in other instructions is a 
3-bit field, allowing the Intel486 SX microprocessorl 
Intel OverDrive Processor FS and GS segment reg- 
isters to be specified. 


2-Blt 
Segment 


sreg2 Field 
Register 
Selected 


00 
ES 
01 
CS 
10 
SS 
11 
DS 


3-Blt 
Segment 


sreg3 Field 
Register 
selected 


000 
ES 
001 
CS 
010 
SS 
011 
DS 
100 
FS 
101 
GS 
110 
do not use 
111 
do not use 


10.2.3.4 Encoding of Address Mode 


Except for special instructions, such as PUSH or 
POP,where the addressing mode is pre-determined, 
the addressing mode for the current instruction is 
specified by addressing bytes following the primary 
.opcode. The primary addressing byte is the "mod 
rIm" byte, and a second byte of addressing informa- 
tion, the "s-i-b" 
(scale-index-base) byte, can be 
specified. 


The s-i-b byte (scale-index-base byte) is specified 
when using 32-bit addressing mode and the "mod 
rIm" byte has rIm = 100 and mod = 00,01 or 10. 
When the sib byte is present, the 32-bit addressing 
mode is a function of the mod, ss, index, and base 
fields. 


The primary addressing byte, the "mod rIm" byte, 
also contains three bits (shown as TTT in Figure 
10.1) sometimes used as an extension of the pri- 
mary opcode. The three bits, however, may also be 
used as a register field (reg). 


When calculating an effective address, either 16-bit 
addressing or 32-bit addressing is used. 16-bit ad- 
dressing uses 16-bit address components to calcu- 
late the effective address while 32-bit addressing 
uses 32-bit address components to calculate the ef- 
fective address. When 16-bit addressing is used, the 
"mod rIm" byte is interpreted as a 16-bit addressing 
mode specifier. When 32-bit addressing is used, the 
"mod rIm" byte is interpreted as a 32-bit addressing 
mode specifier. 


Tables on the following three pages define all en- 
codings of all 16-bit addressing modes and 32-bit 
addressing modes. 
• 


mod rIm 
Effective 
Address 


00000 
OS:[BX+SI] 
00001 
OS:[BX+OI] 
00010 
SS:[BP+SI] 
00011 
SS:[BP+OI] 
00100 
OS:[SI] 
00101 
OS: [01] 
00110 
OS:d16 
00 111 
OS:[BX] 


01000 
OS: [BX + SI + dB] 
01 001 
OS: [BX + 01 + dB] 
01 010 
SS: [BP + SI + dB] 
01 011 
SS: [BP + 01 + dB] 
01 100 
OS:[SI+dB] 
01 101 
OS:[OI+dB] 
01 110 
SS:[BP+dB] 
01 111 
OS:[BX+dB] 


Register 
Specified 
by rIm 
During 16·Blt Data Operations 


mod rIm 
Function 
of w Field 


(whenw=O) 
(when w = 1) 


11000 
AL 
AX 
11 001 
CL 
CX 
11 010 
OL 
OX 
11 011 
BL 
BX 
11 100 
AH 
SP 
11 101 
CH 
BP 
11110 
OH 
SI 
11111 
BH 
01 


mod rIm 
Effective 
Address 


10000 
OS:[BX+SI+d16] 
10001 
OS:[BX+01+d16] 
10010 
SS:[BP+SI+d16] 
10011 
SS:[BP+ 
01+ d16] 
10100 
OS:[SI+d16] 
10 101 
OS:[01+d16] 
10110 
SS:[BP+d16] 
10111 
OS:[BX+d16] 


11 000 
register-see 
below 
11 001 
register-see 
below 
11 010 
register-see 
below 
11 011 
register-see 
below 
11 100 
register-see 
below 
11 101 
register-see 
below 
11110 
register-see 
below 
11111 
register-see 
below 


Register 
Specified 
by rIm 
During 32·Blt Data Operations 


mod rIm 
Function 
of w Field 


(when w=O) 
(when w = 1) 


11 000 
AL 
EAX 
11 001 
CL 
ECX 
11 010 
OL 
EOX 
11 011 
BL 
EBX 
11 100 
AH 
ESP 
11 101 
CH 
EBP 
11 110 
OH 
ESI 
11111 
BH 
EOI 
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mod rIm 
Effective 
Address 


00000 
DS:[EAX] 
00001 
DS:[ECX] 
00010 
DS:[EDX] 
00011 
DS:[EBX] 


00100 
s-i-b is present 
00101 
DS:d32 
00110 
DS:[ESI] 
00 111 
DS:[EDI] 


01000 
DS:[EAX+d8] 
01001 
DS:[ECX+d8] 
01 010 
DS:[EDX+d8] 
01 011 
DS:[EBX+d8] 
01100 
s-j-b is present 
01 101 
SS:[EBP+d8] 
01 110 
DS:[ESI+d8] 
01 111 
DS:[EDI+d8] 


Register 
Specified 
by reg or rIm 
During 
16·Blt Data Operations: 


mod rIm 
Function 
of w field 


(when w=O) 
(whenw= 
1) 


11 000 
AL 
AX 


11 001 
CL 
CX 


11 010 
DL 
DX 
11 011 
BL 
BX 


11100 
AH 
SP 


11 101 
CH 
BP 


11110 
DH 
SI 


11111 
BH 
DI 


mod rIm 
Effective 
Address 


10000 
DS: [EAX + d32] 


10001 
DS: [ECX + d32] 


10010 
DS: [EDX + d32] 


10011 
DS:[EBX+d32] 


10100 
s-i-b is present 


10 101 
SS: [EBP + d32] 


10110 
DS:[ESI+d32] 


10111 
DS:[EDI+d32] 


11 000 
register-see 
below 


11 001 
register-see 
below 


11 010 
register-see 
below 


11 011 
register-see 
below 


11 100 
register-see 
below 


11 101 
register-see 
below 


11 110 
register-see 
below 


11111 
register-see 
below 


Register 
Specified 
by reg or rIm 


During 32·Bit Data Operations: 


mod rIm 
Function 
of w field 


(whenw=O) 
(when w= 
1) 


11 000 
AL 
EAX 


11 001 
CL 
ECX 


11 010 
DL 
EDX 


11 011 
BL 
EBX 


11 100 
AH 
ESP 
11 101 
CH 
EBP 


11110 
DH 
ESI 


11111 
BH 
EDI 
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mod base 
Effective 
Address 


00000 
os: [EAX + (scaled index)] 
00001 
OS: [ECX + (scaled index)] 
00010 
OS: [EOX + (scaled index)] 
00011 
OS: [ESX + (scaled index)] 
00100 
SS: [ESP + (scaled index)] 
00101 
OS: [d32 + (scaled index)] 
00110 
OS: [ESI + (scaled index)] 
00111 
OS: [EOI + (scaled index)] 


01000 
OS: [EAX + (scaled index) + d8] 
01001 
OS: [ECX + (scaled index) + d8] 
01010 
OS: [EOX + (scaled index) + d8] 
01 011 
OS: [ESX + (scaled index) + d8] 
01 100 
SS: [ESP + (scaled index) + d8] 
01 101 
SS: [ESP + (scaled index) + d8] 
01 110 
OS: [ESI + (scaled index) + d8] 
01 111 
OS: [EOI + (scaled index) + d8] 


10000 
OS: [EAX + (scaled index) + d32] 
10001 
OS: [ECX + (scaled index) + d32] 
10010 
OS: [EOX + (scaled index) + d32] 
10011 
OS: [ESX + (scaled index) + d32] 
10100 
SS: [ESP + (scaled index) + d32] 
10101 
SS: [ESP + (scaled index) + d32] 
10110 
OS: [ESI + (scaled index) + d32] 
10111 
OS: [EOI + (scaled index) + d32] 


NOTE: 
Mod field in "mod rim" 
byte; ss, index, base fields in 


"s-i-b" byte. 


ss 
SCale Factor 


00 
x1 
01 
x2 
10 
x4 
11 
x8 


Index 
Index Register 


000 
EAX 
001 
ECX 
010 
EOX 
011 
ESX 
100 
no index reg" 
101 
ESP 
110 
ESI 
111 
EOI 


--IMPORTANT 
NOTE: 
When index field is 100, indicating "no index register," then 
ss field MUST equal 00. If index is 100 and ss does not 
equal DO,the effective address is undefined. 
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10.2.3.5 
Encoding 
of Operation 
Direction 
(d) Field 


In many two-operand 
instructions 
the d field is pres- 
ent 
to 
indicate 
which 
operand 
is considered 
the 
source 
and which 
is the destination. 


d 
Direction 
of Operation 


0 
Register/Memory 
<- - Register 
"reg" 
Field Indicates 
Source Operand; 
"mod r/m" 
or "mod ss index base" 
Indicates 
Destination 
Operand 


1 
Register <- - Register/Memory 
"reg" 
Field Indicates 
Destination 
Operand; 
"mod 
r/m" 
or "mod ss index base" 
Indicates 
Source Operand 


10.2.3.6 
Encoding 
of Sign-Extend 
(s) Field 


The s field 
occurs 
primarily 
to instructions 
with im- 
mediate 
data fields. The s field has an effect 
only if 
the size of the immediate 
data is 8 bits and is being 
placed 
in a 16-bit or 32-bit destination. 


Effect 
on 
Effect 
on 
s 
Immediate 
Immediate 
Data8 
Data 16132 


0 
None 
None 


1 
Sign-Extend 
Data8 to Fill 
None 
16-Bit or 32·Bit Destination 


10.2.3.7 
Encoding 
of Conditional 
Test 
(tUn) 
Field 


For the 
conditional 
instructions 
(conditional 
jumps 
and set on condition), 
mn is encoded 
with n indicat- 
ing to use the condition 
(n = 0) or its negation 
(n = 1), 
and m giving the condition 
to test. 


Mnemonic 
Condition 
tttn 


0 
Overflow 
0000 


NO 
No Overflow 
0001 
B/NAE 
Below/Not 
Above or Equal 
0010 
NB/AE 
Not Below/Above 
or Equal 
0011 
E/Z 
Equal/Zero 
0100 
NE/NZ 
Not Equal/Not Zero 
0101 
BE/NA 
Below or Equal/Not Above 
0110 
NBE/A 
Not Below or Equal/Above 
0111 
S 
Sign 
1000 
NS 
Not Sign 
1001 
PIPE 
Parity/Parity Even 
1010 
NP/PO 
Not Parity/Parity Odd 
1011 
LlNGE 
Less Than/Not 
Greater or Equal 
1100 
NL/GE 
Not Less Than/Greater 
or Equal 
1101 
LE/NG 
Less Than or Equal/Greater 
Than 
1110 
NLE/G 
Not Less or Equal/Greater 
Than 
1111 


10.2.3.8 
Encoding 
of Control 
or Debug 
or Test 
Register 
(eee) 
Field 


For the loading 
and storing 
of the Control, 
Debug 
and Test registers. 


eeeCode 
Reg Name 


000 
CRO 
010 
CR2 
011 
CR3 


Do not use any other encoding 


eeeCode 
Reg Name 


000 
DRO 
001 
DR1 
010 
DR2 
011 
DR3 
110 
DR6 
111 
DR7 


Do not use any other encoding 


eeeCode 
Reg Name 


011 
TR3 
100 
TR4 
101 
TR5 
110 
TR6 
111 
TR7 


Do not use any other encoding 
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Instruction 
Optional 
First Byte 
Second Byte 
Fields 


11011 
OPA 
1 
mod 
1 
I 
OPB 
rim 
s-i-b 
I 
disp 


11011 
MF 
OPA 
mod 
OPB 
rim 
s-i-b 
I 
disp 


11011 
d 
P 
OPA 
1 
1 
OPB 
ST(i) 


11011 
0 
0 
1 
1 
1 
1 
I 
OP 
11011 
0 
1 
1 
1 
1 
1 
I 
OP 


10.2.4 ENCODING OF FLOATING POINT 
INSTRUCTION FIELDS 


Instructions for the FPU assume one of the five 
forms shown in the following table. In all cases. 
instructions are at least two bytes long and begin 
with the bit pattern 11011B. 


OP = Instruction opcode, possible spilt into two 
fields OPA and OPB 


MF = Memory Format 
00---32-bit real 
01-32-bit 
integer 
10-64-bit 
real 
11-16-bit 
integer 


P = Pop 
0-00 
not pop stack 
1-Pop 
stack after operation 


d = Destination 
o-Destination 
is ST(O) 
1-Destination 
is ST(i) 


R XOR d = o-Destinatlon 
(op) Source 
R XOR d = 1-50urce 
(op) Destination 


ST(i) = Register stack element i 
000 = Stack top 
001 = Second stack element 
••• 
111 = Eighth stack element 


mod (Mode field) and rim (Register/Memory 
specifter) have the same interpretation as the 
corresponding fields of the integer instructions. 


s-i-b (Scale Index Base) byte and di$p (displace- 
ment) are optionally present in instructions that 
have mod and rim fields. Their presence de- 
pends on the values of mod and rim, as for inte- 
ger instructions. 


11.0 
DIFFERENCES 
BETWEEN 
THE 
Intel486™ 
SX 
MICROPROCESSOR/Intel 
OverDrive 
PROCESSOR 
AND 
THE Intel386™ 
MICROPROCESSOR 
PLUS THE 
Intel387TM 
MATH 
COPROCESSOR 
EXTENSION 


The differences between the Intel486 SX microproc- 
essor/Intel 
OverDrive Processor and the Intel386 
microprocessor are due to performance enhance- 
ments. The differences are listed below. 


1. Instruction clock counts have been reduced to 
achieve higher performance. See Section 10. 


2. The Intel486 SX microprocessor/Intel OverDrive 
Processor bus is significantly faster than the In- 
tel386 microprocessor bus. Differences include a 
1X clock, parity support, burst cycles, cacheable 
cycles, cache invalidate cycles and 8-bit bus sup- 
port. The Hardware Interface and Bus Operation 
Sections (Sections 6 and 7) of the data sheet 
should 
be 
carefully 
read to 
understand the 
Intel486 
SX 
microprocessor/Intel 
OverDrive 
Processor bus functionality. 


3. To support the on-chip cache new bits have been 
added to control register 0 (CD and NW) (Section 
2.1.2.1), new pins have been added to the bus 
(Section 6) and new bus cycle types have been 
added (Section 7). The on-chip cache needs to 
be enabled after reset by clearing the CD and 
NW bit in CRO. 


4. Six new instructions have been added: 


Byte Swap (BSWAP) 
Exchange-and-Add (XADD) 
Compare and Exchange (CMPXCHG) 
Invalidate Data Cache (INVD) 
Write-back and Invalidate Data Cache (WBINVD) 
Invalidate TLB Entry (INVLPG) 
5. There are two new bits defined in control register 
3, the page table entries and page directory en- 
tries (PCD and PWT) (Section 4.5.2.5). 


6. A new page protection feature has been added. 


This feature required a new bit in control register 
o (WP) (Section 2.1.2.1 and 4.5.3). 


7. A new Alignment Check feature has been add- 
ed. This feature required a new bit in the flags 
register (AC) (Section 2.1.1.3) and a new bit in 
control register 0 (AM) (Section 2.1.2.1). 


8. The replacement algorithm for the translation 
lookaside buffer has been changed from a ran- 
dom algorithm to a pseudo least recently used 
algorithm like that used by the on-chip cache. 
See Section 5.5 for a description of the algo- 
rithm. 
9. Three new testability registers, TR3, TR4 and 
TR5, have been added for testing the on-chip 
cache. TLB testability has been enhanced. See 
Section 8. 
10. The prefetch queue has been increased from 16 
bytes to 32 bytes. A jump always needs to exe- 
cute after modifying code to guarantee correct 2 
execution of the new instruction. 
11. After reset, the ID in the upper byte of the DX 
register is 04. The contents of the base regis- 
ters including the floating point registers may 
be different after reset. 


In addition to the previouslYmentioned enhance- 
ments, the Intel OverDrive Processor offers the 
following features: 


1. The complete InteI387 Math CoProcessor in- 
struction set and register set have been add- 
ed. No I/O 
cycles are performed during 
Floating Point instructions. The instruction 
and data pointers are set to 0 after FINIT/ 
FSAVE. Interrupt 9 can no longer occur, in- 
terrupt 13 occurs instead. 
2. The Intel OverDrive Processor supports new 
floating point error reporting modes to guar- 
antee DOS compatibility. These new modes 
required a new bit in control register 0 (NE) 
(Section 2.1.2.1) and new pins (FERR# and 
IGNNE#) (Section 6.2.13 and 7.2.14). 
3. In some cases FERR<# is asserted when the 
next floating point instruction is encountered 
and in other cases it is asserted before the 
next floating point instruction is encountered, 
depending upon the execution state the in- 
struction causing exception (see Sections 
6.2.13 and 7.2.14). For both of these cases, 
the Inte1387Math CoProcessor asserts ER- 
ROR# when the error occurs and does not 
wait for the next floating point instruction to 
be encountered. 
4. After reset, the 10 in the upper byte of the OX 
register is 04. The contents of floating point 
registers may be different after reset 


12.0 
DIFFERENCES 
WITH THE 
Intel486TM SX 
MICROPROCESSOR 
IN PGA vs PQFP 


The differences between the Intel486 SX Microproc- 
essor in PGA package versus PQFP package are 
listed below for quick reference when converting a 
system design from PGA to one in PQFP. 
1. Sections 13.4.1 and 13.4.2 show the derating 
curves for the Intel486 SX Microprocessor in PGA 
and PQFP respectively. The Intel486 SX Micro- 
processor in PQFP has smaller output buffers 
than the ones in the PGA, hence the difference in 
the derating curves. The systems/boards must be 
evaluated for sufficient timing margins when con- 
verting from the Intel486 SX Microprocessor in 
PGA to the Intel486 SX Microprocessor in PQFP. 
Please note that both versions of the CPU have 
the same AC characteristics at 50 pF loads. For 
other loads (pF), an appropriate delay (ns) should 
be added (or subtracted to the AC timings de- 
pending on the package (PGA or PQFP) being 
used for the Intel486 SX Microprocessor. 


2. D.C. Characteristics: 
Although both versions of the Intel486 SX Micro- 
processor have the same A.C. Characteristics, 
the D.C. Characteristics are different as listed be- 
low. 
Parameter 
Icc max(mA) 
@16MHz 
@20MHz 
@25MHz 
@33MHz 


PGA 
525 
600 
700 
780 


PQFP 
450 
500 
560 
575 


CIN(pF) 
Input Capacitance 
20 
10 
Co (pF) 
I/O Capacitance 
20 
10 
CCLK(pF) 
ClK Capacitance 
20 
6 
3. Power Down Mode: 
This mode is available in the Intel486 SX Micro- 
processor in PQFP only. See section 2.0.1 for de- 
tailed information. The OverDrive Processor is 
supported by the Intel486 SX Microprocessor in 
PQFP only. See section 2.0.1 and 6.6.2.1. 


4. Boundary Scan (JTAG): 


The boundary scan feature is supported in the In- 
tel486 SX Microprocessor in PQFP only. See sec- 
tion 8.5. 
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The Intel OverDrive Processor is essentially an en- 
hanced Intel486 Microprocessor. There are three 
functional differences between the Intel OverDrive 
Processor and Intel486 Microprocessors. First, the 
Intel OverDrive Processor has an internal clock dou- 
bling circuit which decreases the time required to 
execute instructions. Second, the Intel OverDrive 
Processor does not support the JTAG boundary 
scan test feature (available with the PQFPversion of 
the Intel486 SX Microprocessor). Third, the Intel 
OverDrive Processor has a different CPU revision 
identification than the Intel486 SX or Intel486 OX 
CPUs. These three differences are described in the 
following sections according to how they effect the 
CPU functionality. Additional OverDrive Processor 
specific information is provided in the "Intel Over- 
Drive Processor for Intel486 SX Microprocessor- 
Based Systems," order number 290436-001. 


The Intel OverDrive Processor bus has been de- 
signed to be identical with the Intel486 Microproces- 
sor bus. Although the external clock is internally 
doubled and data and instructions are manipulated 
in the CPU core at twice the external frequency, the 
external bus is functionally identical with the Intel486 
CPU. 


The four boundary scan test signals (TCK, Test 
clock; TMS, Test Mode select; TDI, Test Data Input; 
TOO, Test Data Output), defined for the PQFP Intel 
486 SX CPU, are not specified for the Intel Over- 
Drive Processor. 


The UP# (Upgrade Present) signal, which is defined 
as an input for the PQFP Intel486 SX CPU, is an 
output signal on the Intel OverDrive Processor. The 
UP# pin on the Intel OverDrive Processor provides 
a logical low output signal which can be used to en- 
able logic to recognize and configure the system for 
the Intel OverDrive Processor. 


The OX register always contains the component 
identifier at the conclusion of RESET. The Intel 
OverDrive Processor has a different revision identifi- 
er in the DL register than the Intel486 SX or Intel486 
OX Microprocessors. When the OverDrive Proces- 
sor is installed in a system the component identifier 
is supplied by the OverDrive Processor, rather than 
the original CPU. The stepping identification portion 
of the component identification will change with dif- 
ferent revisions of the OverDrive Processor. The de- 
signer should only assume that the component iden- 
tification for the OverDrive Processor will be 043xH, 
where 'x' is the stepping identifier. 


As detailed in Section 13.1,the Intel OverDrive Proc- 
essor does not support the JTAG boundary scan 
testability feature. 


13.3 
Instruction 
Set Summary 


The Intel OverDrive Processor supports all Intel486 
extensions to the 
8086/80186/80286 
instruction 


set. In general, instructions will execute faster on the 
Intel OverDrive Processor than the Intel486 Micro- 
processor. Specifically, an instruction that only uses 
memory from the on-chip cache executes at the full 
core clock rate while all bus accesses execute at the 
bus clock rate. To calculate the elapsed time of an • 
instruction, the number of clock counts for that in- 
struction must be multiplied by the clock period for 
the system. The instruction set clock count summary 
tables from Section 10.0 can be used for the Over- 
Drive Processor with the following modifications: 
- 
Clock counts for a cache hit: This value repre- 
sents the number of internal CPU core clocks for 
an instruction that requires no external bus ac- 
cesses or the base core clocks for an instruction 
requiring external bus accesses. 


- 
Penalty clock counts for a cache miss: This value 
represents the worst-case approximation of the 
additional number of external clock counts that 
are required for an instruction which must access 
the external bus for data (a cache miss). This 
number must be multiplied by 2 to convert it to 
an equal number of internal CPU core clock 
counts and added to the base core clocks to 
compute the total number of core clocks for this 
instruction. 


The actual number of core clocks for an instruction 
with a cache miss may be less than the base clock 
counts (from the cache hit column) plus the penalty 
clock counts (2 times the cache miss column num- 
ber). The clock counts in the cache miss penalty 
column can be a cumulative value of external bus 
clocks (for data reads) and internal clocks for manip- 
ulating the data which has been loaded from the ex- 
ternal bus. The number of clocks which are related 
to external bus accesses are correctly represented 
in terms of internal core clocks by multiplying by two. 
However, the clock counts related to internal data 
manipulation should not be multiplied by two. There- 
fore the total number of CPU core clock counts for 
an instruction with a cache miss represents a worst- 
case approximation. 


To calculate the execution time for an OverDrive 
Processor instruction, multiply the total CPU core 
clock counts by the core clock period. For example, 
in a 25 MHz system the core clock period is 50 ns 
(1/50 
MHz). 


Additionally, 
the 
assumptions 
specified 
below 
should be understood in order to estimate instruc- 
tion execution time. 


A cache miss will force the OverDrive Processor to 
run an external bus cycle. The OverDrive Processor 
32-bit burst bus is defined as r-b-w. 


Where: 
r = 
The number of bus clocks in the first cycle of a 
burst read or the number of clocks per data 
cycle is a non-burst read. 
b = 
The number of bus clocks for the second and 
subsequent cycles in a burst read. 
w = The number of bus clocks for a write. 


The fastest bus the OverDrive Processor can sup- 
port is 2 - 1- 2 assuming 0 waits states. The clock 
counts in the cache miss penalty column assume a 
2 - 1- 2 bus. For slower busses add r- 2 clocks to 
the cache miss penalty for the first dword accessed. 
Other factors also affect instruction clock counts. 


Instruction Clock Count Assumptions 
1. The external bus is available for reads or writes at 
all times. Else add bus clocks to reads until the 
bus is available 
2. Accesses are aligned. Add three core clocks to 
each misaligned access. 
3. Cache fills complete before subsequent accesses 
to the same line. If a read misses the cache dur- 
ing a cache fill due to a previous read or prefetch, 
the read must wait for the cache fill to complete. If 
a read or write accesses a cache line still being 
filled, it must wait for the fill to complete. 


4. If an effective address is calculated, the base reg- 
ister is not the distination register of the preceding 
instruction. If the base register is the destination 
register of the preceding instruction add 1 to the 
core clock counts shown. Back-to-back PUSH 
and POP instructions are not affected by this rule. 
5. An effective address calculation uses one base 
register and does not use an index register. 
However, if the effective address calculation 
uses an index register. 1 core clock may be add- 
ed to the clock shown. 


6. The target of a jump is in the cache. If not, add r 
clocks for accessing the destination instruction 
of a jump. If the destination instruction is not 
completely contained in the first dword read, add 
a maximum of 3b bus clocks. If the destination 
instruction is not completely contained in the first 
16 byte burst, add a maximum of another r+ 3b 
bus clocks. 
7. If no write buffer delay, w bus clocks are added 
only in the case in which all write buffers are full. 


8. Displacement and immediate not used together. 


If displacement and immediate used together, 1 
core clock may be added to the core clock count 
shown. 
9. No invalidate cycles. Add a delay of 1 bus clock 
for each invalidate cycle if the invalidate cycle 
contends for the internal cache/external 
bus 
when the OverDrive Processor needs to use it. 


10. Page translation hits in TLB. A TLB miss will add 
13, 21 or 28 bus clocks + 1 possible core clock 
to the instruction depending on whether the Ac- 
cessed and/or Dirty bit in neither, one or both of 
the page entries needs to be set in memory. This 
assumes that neither page entry is in the data 
cache and a page fault does not occur on the 
address translation. 
11. No exceptions are detected during instruction 
execution. Refer to interrupt core Clock Counts 
Table for extra clocks if an interrupt is detected. 
12. Instructions that read multiple consecutive data 
items (i.e., task switch, POPA, etc.) and miss the 
cache are assumed to start the first access on a 
16-byte boundary. If not, an extra cache line fill 
may be necessary which may add up to (r+ 3b) 
bus clocks to the cache miss penalty. 


The following should be considered when designing 
a system for upgrade with an Intel OverDrive Proc- 
essor. 


13.4.1 Intel OverDrive 
PROCESSOR 
DETECTION 


The 
component 
identifier 
and 
stepping/revision 
identifier for the Intel OverDrive Processor is read- 
able in DH and DL registers respectively, immediate- 
ly after RESET, where 


DH = 04h 


DL = 30h-3Fh. 


As it is difficult to differentiate between the Intel486 
DX CPU and the Intel OverDrive Processor in soft- 
ware, it is recommended that the BIOS save the 
contents of the DX register, immediately after RE- 
SET, so that this information can be used later, if 
required, to identify an Intel OverDrive Processor in 
the system. 


NOTE: 
Initialization routines for Intel486 SX CPU systems 
should check for the presence of a floating point 
unit and set the CROregister accordingly. 


intel~ 


The Intel OverDrive Processor executes instructions 
at twice the frequency of the input clock. Thus, soft- 
ware (or instruction based) timing loops will execute 
faster on the Intel OverDrive Processor than on the 
Intel486 DX or Intel486 SX CPU (at the same input 
clock frequency). Instructions such as NOP, LOOP, 
and JMP $+ 2, have been used by BIOS to imple- 
ment timing loops that are required, for example, to 
enforce recovery time between consecutive access- 
es for I/O devices. These instruction based timing 
loop implementations may require modification for 
systems intended to be upgradable with the Intel 
OverDrive Processor. 


In order to avoid any incompatibilities, it is recom- 
mended that timing requirements be implemented in 
hardware rather than in software. This provides 
transparency and also does not require any change 
in BIOS or I/O device drivers in the future when 
moving to higher processor clock speeds. As an ex- 
ample, a timing routine may be implemented as fol- 
lows: The software performs a dummy I/O instruc- 
tion to an unused I/O port. The hardware for the bus 
controller logic recognizes this I/O instruction and 
delays the termination of the I/O cycle to the CPU 
by keeping RDY# or BRDY# deasserted for the ap- 
propriate amount of time. 


13.5 Thermal Management 


The heat generated by the Intel OverDrive Proces- 
sor requires that heat dissipation be managed care- 
fully. The 25 MHz, 33 MHz Intel OverDrive Processor 
is supplied with a heat sink attached with adhesive 
to the package. 25 MHz, 33 MHz system designs 
must, therefore, provide space for the heat sink on 
the Intel OverDrive Processor. 


13.5.1 THE Intel OverDrlve™ 
PROCESSOR 
WITH ATTACHED HEAT SINK 


The heat sink for the Intel OverDrive Processor is 
adhesively attached to the standard 169-lead, PGA 
package. Figure 13.1 shows a drawing of the Intel 
OverDrive Processor with the heat sink (see Table 
13.1 for dimensions). 


The maximum and minimum dimensions for the 
169-pin, PGA package with heat sink are shown in 
Table 13.1. 


Table 13.1. Intel OverDrlve™ Processor, 
169-Pin, PGA Package Dimensions 
with Heat Sink Attached 


Dimension (Inches) 
Mln 
Max 


A. Heat Sink Width 
1.520 
1.550 


B. PGA Package Width 
1.735 
1.765 


C. Heat Sink Edge Gap 
0.065 
0.155 


D. Heat Sink Height 
0.212 
0.260 
E. Adhesive Thickness 
0.008 
0.012 


F. Package Height from Stand- 
0.140 
0.180 
Ofts 


G. Total Height from Package 
0.360 
0.452 
Stand-Ofts to Top of Heat 
Sink 


The standard product markings and logo for the Intel 
OverDrive Processor with the attached heat sink will 
be included on a 1 in2 plate located on the top, cen- 
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ter of the heat sink. The heat sink is omni-directional 
which 
allows 
the 
air to flow 
from 
any direction 
to 
achieve 
adequate 
cooling. 
The 
thermal 
resistance 
values 
for 
the 
Intel 
OverDrive 
Processor 
with 
at- 
tached 
heat sink are shown 
in Table 
13.2. 


Table 13.2. Thermal 
Resistance 
for the Intel 
OverDrlve™ 
Processor 
with Attached 
Heat Sink 


6JS = 
2.5·C/W 
0 
800 


5.7 


NOTE: 
'The thermal resistance from junction to· ambient (9JA> in 
static air is actually a linear function of power dissipation. 
The value shown in the table (14.0'C/W) 
represents the 
worst case expected value which is derived from a power 
dissipation of 2.9W. The maximum expected power dissipa- 
tion of 6W yields a 9JA value of 13.1'C/W. 


13.5.2 
Intel OverDrlve™ 
PROCESSOR 
WITHOUT 
HEAT 
SINK 


The 20 MHz Intel OverDrive 
Processor, 
for 16 MHz 
and 20 MHz i486 SX CPU systems, 
is supplied 
with- 
out a heat sink. Table 
13.3 contains 
the thermal 
re- 
sistance 
values 
for the 
Intel 
OverDrive 
Processor 
without 
heat sink. 


Table 13.3. Thermal 
Resistance 
for the Intel 
OverDrlve™ 
Processor 
without 
Heat Sink 


6JC = 
2.0·C/W 
0 


6JA("C/W) 
19.0' 


800 


10.0 


NOTE: 
'The thermal resistance from junction to ambient (9JA) in 
static air is actually a linear function of power dissipation. 
The value shown in the table (19.0'C/W) 
represents the 
worst case expected value which is derived from a power 
dissipation of 2.9W. The maximum expected power dissipa- 
tion of 4W yields a 0JA value of 18.5'C/W. 


The methodology 
for calculating 
the heat dissipation 
for the 20 MHz Intel OverDrive 
Processor 
(without 
a 
heat sink) and the 25 MHz Intel OverDrive 
Processor 
(with a heat sink) are identical 
except 
that the refer- 
ence point for measuring 
the product 
temperature 
is 
different. 
The 
20 
MHz 
Intel 
OverDrive 
Processor 
specifies 
Tcase (the temperature 
at the outside 
cen- 
ter of the PGA package, 
opposite 
the pins) and 6JC 
(the thermal 
resistance 
from 
the silicon 
junction 
to 
the 
package 
case) 
but the 
25 MHz, 33 MHz 
Intel 
OverDrive 
Processor 
specifies 
Tsink· (the tempera- 
ture at the outside 
center 
base of the heat sink, not 
on the heat sink rilarking 
plate or cooling 
posts) and 
6JS (the thermal 
resistance 
from the silicon junction 
to the 
heat 
sink base). 
The 
relationships 
between 
temperature, 
thermal 
resistance 
and 
power 
are 
shown 
in the following 
equations: 


Tcase = Tambient + (Pmax ' 9CA) and 


Tsink = Tambient + (Pmax ' 0SA) 


where, 


Tambient = Ambient Temperature 


Pmax = Power (Icc ' Ycel. 


9CA = 0JA - 
9JC. 


9SA = 9JA - 
9JS· 
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The following sections describe recommended elec- 
trical connections for the Intel486 SX microproces- 
sor/lntel 
OverDrive Processor, and its electrical 


specifications. Because allintel486 SX microproces- 
sor designs should accommodate either the Over- 
Drive Processor or the Intel487 SX Math CoProces- 
sor, the specifications given for the OverDrive Proc- 
essor are the worst case values from the Intel487 
SX Math CoProcessor and OverDrive Processor. 


The Intel486 SX microprocessor/lntel487 
SX Math 


CoProcessor is implemented in CHMOS IV technol- 
ogy and has modest power requirements. The Intel 
OverDrive Processor is implemented in CHMOS V 
technology. However, the high clock frequency out- 
put buffers can cause power surges as multiple out- 
put buffers drive new signal levels simultaneously. 
For clean on-chip power distribution at high frequen- 
cy, 24 Vcc and 28 Vss pins feed the Intel486 SX 
microprocessor/lntel OverDrive Processor. 


Power and ground connections must be made to all 
external Vcc and GND pins of the Intel486 SX mi- 
croprocessor/lntel OverDrive Processor. On the cir- 
cuit board, all Vcc pins must be connected on aVec 
plane. All Vss pins must be likewise connected on a 
GND plane. 


14.1.2 POWER 
DECOUPLING 
RECOMMENDATIONS 


Liberal decoupling capacitance should be placed 
near the Intel486 SX microprocessor/lntel 
Over- 


Drive Processor. The Intel486 SX microprocessor/ 
Intel OverDrive Processor driving its 32-bit parallel 
address and data busses at high frequencies can 
cause transient power surges, particularly when driv- 
ing large capacitive loads. 


Low inductance capacitors and interconnects are 
recommended for best high frequency electrical per- 
formance. Inductance can be reduced by shortening 
circuit board traces between the Intel486 SX micro- 
processor/Intel 
OverDrive Processor and decou- 


piing capacitors as much as possible. Capacitors 
specifically for PGA packages are also commercially 
available. 


14.1.3 OTHER 
CONNECTION 
RECOMMENDATIONS 


For reliable operation, always connect unused in- 
puts to an appropriate signal level. Active LOW in- 
puts should be connected to Vcc through a pullup 
resistor. Pullups in the range of 20 Kn are recom- 
mended. Active HIGH inputs should be connected to 
GND. 


Table 14.1 is a stress rating only, and functional op- 
eration at the maximums is not guaranteed. Function 
operating conditions are given in 14.3 D.C. Specifi- 
cations and 14.4 A.C. Specifications. 


Extended exposure to the Maximum Ratings may af- 
fect device reliability. Furthermore, although the 
Intel486 SX microprocessor/lntel OverDrive Proces- 
sor contains protective circuitry to resist damage 
from static electric discharge, always take precau- 
tions to avoid high static voltages or electric fields. 


• 


Table 
14.1. Absolute 
Maximum 
Ratings 


Case Temperature 
under Bias 
- 65·C to + 11O·C 
Voltage 
on Any Pin with 
Storage Temperature 
-65·C 
to + 150·C 
Respect 
to Ground 
-0.5 
to VCC + 0.5V 


Supply Voltage 
with 
Respect 
to Vss 
-0.5Vto 
+6.5V 


14.3 D.C. Specifications 
Functional 
Operating 
Range: Vcc 
= 5V ±5%; 
(Note 
1) 


Table 
14.2. Intel486™ 
SX Microprocessor 
D.C. Parametric 
Values 
(for PGA package) 


Symbol 
Parameter 
Mln 
Unit 
Notes 


VIL 
Input low 
Voltage 
-0.3 
V 


VIH 
Input High Voltage 
2.0 
V 


VOL 
Output low 
Voltage 
V 
(Note 2) 


VOH 
Output High Voltage 
V 
(Note 3) 


Icc 
Power Supply Current (16 MHz) 
mA 
(Note 4) 
Power Supply Current (20 MHz) 
Power Supply Current (25 MHz) 
Power Supply Current (33 MHz) 


ICCF 
Power Supply Current with Intel 
mA 
(Note 4) 
Microprocessor 
Tri-Stated 
(FI 
ClK = 16 MHz 
395 
ClK = 20 MHz 
430 
ClK = 25 MHz 
530 
ClK = 33 MHz 
545 


III 
Input leakage 
CUFfen 
±15 
p.A 
(Note 5) 


IIH 
Input leakage 
Current 
200 
p.A 
(Note 6) 


IlL 
Input leakage 
Current 
-400 
p.A 
(Note 7) 


ILO 
Output 
leakage 
C 
t 
±15 
p.A 


CIN 
Input Capacita 
20 
pF 
Fc = 1 MHz(8) 


Co 
I/O or Output 
acitance 
20 
pF 
Fc = 1 MHz(8) 


CCLK 
ClK 
Capacitance 
20 
pF 
Fc = 1 MHz(8) 


NOTES: 
1. The functional 
operating 
temperature 
range 
is: 
Intel486 
SX CPU, TCASE = O'C to + 85'C. 


Intel487 
SX MCP, TCASE = O'C to + 85'C. 


OverDrive 
Processor-16 
MHz, 20 MHz, TCASE = O'C to +95'C. 
OverDrive 
Processor-25 
MHz, 33 MHz, TSINK = OOCto + 85·C. 


2. This parameter 
is measured 
at: 


Address, 
Data, BEn 
4.0 mA 
Definition, 
Control 
5.0 mA 
3. This parameter 
is measured 
at: 


Address, 
Data, 
BEn 
- 1.0 mA 
Definition, 
Control 
-0.9 
mA 
4. Typical 
supply 
current: 


Ice 
@ 16 MHz = 340 mA 
(Normal 
Operation) 


@ 20 MHz = 450 mA 


@ 25 MHz = 550 mA 


@ 33 MHz = 590 mA 
ICCF 
@ 16 MHz = 265 mA 
i486 SX CPU Tri-stated 
(Floating) 


@ 20 MHz = 375 mA 


@ 25 MHz = 470 mA 


@ 33 MHz = 565 mA 
5. This parameter 
is for inputs 
without 
pullups 
or pulldowns 
and 0 ,;; VIN ,;; VCC. 


6. This parameter 
is for inputs 
with pulldowns 
and VIH = 2.4V. 


7. This parameter 
is for inputs 
with pull ups and VIL = 0.45V. 


8. Not 100% 
tested. 


Table 14.3.Intel486TMSX Microprocessor D.C.Parametric Values (for PQFPpackage) 


Symbol 
Parameter 
Min 
Max 
Unit 
Notes 


Vil 
Input low 
Voltage 
-0.3 
V 


VIH 
Input High Voltage 
2.0 
V 


VOL 
Output low 
Voltage 
V 


VOH 
Output High Voltage 
V 


Icc 
Power Supply Current 
ClK 
= 16 MHz 
ClK 
= 20 MHz 
ClK 
= 25 MHz 
ClK 
= 33 MHz 


(Note 1) 


(Note 2) 


(Note 3) 


ICCF 
Power Supply Current 
mA 
(Note 7) 


CPU in Power Down 
• 


III 
Input leakage 
curfent 
±15 
,..,A 
(Note 4) 


IIH 
Input leakage 
Current 
200 
,..,A 
(Note 5) 


III 
Input leakage 
Curr 
t 
-400 
,..,A 
(Note 6) 


ILO 
Output leakag 
±15 
,..,A 


CIN 
Input Capacit 
10 
pF 
FC = 1 MHz(7) 


Co 
1/0 or Output Capacitance 
10 
pF 
FC = 1 MHz(7) 


CClK 
ClK 
Capacitance 
6 
pF 
Fc = 1 MHz(7) 


NOTES: 
1. This parameter is measured at: 


Address, Data, BEn 
4.0 mA 
Definition, Control 
5.0 mA 
2. This parameter is measured at: 


Address, Data, BEn 
-1.0 
mA 
Definition, Control 
-0.9 
mA 
3. Typical supply current: 
Ice 280 mA @ ClK = 16 MHz 
(Normal Operation) 
340 mA @ ClK = 20 MHz 
430 mA @ ClK = 25 MHz 
590 mA @ ClK = 33 MHz 


4. This parameter is for inputs without pullups or pulldowns and 0 ,;; VIN ,;; Vee. 
5. This parameter is for inputs with pulldowns and VIH = 2.4V. 
6. This parameter is for inputs with pullups and VIL = 0.45V. 
7. Not 100% tested. 


Intel486TM 
SX MICROPROCESSOR 
intel® 


Table 14.4. Intel OverDrlve™ Processor D.C. Parametric Values 


Symbol 
Parameter 
Mln 
Max 
Unit 
Notes 


VIL 
Input low 
Voltage 
-0.3 
+0.8 
V 


VIH 
Input High Voltage 
2.0 
V 


VOL 
Output low 
Voltage 
V 
(Note 1) 


VOH 
Output High Voltage 
V 
(Note 2) 


Icc 
Power Supply Current 
(Note 3) 


ClK 
= 16 MHz 
mA 
ClK 
= 20 MHz 
ClK 
= 25 MHz 
ClK 
= 33 MHz 


III 
Input leakage 
Cu 
/LA 
(Note 4) 


IIH 
Input leakage 
Cur 
200 
/LA 
(Note 5) 


IlL 
Input leakage 
Curre 
-400 
/LA 
(Note 6) 


'La 
Output leakage 
Cur 
±15 
/LA 


CIN 
Input Capac it 
20(8) 
pF 
Fc = 1 MHz(7) 


Co 
I/O or Out 
apacitance 
20(8) 
pF 
Fc = 1 MHz(7) 


CCLK 
ClK 
Capacitance 
20(8) 
pF 
Fc = 1 MHz(7) 


NOTES: 
1. This parameter"is measured at: 


Address, Data, BEn 
4.0 mA 
Definition, Control 
5.0 mA 
2. This parameter is measured at: 


Address, Data, BEn 
- 1.0 mA 
Definition, Control 
- 0.9 mA 
3. Typical supply current: 


Icc 
@ 16 MHz = 525 mA 


@ 20 MHz = 625 mA 


@ 25 MHz = 775 mA 


@ 33 MHz = 975 mA 
4. This parameter is for inputs without pullups or pulldowns and 0 ,;; VIN ,;; Vcc. 
5. This parameter is for inputs with pulldowns and VIH = 2.4V. 
6. This parameter is for inputs with pullups and VIL = 0.45V. 
7. Not 100% tested. 
8. The Intel487 SX Math CoProcessor is not offered at 33 MHz. 
The following are the capacitance specifications for the OverDrive Processor at 33 MHz: 


CIN = 13pF 
Co=17pF 
CCLK = 15 pF 


Intel OverDrive Processor output delays are speci- 
fied with minimum and maximum limits, measured as 
shown. The minimum Intel486 SX microprocessorl 
Intel OverDrive Processor delay times are hold times 
provided to external circuitry. Intel486 SX microproc- 
essor/lntel 
OverDrive Processor input setup and 
hold times are specified as minimums, defining the 
smallest acceptable sampling window. Within the 
sampling window, a synchronous input signal must 
be stable for correct Intel486 SX microprocessorl 
Intel OverDrive Processor operation. 


The A.C. specifications, given in Tables 14.5, 14.6 
and 14.7 consist of output delays, input setup re- 
quirements and input hold requirements. All A.C. 
specifications are relative to the rising edge of the 
ClK signal. 


A.C. specifications measurement is defined by Fig- 
ures 14.1-14.3. Inputs must be driven to the voltage 
levels indicated by Figure 14.3 when A.C. specifica- 
tions are measured. Intel486 SX microprocessorl 


Table 14.5. 16 MHz Intel486TM SX Microprocessorl 
Intel OverDrive™ 
Processor 
A.C. Characteristics 
Vcc = 5V ±5010;(Note 1); CL = 50 pF unless otherwise specified 


Symbol 
Parameter 
Min 
Max 
Unit 
Figure 
Notes 


Frequency 
8 
16 
MHz 
1X Clock 


t1 
ClK Period 
62.5 
125 
ns 
13.1 


t1a 
ClK Period Stability 
0.1% 
Adjacent Clocks 


t2 
ClK High Time 
20 
ns 
13.1 
at2V 


t3 
ClKlowTime 
20 
ns 
13.1 
atO.8V 
tI 
ClKFaliTime 
8 
ns 
~~.1 
2V to 0.8V 


ts 
ClK Rise Time 
8 
~ 
13.1 
0.8Vt02V 


ts 
A2-A31, PWT, PCD, BEO-3#, 
3 
26 
~-1'Js 
13.5 
M/IO#, 
D/C#, 
W/R#, ADS#, 
lOCK#,FERR#,BREQ,HlDA 


~ 
~~. f' 


Valid Delay 
~~ 


t7 
A2-A31, PWT, PCD, BEO-3#" ~~ 
ns 
13.6 
After Clock Edge(2) 
M/IO#, 
D/C#, 
W/R#, A~~ 
.• 
,,*':,:v 
lOCK # Float Delay 
,. 


te 
PCHK# Valid Delay A',-'- 
"'. 
'A~~""l'35 
ns 
13.4 


tea 
BLAST#, PlOCK#'Valrd Del~~' 
,~ 
35 
ns 
13.5 


t9 
BLAST#, PLOCK# Float~ 
'-: 
42 
ns 
13.6 
After Clock Edge(2) 


t10 
DO-D31'DPO~oa.tv~. 
3 
30 
ns 
13.5 
Delay 


t11 
00-031, 
DPO- 
e Data Float 
42 
ns 
13.6 
After Clock Edge(2) 
Delay 


t12 
EADS# Setup Time 
12 
ns 
13.2 


t13 
EADS# Hold Time 
3 
ns 
13.2 


t14 
KEN#, BS16#, BS8# Setup Time 
12 
ns 
13.2 


t1S 
KEN#, BS16#, BS8# Hold Time 
3 
ns 
13.2 


t16 
ROY#, BRDY# Setup Time 
12 
ns 
13.2 


t17 
ROY#, BRDY# Hold Time 
3 
ns 
13.2 


t1e 
HOLD, AHOLD, BOFF# Setup Time 
12 
ns 
13.2 


t19 
HOLD, AHOlD, BOFF# Hold Time 
3 
ns 
13.2 


• 


Table 
14.5. 16 MHz Intel486™ 
SX Mlcroprocessorl 
Intel OverDrlve™ 
Processor 
A.C. Characteristics 
(Continued) 


VCC = 5V ±5%; 
(Note 
1); CL = 50 pF unless otherwise 
specified 


Symbol 
Parameter 
Min 
Unit 


t20 
RESET,FlUSH#,A20M#,NMI, 
14 
ns 
INTR, IGNNE # Setup Time 


t21 
RESET, FlUSH#, 
A20M#, 
NMI, 
ns 
INTR,IGNNE# 
Hold Time 


t22 
DO-D31, 
DPO-3, 
A4-A31 
Rea 
ns 
Setup Time 


t23 
DO-D31, 
DPO-3, 
A4-A$1 
Read 
3 
ns 
Hold Time 


Figure 


13.2 


NOTE: 
1. Intel486 SX CPU, T.cASE= O°Cto + 8WG 
Intel487 SX MCP, TCASE= O°Cto +8SoC 
OverDrive Processor, TCASE= O°Cto + 9SoC 
2. Not 100% tested, guaranteed by design characterization. 


Table 
14.6.20 
MHz Intel486™ 
SX Microprocessor/ 
Intel OverDrive™ 
Processor 
A.C. Characteristics 


Vcc 
= 5V ±5%; 
(Note 
1); CL = 50 pF unless otherwise 
specified 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


Frequency 
8 
20 
MHz 
1X Clock 


t1 
ClK 
Period 
50 
125 
ns 
' 
13.1 


t1a 
ClK 
Period Stability 
0.1% 
Adjacent 
Clocks 


t2 
ClK 
High Time 
16 
ns 
,:~1 
at2V 


t3 
ClKlowTime 
16 
ns,.. 
"'13.1 
atO.8V 


~ 
ClK 
Fall Time 
It, 8 
.•..us 
13.1 
2V to 0.8V 


ts 
ClK 
Rise Time 
{( .'\ 
6 
~s 
13.1 
0.8Vto 
2V 


ts 
A2-A31, 
PWT, PCD, BEO-3#, 
~~~.y 
~~ 
~~ 
13.5 
M/IO#, 
D/C#, 
W/R#, 
ADS#, 


~ 
&< 
lOCK#, 
FERR#, 
BREQ, 
• 


HlDA 
Valid Delay 
~ 
.ff1 " 
t7 
A2-A31, 
PWT, PCD, BE(5~, 
~.- [~,37 
ns 
13.6 
After Clock Edge(2) 


M/IO#, 
D/C#, 
W/RI 
AOS~/:~ 
...• 


lOCK# 
Float Delay 
,. 
... )\ 


t8 
PCHK# 
Valid Delay 
~<'~~ 3'" 
28 
ns 
13.4 


t8a 
BLASH, 
PlOCK# 
Varl&Oel,~ 
3 
28 
ns 
13.5 


t9 
BLAST#, 
PlOC\S.~at 
Delay 
37 
ns 
13.6 
After Clock Edge(2) 


t10 
DO-D31, 
DPO 
~Wfite 
Data 
3 
26 
.ns 
13.5 
Valid Delay 


t11 
DO-D31, 
DPO-3 
Write Data 
37 
ns 
13.6 
After Clock Edge(2) 
Float Delay 


t12 
EADS # Setup Time 
10 
ns 
13.2 


t13 
EADS# 
Hold Time 
3 
ns 
13.2 


Table 
14.6.20 
MHz Intel486TM 
SX Mlcroprocessorl 
Intel OverDrlve™ 
Processor 
A.C. Characteristics 
(Continued) 


Vcc = 5V ±5%; 
(Note 
1); CL = 50 pF unless otherwise 
specified 


Symbol 
Parameter 
Min 
Max 
Unit 
Figure 
Notes 


t14 
KEN#, 
BS16#, 
BS8# 
Setup Time 
10 
ns 
13.2 


t15 
KEN#, 
BS16#, 
BS8# 
Hold Time 
3 
~ 
ns 
13.2 


t16 
RDY #, BRDY # Setup Time 
10 
ns 
13.2 


t17 
RDY #, BRDY # Hold Time 
,; 
3.• 
ns 
13.2 


t18 
HOLD, AHOlD, 
BOFF # Setup Tim§' 
(( 
ns 
13.2 


t19 
HOLD, AHOlD, 
BOFF # Hold Time 
3,' 
ns 
13.2 


t20 
RESET, FLUSH #, A20M 11 NMI 
~~ 
~ 
12 
ns 
13.2 
INTR, IGNNE # Setup Time 
•• 


t21 
RESET, FlUSH#, 
A20M#;~'~ 
3 
ns 
13.2 
INTR, IGNNE# 
Hold Ti~ 


t22 
DO-D31, 
DPO-3, 
A4<rA31 Read 
6 
ns 
13.2 
Setup Time 


t23 
DO-D31, 
DPO-3, 
A4-A31 
Read 
3 
ns 
13.2 
Hold Time 


NOTE: 
1. Intel486 SX CPU, TCASE = O·Cto + 85·C 
Intel487 SX MCP, TCASE = O·Cto +85·C 
OverDrive Processor, TCASE = ~·C to +95·C 
2. Not 100% tested, guaranteed by design characterization. 


Table 
14.7.25 
MHz Intel486TM 
SX Mlcroprocessorl 
Intel OverDrlve™ 
Processor 
A.C. Characteristics 


VCC = 5V ±5%; 
(Note 
1); CL = 50 pF unless otherwise 
specified 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


Frequency 
8 
25 
MHz 
1X Clock 


t1 
ClK 
Period 
40 
125 
n~ 
13.1 


t1a 
ClK 
Period Stability 
0.1% 
.••..~~ 
Adjacent 
Clocks 


t2 
ClK 
High Time 
14 
.~ 
-~ 
ns 
13.1 
at2V 


t3 
ClKlowTime 
1~ 
..,J. ~ 
ns 
13.1 
at 0.8V 
4 
ClK 
Fall Time 
'~~ ,'~ 
ns 
13.1 
2Vto 
0.8V 


t5 
ClK 
Rise Time 
..•~ 
¥ 
~tJ.. ,1'." 
ns 
13.1 
0.8V to 2V 


~ 
A2-A31, 
PWT, PCD, BEO-3:i: 
.40 
- 
6...• 19 
ns 
13.5 
MIIO#, 
D/C#, 
W/R#, 
ADS#t:~ ~V' 


lOCK#,FERR#,BRE~~ 
HlDA 
Valid Delay 


t7 
A2-A31, 
PWT, PCD, Bal-3#, 
28 
ns 
13.6 
After Clock Edge(2) 


M/IO#, 
D/C#, 
W/R#, 
ADS#, 


lOCK # Float Delay 


t8 
PCHK# 
Valid Delay 
3 
24 
ns 
13.4 


• 


Table 14.7.25 MHz Intel486TM SX Mlcroprocessorl 
Intel OverDrive™ Processor A.C. Characteristics 
(Continued) 


Vcc = 5V ±5%; (Note 1); Cl = 50 pF unless otherwise specified 


Symbol 
Parameter 
Mln 
Max 
Unit 


tea 
BLAST#, PLOCK# Valid Delay 
3 
24 


t9 
BLAST#, PLOCK# Float Delay 
28 


tlO 
DO-D31, DPO-3 Write Data Valid 
Delay 


DO-D31, DPO-3 Write Data Float 
Delay 


EADS# Setup Time 


EADS# Hold Time 


KEN#, BS16#, BS8# SetupTi 


KEN#, BS16#, BS8# Hold Tm 


RDY#, BRDY# Setup Ti 


RDY#, BRDY# Hold 


HOLD, AHOLD, BoFF# Set 


HOLD, AHOLD, BOFF# 


RESET, FLUSH#, AM, 
N 
INTR,~GNNE# S 
·me 


RESET, FLUS 
INTR,IG 
NE# 


DO-D31, DPO-3, A4-A31 Read 
Setup Time 


DO-D31, DPO-3, A4-A31 Read 
Hold Time 


NOTE: 
1. Intel486 
SX CPU, TCASE = O'C to + 85'C 
Intel487 
SX MCP, TCASE = O'C to + 85'C 
OverDrive 
Processor, 
TSINK = O'C to +85'C 
2. Not 100% 
tested, 
guaranteed 
by design 
characterization. 


Figure 


13.5 


13.6 


After Clock Edge(2) 


13.2 


13.2 


13.2 


13.2 


ns 
13.2 


ns 
13.2 


ns 
13.2 


ns 
13.2 


ns 
13.2 


ns 
13.2 


ns 
13.2 


ns 
13.2 


Table 
14.8.33 
MHz Intel486TM 
SX Mlcroprocessorllntel 
OverDrlve™ 
Processor 
A.C. Characteristics 


VCC = 5V ±5%; 
(Note 
1); CL = 50 pF unless otherwise 
specified 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


Frequency 
8 
33 
MHz 
1X Clock 


t1 
ClK 
Period 
30 
125 
ns 
13.1 


t1a 
ClK 
Period Stability 
0.1% 
Adjacent 
Clocks 


t2 
ClK 
High Time 
11 
ns 
13.1 
at2V 


t3 
ClK 
low 
Time 
11 
ns 
13.1 
atO.8V 


t4 
ClK 
Fall Time 
3 
ns 
13.1 
2Vto 
0.8V 


t5 
ClK 
Rise Time 
3 
ns 
13.1 
0.8Vto2V 


te 
A2-A31, 
PWT, PCO, BEO-3#, 
3 
16 
ns 
13.5 
M/IO#, 
O/C#, 
W/R#, 
AOS#, 
lOCK#, 
BREQ, HlOA 
Valid Delay 


t7 
A2-A31, 
PWT, PCO, BEO-3#, 
20 
ns 
13.6 
After Clock Edge(1) 
M/IO#, 
O/C#, 
W/R#, 
AOS#, 


lOCK # Float Delay 


t8 
PCHK# 
Valid Delay 
3 
22 
ns 
13.4 


t8a 
BLAST # , PLOCK # Valid Delay 
3 
20 
ns 
13.5 


t9 
BLAST # , PLOCK # Float Delay 
20 
ns 
13.6 
After Clock Edge(1) 


t10 
00-031, 
OPO-3 Write Data Valid 
3 
18 
ns 
13.5 
pelay 


t11 
00-031, 
OPO-3 Write Data Float 
20 
ns 
13.6 
After Clock Edge(1) 
Delay 


t12 
EAOS # Setup Time 
5 
ns 
13.2 


t13 
EAOS# 
Hold Time 
3 
ns 
13.2 


t14 
KEN#, 
BS16#, 
BS8# 
Setup Time 
5 
ns 
13.2 


t15 
KEN#, 
BS16#, 
BS8# 
Hold Time 
3 
ns 
13.2 


t16 
ROY #, BROY # Setup Time 
5 
ns 
13.3 


t17 
ROY #, BROY # Hold Time 
3 
ns 
13.3 


t18 
HOLD, AHOlO 
Setup Time 
6 
ns 
13.2 


t18a 
BOFF # Setup Time 
8 
ns 
13.2 


t19 
HOLD, AHOlO, 
BOFF # Hold Time 
3 
ns 
13.2 


t2D 
RESET, FlUSH#, 
A20M#, 
NMI, 
5 
ns 
13.2 
INTR Setup Time 


t21 
RESET, FLUSH #, A20M #, NMI, 
3 
ns 
13.2 
INTR Hold Time 


t22 
00-031, 
OPO-3, A4-A31 
Read 
5 
ns 
13.3 
Setup Time 


t23 
00-031, 
OPO-3, A4-A31 
Read 
3 
ns 
13.3 
Hold Time 


NOTE: 
1. Intel486 SX CPU, TCASE 
= D'C to +85'C 
Intel487 SX MCP, TCASE 
= to +85'C 
OverDrive Processor, TSINK = D'C to + 85'C 


• 


Table 14.9.lnteI486 SX Microprocessor 
A.C. Characteristics 
for Boundary Scan Test Inputs 
(available in PQFPversion only) 


VCC = 5V ±5%; 
TCASE = O·C to + 85·C; CL = 50 pF. All Inputs and Outputs 
are TTL Level. 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


t24 
TCK Frequency 
25 
MHz 
1X Clock 


t25 
TCK Period 
40 
ns 
(Note 2) 


t26 
TCK High Time 
40 
ns 
@2.0V 


t27 
TCK Low Time 
40 
ns 
@0.8V 


t28 
TCK Rise Time 
4 
ns 
(Note 1) 


t29 
TCKFaliTime 
4 
ns 
(Note 1) 


t30 
TDI, TMS Setup Time 
8 
ns 
13.7 
(Note 3) 


t31 
TDI, TMS Hold Time 
10 
ns 
13.7 
(Note 3) 


t32 
TOO Valid Delay 
3 
25 
ns 
13.7 
(Note 3) 


t33 
TOO Float Delay 
36 
ns 


t34 
All Outputs 
(Non-Test) 
Valid Delay 
3 
25 
ns 
13.7 
(Note 3) 


t35 
All Outputs 
(Non-Test) 
Float Delay 
36 
ns 
13.7 
(Note 3) 


t36 
All Inputs (Non-Test) 
Setup Time 
8 
ns 
13.7 
(Note 3) 


t37 
All Inputs (Non-Test) 
Hold Time 
7 
ns 
13.7 
(Note 3) 


NOTES: 
1. Rise/Fall 
times 
are measured 
between 
0.8V and 2.0V. Rise/Fall 
times can be relaxed 
by 1 ns per 10 ns increase 
in TCK 
period. 
2. TCK period 
;;0, CLK period. 
3. Parameter 
measured 
from TCK. 


BOFF-. 
AHOLD, 
[ 
HOLD 


RESET, FLUSH- 
A20W-,f:)~~~;J[ 
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NWI 


A4-A31 
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00-031 [ 
DPO-DP3 


• 


BLAST". PLOCK" 
[ 


CLK [ 


A2-A31. 
PWT.PeD. 
BEO-3". MilO". [ 
D/c". w/R". 
ADS". 
LOCK"{fDlll#.IBREO. 
HLDA 


DO-D31. DPO-3. [ 
(WRITE) 


BLAST". PLOCK" 
[ 


CLK [ 


A2-A31. 
PWT.PeD. 
BEO-3". MilO". 
D/c".w/R". ADS". 
[ 
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[ 
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____ 
--t35d 
-------X...--------r 


Figure 14.7.Test Signal Timing Diagram 
(Available In PQFP Version Only) 


14.4.1 Typical Output Valid Delay versus Load 
Capacitance Under Worst Case 
Conditions for Intel486 SX CPU/lntel 
OverDrive Processor 
... 


-=-S 
nom+4 
...o... 
:> 
l= 
:>o 
-'<l 
~ 


NOTE: 
This graph will not be linear outside 
of the CL range shown. 
nom=nominal 
value 
given 
in A.C. Characteristics 
table. 


14.4.2 Typical Output Valid Delay versus Load 
Capacitance Under Worst Case 
Conditions for Intel486 SX CPU 
(In PQFP) 
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-=- 
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d 
Q... 
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!:; 
o 
-' 
~ 
n 
~ 
* 
... 


75 
100 
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l1. (picofarads) 


NOTE: 
This graph will not be linear outside 
of the CL range 
shown. 
nom = nominal 
value 
given in A.C. Characteristics 
table. 


14.4.3 Typical Output Rise Time versus Load 
Capacitance Under Worst-Case 
Conditions 


7 


75 
100 
125 
150 


l1. (plcofanods) 


NOTE: 
This graph will not be linear outside 
of the CL range shown. 


14.5 
Designing for ICD·486 or 
ICETM·486 Using the ADAPT Pin 
Scrambler 
Board for the 
Intel486™ 
SX Microprocessor! 
Intel OverDrive 
Processor 
(Advance 
Information) 


The ICD-486 (In-Circuit Debugger) is a hardware as- 
sisted debugger for the Intel486 SX microprocessor/ 
Intel OverDrive Processor. The ICE-486 (In-Circuit 
Emulator) is an emulator for the Intel486 SX micro- 
processor/Intel 
OverDrive Processor. The ADAPT 
Pin Scrambler Board is a small circuit board which 
provides an Intel486 OX socket on top and can be 
jumpered to plug into an Intel486 SX, Intel487 SX or 
Intel486 OX socket. This Pin Scrambler Board is 
used to convert an Intel486 SX or Intel487 SX sock- • 
et into an Intel486 OX socket which the ICD-486 or 
ICE-486 can plug into. 


To use the ICD-486 or ICE-486, the Intel486 SX mi- 
croprocessor/Intel OverDrive Processor component 
must be removed from its socket and replaced with 
the Pin Scrambler Board with its jumpers set to the 
Intel486 SX microprocessor/Intel OverDrive Proces- 
sor position. The ICD-486 or ICE-486 can then be 
plugged into the ADAPT Pin Scrambler Board. Be- 
cause of the operating frequency of the Intel486 SX 
microprocessor/I ntel OverDrive Processor systems, 
there is no buffering of signals between the Intel486 
OX CPU in the ICD-486 or ICE-486 and the target 
system. A direct result of the non-buffered intercon- 
nect is that the ICD-486 or ICE-486 shares the ad- 
dress and data bus of the target system. In order for 
the ICD-486 or ICE-486 to function properly (without 
the Optional Isolation Board installed), the design of 
the target system must meet the following restric- 
tions: 
1. The bus controller must only enable data trans- 
ceivers onto the data bus during valid read cycles 
of the Intel486 SX microprocessor/Intel 
Over- 
Drive Processor. 


2. ROY# cannot be used with BREQ to terminate 
outstanding bus requests (I.e., when using the 
ICD-486 or ICE-486, BREQ will be asserted when 
there is not a corresponding assertion of ADS#). 
3. Before another bus master drives the local proc- 
essor address bus, the other bus master must 
gain access to the address bus through the use of 
HOLD/HLDA, AHOLD, or BOFF#. 


4. The user system must be able to drive one addi- 
tional CMOS load (approximately 25 pF) on all 
signals that go to the Intel486 OX emulation proc- 
essor. 


If the target system does not satisfy these restric- 
tions, the Optional Isolation Board (OIB) should be 
used to isolate the emulation processor from the tar- 
get system (see following section). 


In addition 
to the 
above 
restrictions, 
the 
ICD-486, 


ICE-486 and ADAPT 
Pin Scrambler 
Board have sev- 
eral 
electrical 
and 
mechanical 
characteristics 
that 


should 
be taken 
into consideration 
when 
designing 


the 
Intel486 
SX 
microprocessor/Intel 
OverDrive 


Processor 
system. 


ICD-486 adds up to 20 pF to each of the Intel486 SX 
microprocessor/Intel 
OverDrive 
Processor 
signals. 


For additional 
detail, refer to the ICD-486/33 
In-Cir- 


cuit 
Debugger 
Fact 
Sheet, 
literature 
number 
280872-003. 


ICE-486 adds up to 55 pF to the ClK 
signal, and up 


to 35 pF to each of the other Intel486 
SX microproc- 


essor/Intel 
OverDrive 
Processor 
signals. 
For addi- 


tional 
detail, 
refer 
to the 
i486 
In-Circuit 
Emulator 


Fact Sheet, 
literature 
number 
280894-001. 


In either 
case, 
the 
ADAPT 
Pin 
Scrambler 
Board 


adds 
up 
to 
an 
additional 
15 pF 
to 
each 
of 
the 


Intel486 
SX microprocessor/Intel 
OverDrive 
Proces- 


sor signals. 


ICD-486 
adds ± 15 /LA loading 
to the data bus sig- 


nals and ± 5 /LA loading to the address 
and control 


signals. For additional 
detail, refer to the ICD-486/33 


In-Circuit 
Debugger 
Fact 
Sheet, 
literature 
number 


280872-003. 


ICE-486 adds ± 15 /LA loading to the ClK 
and data 


bus signals 
and ± 5 /LA loading 
to the address 
and 


control 
signals. For additional 
detail, refer to the i486 


In-Circuit 
Emulator 
Fact 
Sheet, 
literature 
number 


280894-003. 


Power 
Requirements 


For noise 
immunity 
and CMOS 
latch-up 
protection 


the ICD-486 
and ICE-486 are powered 
by the target 


system 
through 
the 
power 
and ground 
pins of the 


Intel486 
SX microprocessor/Intel 
OverDrive 
Proces· 


sor socket. 


ICD-486 draws up to 1500 mA including 
the Intel486 
DX emulation 
processor 
current. 


ICE·486 draws up to 2200 mA including 
the Intel486 


DX emulation 
processor 
current. 


No Connects 


Pins specified 
as N.C. in the Intel486 
SX microproc- 


essor/Intel 
OverDrive 
Processor 
pin 
description 


must 
be 
left 
unconnected. 
Connection 
of 
any 
of 


these pins to power, ground, or any other signal may 
cause 
the processor, 
ICD-486, 
or ICE-486 
to mal- 


function. 


Intel486 
SX Microprocessor/Intel 
OverDrive 


Processor 
Location 
and Orientation 


The 
ICD-486, 
ICE-486, 
and ADAPT 
Pin Scrambler 


Board may require lateral clearance. 
Figures 
13.8a- 


13.8f 
show 
the 
clearance 
requirements 
of 
these 


products. 
A hinge 
cable 
is provided 
with 
both 
the 


ICD·486 
and 
ICE-486 
to 
assist 
in connecting 
to 


sockets 
in enclosed 
spaces. 
The hinge cable 
adds 


an additional 
15 pF of capacitive 
loading 
and 
ap- 


proximately 
0.5 ns of propagation 
delay to each sig- 


nal. 


OPTIONAL 
ISOLATION 
BOARD 
(OIB) 


Due to their 
unbuffered 
designs, 
the 
ICD·486 
and 


ICE-486 are susceptible 
to errors on the target 
sys- 


tem's 
bus. The OIB installs 
between 
the ICD-486 
or 


ICE-486 
and the ADAPT 
Pin Scrambler 
Board 
and 


allows 
the 
ICD-486 
or ICE-486 
to function 
in sys- 


tems with faults (Le., shorted 
signals). After electrical 


verification, 
the OIB may be removed. 
The OIB has 


the following 
electrical 
and mechanical 
characteris- 


tics: 


Buffer 
Characteristics 


The 
OIB 
buffers 
the 
address 
and 
data 
busses 
as 


well 
as the 
byte 
enables, 
ADS#<, W/R#<, 
M/IO#<, 


BLAST #<,HlDA 
and RDY #<.For details on A.C. and 


D.C. loading characteristics, 
refer to the ICD-486/33 


In-Circuit 
Debugger 
Fact 
Sheet, 
literature 
number 


280872-003, 
and the 
i486 
In-Circuit 
Emulator 
Fact 


Sheet, 
literature 
number 
280894-001. 
To guarantee 


proper 
operation 
with 
the 
OIB, 
the 
clock 
period 


should 
be increased 
by the round 
trip buffer 
delay 


(10 ns) unless the target 
system 
design already 
has 


enough 
timing margin. 


Unbuffered 
Signals 


Signals 
not 
listed 
above 
as 
buffered 
are 
passed 


through 
the OIB and will have additional 
capacitive 
loading 
due to the connectors 
and circuit 
board 
of 


up to 10 pF. 


inteJQP 


Power 
Requirements 


The OIB is also powered by the target system 
through the Intel486 SX microprocessor/lntel Over- 
Drive Processor socket and requires 500 mA in addi- 
tion to the ICD-486 or ICE-486 requirements. 


OIB Clearance 
Requirements 


The OIB requires an extra 0.85" of vertical clear- 
ance in the target system above the ADAPT Pin 
Scrambler Board. 


RELOCATABLE 
EXPANSION 
MEMORY 
(REM) 


BOARD 
. 


Power 
Requirements 


The REM is also powered by the target system 
through the Intel486 SX microprocessor/Intel Over- 
Drive Processor socket and requires 2100 mA in ad- 
dition to the ICD-486 or ICE-486 requirements. 


REM Clearance 
Requirements 


The REM requires an extra 0.85" of vertical clear- 
ance in the target system above the ADAPT Pin 
Scrambler Board. 
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Figure 14.8c. ICD·486 Probe Dimensions with Optional Isolation Board (OIB) 


and ADAPT Pin Scrambler Board Installed 
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Figure 14.8d. ICD·486 Probe Dimensions with Logic Analyzer Interface (LAI) Board 


and ADAPT Pin,Scrambler Board Installed 


--=-1 
-=---, II'~ 


i ~: ~~~~~~~~: 
' 
t 0.85" 


--=1 
===--, II'~' 


: 
>1 
: 
: ' 
2.SS" 
,_:_---> 


Figure 14.8f. ICE-486 
Probe Dimensions 
with Relocatable 
Expansion 
Memory 
Board, 


Optional Isolation Board (oIB) and ADAPT 
Pin Scrambler 
Board Installed 


• 


D--.~ 
0, 
• 
s, 


911.65 
@@@@@@@@@@@@ @@@@@ 


REF. 
@@ @@@ @@@@@@@@@@@@ 
@@@@@@@@@@@@@@@@@ 
@@@ 
@@@ 
@@@ 
@@@ 
@@@ 
-- 
@@@ 
@@@ 
@@@ 
@@@ 
( 
\ 
@@@ 
@@@ 
@@@ 
D 
@@@ 
@@@ 


PIN C3 
@@ @ 
'-...../ 
@@@ 
@@@ 
@@@ 
@@ 
@@@ 
@ 
@L.- 
---I@@@ 


@@ 
@@@@@@@@@@@ 
@@ 
@@@@@@@@@@@@@@@o@ 
L-@@@@@@@@@@@@@@@o@ 


~::: 
REF. 


45° 
CHAMFER 


(INDEX CORNER) 


SWAGGED 
PIN 
(4 Pl) 


Family: Ceramic Pin Grid Array Package 


Symbol 
Millimeters 
Inches 


Mln 
Max 
Notes 
Mln 
Max 
Notes 


A 
3.56 
4.57 
0.140 
0.180 


A1 
0.64 
1.14 
SOLID LID 
0.025 
0.045 
SOLID LID 


A2 
2.8 
3.5 
SOLID LID 
0.110 
0.140 
SOLID LID 


A3 
1.14 
1.40 
0.045 
0.055 


B 
0.43 
0.51 
0.017 
0.020 


0 
44.07 
44.83 
1.735 
1.765 


01 
40.51 
40.77 
1.595 
1.605 


91 
2.29 
2.79 
0.090 
0.110 


L 
2.54 
3.30 
0.100 
0.130 


N 
168 
168 


Sl 
1.52 
2.54 
0.060 
0.100 


ISSUE 
IWS 
REVX 
7/15/88 


intel~ 


SEATING 
PLANE 


918 (All 
PINS) 
Lf 


SWAGGED 


PIN 
DETAil 


Letter or 
Description 
of Dimensions 
Symbol 


A 
Distance 
from seating plane to highest point of body 


A1 
Distance 
between 
seating plane and base plane (lid) 


A2 
Distance 
from base plane to highest point of body 


A3 
Distance 
from seating plane to bottom of body 


B 
Diameter 
of terminal 
lead pin 


D 
Largest overall package 
dimension 
of length 


D1 
A body length dimension, 
outer lead center to outer lead center 


e1 
Linear spacing 
between 
true lead position 
centerlines 


L 
Distance 
from seating plane to end of lead 


81 
Other body dimension, 
outer lead center to edge of body 


NOTES: 
1. Controlling dimension: millimeter. 
2. Dimension "el" 
("e") is non-cumulative. 


3. Seating plane (standoff) is defined by P.C. board hole size: 0.0415-0.0430 inch. 
4. Dimensions "B", "Bl" 
and "C" are nominal. 


5. Details of Pin 1 identifier are optional. 
• 


. 
I.I••··@H·®-·®I·®I 


, 
-n-m 
\ 
'" 
0 
0 


" 
I•••• @I,I·®-·®I·®I 


! 
, 
" L 
-' 
• 
•·••·@I,I·®-·®I·®I 
-$- 
O.OOI@C 
A@-'@ 
.® 
f 
f 
,---------_.- ~. 
0 i 
Or 
') 


_..J 
'- -- 


~c 
·L. 
.. 


1.1•••.@I,!·®-·®I·®1 
" 


NOTES: 
1. Interpret 
dimensions 
and tolerances 
in accordance 
with ANSI Y14.5M-1982. 
2. Data enclosed 
in parentheses 
is for reference 
only. 


Figure 15.2a. Principal 
Dimensions 
and Data for 196 Lead PQFP Package 


Table 15.2. Symbol 
List and Dimensions 
for 
196 Lead Plastic Quad Flat Pack Package 


Symbol 
Description 
of Dimensions 
Mln 
Max 


A 
Package 
Height: 
0.160 
0.175 
Distance 
from the seating plane 
to the highest point of body. 


A1 
Standoff: 
0.020 
0.035 
The distance 
from the seating plane 
to the base plane. 


D,E 
Overall Package 
Dimension: 
1.470 
1.485 
Lead tip to lead tip. 


01, E1 
Plastic Body Dimension 
1.347 
1.353 


02, E2 
Bumper 
Distance 
Without 
FLASH 
1.497 
1.503 
With FLASH 
1.497 
1.510 


CP 
Seating 
Plane Coplanarity 
0.000 
0.004 


NOTES: 
1. All dimensions 
and tolerances 
conform 
to ANSI Y14.5M-1982. 


2. Dimensions 
are in inches. 


3. Data enclosed 
in parentheses 
is for reference 
only. 


I 


lO.,48i:O.007 
-J 
L1.363 
1.357 


(0.006) r 
(0.095)Lm 


~ 


0.009 --ll-- 
0.012 


2° 
ANGLE ON 
J SURFACE 
OF' THE 


8UlllPER 
BOTTOM 
HALF" 
• 


1 


_. 
-0 


• 
0, 
.~ 
• 
~ 


~M 
@@@@@@@@@@@@@@@@@ 
~ 
@@@@@@@@@@@@@@@@@ 
@@@@@@@@@@@@@@O@@ 
@@@ 
@@@ 
@@@ 
@@@ 
@@@ 
@@@ 
@@@ 
@@@ 
@@@ 
@@@ 
@@@ 
@@@ 
0 
@@@ 
@@@ 


PIN C3 
@@@ 
@@@ 
@@@ 
@@@ 
@@ 
@@@ 
@ 
@@ ~======::::y 
@@@ 
@@o@@@@@@@@@@@ 
@@ 
@@@@@@@@@@@@@@@o@ 
L-@@@@@@@@@@@@@@@o@ 


2.29 RE~ 
1.52 
,. 


45° 
CHAt.lfER 
(INDEX CORNER) 


SWAGGED 
PIN 
(4 PL) 


Family: Ceramic Pin Grid Array Package 


Symbol 
Millimeters 
Inches 


Mln 
Max 
Notes 
Mln 
Max 
Notes 


A 
3.56 
4.57 
0.140 
0.180 


A1 
0.64 
1.14 
SOLID LID 
0.025 
0.045 
SOLID LID 


A2 
0.23 
0.30 
SOLID LID 
0.110 
0.140 
SOLID LID 


A3 
1.14 
1.40 
0.045 
0.055 


B 
0.43 
0.51 
0.017 
0.020 


0 
44.07 
44.83 
1.735 
1.765 


01 
40.51 
40.77 
1.595 
1.605 


e1 
2.29 
2.79 
0.090 
0.110 


L 
2.54 
3.30 
0.100 
0.130 


N 
169 
169 


S1 
1.52 I 
2.54 
0.060 
0.100 


ISSUE 
IWS 
1/15/91 


NOTE: 
See Table 
15.1 for Dimension 
Symbol 
descriptions. 
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SECTION 
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Intel OverDrive™ 
Processor, 
169 Pin, PGA Package 
Dimensions 
with Heat Sink Attached 
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Intel 
OverDrive"" 
PROCESSOR 
Ot.iNI-DIRECTIONAL 
HEAT SINK 


/ADHESIVE 


Dimension 
(Inches) 
Mln 
Max 


A 
Heat Sink Width 
1.520 
1.550 
8 
PGA Package Width 
1.735 
1.765 


C 
Heat Sink Edge Gap 
0.065 
0.155 


D 
Heat Sink Height 
0.212 
0.260 


E 
Adhesive 
Thickness 
0.008 
0.012 


F 
Package 
Height from Stand-Offs 
0.140 
0.180 


G 
Total Height from Package 
0.360 
0.452 


Stand-Ofts 
to Top of Heat Sink 


sink, 
is measured 
using 
a 0.005 


H 
diameter 
(AWG 
#36) 
thermocouple 
with a 90· angle adhesive 
bond 
at the center 
of the package 
top surface, 
opposite 
the pins. This is then used to calculate 
9JC and 9JS. 
Figures 
15.5 and 
15.6 illustrate 
this 
methodology. 


The case temperature 
for the Intel OverDrive 
Proc- 
essor is measured 
as shown 
in Figure 
15.7. 


The Intel486 
SX microprocessor 
is specified 
for op- 
eration when TC (the case temperature) 
is within the 
range of 0·C-85·C. 
TC may be measured 
in any en- 
vironment 
to determine 
whether 
the Intel486 
SX mi- 
croprocessor 
is within 
specified 
operating 
range. 


The 
case 
temperature, 
with 
and 
without 
heat 


intel~ 


Figure 15.5. Case Temperature Measurement without Heat Sink (O.005" Dla. Thermocouple 
on the 
Center of Package Top Surface with a 90· Angle Adhesive Bond) . 


Figure 15.6.Case Temperature Measurement with Heat Sink (O.005" Dla. Thermocouple 
on the Center 


of Package Top Surface with a 900Angle Adhesive Bond through a Hole Drilled at the Heat Sink Base). 


Figure 15.7. Heat Sink Measurement {O.005" Dla. Thermcouple on the Center of Heat Sink with a 900 


Angle Adhesive Bond through a Hold Drilled through the Center of the Name Plate. 


• 


The ambient 
temperature 
(TAJ is guaranteed 
as long 
as TC is not violated. 
The ambient 
temperature 
can 


be calculated 
from 
9JC and 9JA from 
the following 
equations. 


TJ = Tc + P • 6JC 


TA = TJ - 
P • 6JA 


Tc = TA + P' 
[6JA - 
6Jcl 


where 
TJ. TA. Tc 
= Junction. 
Ambient 
and 
Case 
Temperature 
respectively. 
9JC. 9JA = Junction-to- 
Case and Junction-to-Ambient 
Thermal 
Resistance, 
respectively. 


intel~ 


The values 
for 9JA and 9JC are given in Table 
15.3 


for the 1.75 sq. in., 168-pin. 
ceramic 
PGA package 


and in Table 
15.4 for the 196-pin 
PQFP package. 


Tables 
15.5, 15.6, 15.7, and 15.8 show the TA allow- 


able (without 
exceeding 
Td at various 
airflows 
and 


operating 
frequencies 
(fCLK) for 
the 
Intel486 
SX 


CPU in PGA and PQFP. 


Note that TA is greatly 
improved 
by attaching 
"fins" 


or a "heat 
sink" 
to the 
package. 
P (the maximum 


power consumption) 
is calculated 
by using the maxi- 


mum Icc at 5V as tabulated 
in the DC Characteris- 


tics of Section 
14. 


15.1.1 Intel486TM 
SX MICROPROCESSOR 
PGA PACKAGE 


Table 
15.3. Thermal 
Resistance 
("C/W) 
9JC and 9JA 
(for Intel486 
SX CPU In PGA) 


9JA vs Alrflow-ft/mln 
9JC 
0 
200 
400 
600 
800 
1000 
I Without 
Heat Sink 
1.5 
17 
14.5 
12.5 
11.0 
10.0 
9.5 
I With Heat Sink' 
2.0 
13.0 
8.0 
6.0 
5.0 
4.5 
4.25 


'0.350' 
high unidirectional heat sink (AI alloy 6063, 40 mil fin width, 155 mil 
center-to-eenter fin spacing). See Figure 15.6 for details on measurement 
methodology. 


Table 
15.4. Maximum 
TA at Various 
Airflows 
In °C 
(for Intel486TM 
SX CPU In PGA package) 


fCLK 
Alrflow-ft/mln 
(m/sec) 


0 
200 
400 
600 
800 
1000 
(MHz) 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 
(5.07) 


TA without 
16 
50 
56 
60 
64 
66 
67 
Heat Sink 
20 
44 
51 
56 
67 
62 
64 


25 
36 
44 
50 
55 
58 
60 


33 
32 
40 
47 
52 
56 
58 


TA with 
16 
60 
71 
76 
78 
79 
80 
Heat Sink 
20 
56 
69 
74 
77 
78 
79 


25 
50 
66 
72 
76 
77 
78 


33 
47 
64 
71 
75 
76 
77 
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15.1.2 Intel486TM SX MICROPROCESSOR PQFP PACKAGE 


Table 15.5.Thermal Resistance ("C/W) 
6JCand 6JA 
{for Intel486 SX CPU (16, 20, and 25 MHz) In PQFP Package) 


6JC 
6JA vs Alrflow-ft/mln 


0 
200 
400 
600 
I 
Without 
Heat Sink 
3.5 
22.5 
19.0 
16.0 
14.0 
I 
With Heat Sink' 
6.0 
18.0 
12.5 
10.0 
9.5 


Table 15.6.Thermal Resistance ("C/W) 6JCand 6JA for the 
33 MHz Intel486 SX Microprocessor 
In PQFPwith Enhanced MM Package 


6JC 
6JA vs Alrflow-ft/mln 


0 
200 
400 
600 
I 
Without 
Heat Sink 
3.5 
20.3 
16.5 
13.9 
12.5 
I 
With Heat Sink' 
6.0 
16.7 
10.4 
8.3 
7.8 


·1.35" sq. 0.350" high omni-directional pin AI heat sink with 0.050" pin 
width, 0.143" pin-to-pincenter spacingand 0.100" base thickness. 


Table 15.7. Maximum TA at Various Airflows In·C 
(for Intel486 SX CPU In PQFP Package) 


fCLK 
Alrflow-ft/mln 
(m/sec) 


0 
200 
400 
600 
(MHz) 
(O) 
(1.01) 
(2.03) 
(3.04) 


TA without 
Heat Sink 
16 
42 
50 
57 
61 


20 
37 
46 
54 
59 


25 
32 
42 
50 
56 


TA with Heat Sink 
16 
58 
70 
76 
77 


20 
55 
69 
75 
76 


25 
51 
67 
74 
75 


Table 15.8. Maximum TA at Various Airflows in ·C 
(for the 33 MHz Intel486 SX Microprocessor 
In PQFP with Enhanced MM Package) 


fCLK 
Airflow-ft/min 
(m/sec) 


0 
200 
400 
600 
(MHz) 
(O) 
(1.01) 
(2.03) 
(3.04) 


ITA 
without 
Heat Sink 
33 
27 
40 
49 
54 


ITA 
with Heat Sink 
33 
48 
70 
77 
79 
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15.1.3 Intel 
OverDrive 
Processor 


The methodology 
for calculating 
the heat dissipation 
for the 20 MHz Intel OverDrive 
Processor 
(without 
a 
heat sink) and the 25 MHz, 33 MHz Intel OverDrive 
Processor 
(with a heat sink) are identical 
except that 
the reference 
point for measuring 
the product 
tem- 
perature 
is different. 
The 
20 
MHz 
Intel 
OverDrive 
Processor 
specifies 
Tease (the temperature 
at the 
outside 
center 
of the 
PGA 
package, 
opposite 
the 
pins) and 9JC (the thermal 
resistance 
from the sili- 
con junction 
to the package 
case) but the 25 MHz, 
33 
MHz 
Intel 
OverDrive 
Processor 
specifies 
Tsink 
(the temperature 
at the outside 
center 
base of the 
heat sink, not on the heat sink marking plate or cool- 
ing posts) 
and 9JS (the thermal 
resistance 
from the 
silicon junction 
to the heat sink base). The relation- 
ships between 
temperature, 
thermal 
resistance 
and 
power are shown 
in the following 
equations: 


Tcase = Tambient+ (Pmax• 8eAl and 


Tsink = Tambient+ (Pmax• 8sAl 


where, 


Tambient= Ambient Temperature 


Pmax= Power (Ice' 
Ved, 


8eA = 8JA - 
8Je, 


8SA = 8JA - 
8Js· 


The 20 MHz Intel OverDrive 
Processor, 
for 16 MHz 
and 20 MHz i486 SX CPU systems, 
is supplied 
with· 
out a heat sink. Table 
15.9 contains 
the thermal 
reo 
sistance 
values 
for 
the 
Intel 
OverDrive 
Processor 
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without 
heat sink. The thermal 
resistance 
values 
for 
the 
Intel 
OverDrive 
Processor 
with 
attached 
heat 
sink (25 MHz, 33 MHz versions) 
are shown 
in Table 
15.10. 
Table 
15.9 Thermal 
Resistance 
for the Intel 
OverDrlve™ 
Processor 
without 
Heat Sink 


9JC = 
2.0'C/W 
0 


9JA("C/W) 
19.0' 


800 


10.0 


NOTE: 
'The thermal resistance from junction to ambient (8JAl in 
static air is actually a linear function of power dissipation. 
The value shown in the table (19.0'C/W) 
represents the 
worst case expected value which is derived from a power 
dissipation of 2.9W. The maximum expected power dissipa- 
tion of 4W yields a 8JA value of 18.5'C/W. 


Table 
15.10. Thermal 
Resistance 
for the Intel 
OverDrlve™ 
Processor 
with 
Attached 
Heat Sink 


9JS = 
2.5'C/W 
0 
800 


5.7 


NOTE: 
'The thermal resistance from junction to ambient (8JAl in 
static air is actually a linear function of power dissipation. 
The value shown in the table (14.0'C/W) represents the 
worst case expected value which is derived from a power 
dissipation of 2.9W. The maximum expected power dissipa- 
tion of 6W yields a 8JA value of 13.1'C/W. 


fClK 
Airflow-Linear 
ft/mln 
(m/sec) 


(MHz) 
0 
200 
400 
600 
800 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 


OverDrive 
Processor 
16 
42 
51 
62 
67 
70 
without 
Heat Sink(l) 
20 
29 
41 
54 
60 
64 


OverDrive 
Processor 
25 
30 
49 
61 
67 
70 
with Heat Sink(2) 
33 
16 
40 
55 
63 
66 


NOTES: 
1. The 20 MHz OverDrive Processor does not have a heat sink. 
2. The 25 MHz and 33 MHz Overrive Processors have a heat sink attached. 


16.0 
LOW POWER 
Intel486TM SX 
MICROPROCESSOR 


Low Power Intel486™ 
SX CPUI 
Intel487™ 
SX MCP 
• Lower Power Dissipation 
- 
Dynamic Frequency Scalability 
- 
Icclmax) Reduced to 150 mA at 2 MHz 
- 
Improved Vcc Rating (±10%) 
• Binary Compatible with Large Software Base 
- 
MS-DOS, OS/2, Windows 
- 
UNIX System V/386 
- 
iRMX, iRMK Kernels 
• High Integration Enables On-Chip 
- 
8 KByte Code and Data Cache 
- 
Floating Point Unit on the Intel487 SX Math 
CoProcessor 
- 
Paged, Virtual Memory Management 


• Easy to Use 
Built-In Self Test 
Hardware Debugging Support 
Intel Software Support 
Ex1ensiveThird Party Software Support 
• 168-Lead Pin Grid Array for Intel486 SX Micro- 
processor 


• 196-Lead Plastic Quad Flat Package for Intel486 
SX Microprocessor 
• 169-Pin Grid Array Package for Intel487 SX Math 
CoProcessor 
• High Performance Design 
- 
Intel486 One Clock Instruction Core 
- 
16/20/25 MHz Operation for Intel486 SX 
- 
64 MByte/Sec Burst Bus 
- 
CHMOS IV Process Technology 
- 
Dynamic Bus Sizing for 8-, 16- and 32-Bit Buses 


• Complete 32-Bit Architecture 
- 
Address and Data Buses 
- 
Registers 
- 
8-, 16- and 32-Bit Data Types 
• Multiprocessor Support 
- 
Multiprocessor Instructions 
- 
Cache Consistency Protocols 
- 
Support for Second Level Cache 


This section describes the Low Power Intel486 SX 
microprocessor. The Intel487 Math CoProcessor will 
support the low power Intel486 SX microprocessor 
as an optional upgrade available through the retail 
channel. 


The Low Power Intel486 family microprocessors 
meet today's need for high performance portables. 
Their combination of special features like dynamic 
frequency scaling, lower minimum frequency, im- 
proved Vcc operation and high integration contrib- 
ute significantly to lower power dissipation and meet 
the needs of portable computing. 


The Low Power capability is achieved by operating 
the Intel486 microprocessor in the 2X mode. The 
frequency can be varied dynamically between maxi- 
mum to minimum as needed. The frequency change 
does not affect contents of the registers and data 
integrity is maintained. Power dissipation is reduced 
significantly at 2 MHz where Icc is only 150 mA 
compared to 600 mA at 20 MHz. 


The 
Low 
Power 
Intel486 
microprocessors 
are 
100-percent compatible with all versions of the 
Intel386 microprocessor family, assuring compatibil- 
ity with the more than $40 billion software base of 
MS-DOS, Windows, OS/2 and UNIX/System oper- 
ating system applications. The Low Power Intel486 
microprocessor integrates the same RISC-technolo- 
gy, one clock per instruction integer core, on-chip 
cache, and memory management unit as the stan- 
dard Intel486 microprocessor. 


The Intel487 Math CoProcessor provides optional 
math upgrade capability for the Intel486 SX micro- 
processor and supports low power operation; provid- 
ing end-users increased floating point performance 
for more than 2100 software packages that were de- 
signed to use Intel Math CoProcessors. Note that 
the Intel OverDrive Processor does not work in sys- 
tems based on the Low Power Intel486 CPU. 


The following section on the Low Power Intel 486 SX 
microprocessor contains information specific to the 
Low Power device only. All data not defined are lo- 
cated in the appropriate sections of this data sheet 
unless specified otherwise. 


The Low Power Intel486 microprocessor brings In- 
tel486 technology and performance to the portable 
computer 
market. The 
low 
power 
capability 
is 
achieved by a frequency scalability feature during 
normal operation. The operating frequency can be 
brought down dynamically resulting in lower power 
supply current (Icel. This results in minimal power 
dissipation which ensures a longer battery life. 


• 


The Low Power Intel486 microprocessor integrates 
the same RISC-technology, one clock per instruc- 
tion integer core, on-chip cache, and memory man- 
agement unit as the standard Intel486 microproces- 
sor. 


The Low Power Intel486 microprocessor has the fol- 
lowing special features: 


• Frequency Scalability-This 
is achieved by op- 
erating the Intel486 microprocessor in the 2X 
clock mode. The frequency can be varied dynam- 
ically from maximum back to minimum or vice 
versa. The frequency change does not affect the 
register content of the CPU, thus data integrity is 
maintained. 
• Lower 
Minimum Frequency-The 
Low Power 
Intel486 microprocessor can be operated at a 
minimum frequency of 2 MHz, at which Icc(max) 
is only 150 mA, compared to an Icc(max) of 
600 mA at 20 MHz operation. The power dissipa- 
tion is thus drastically reduced ensuring a longer 
battery life. 


• Improved 
Vcc Operation-The 
Low Power In- 
tel486 microprocessor has an improved Vcc rat- 
ing of ± 10%. Again this feature makes it ex- 
tremely attractive to portable battery powered ap- 
plications. 


The above three features ensure power savings for 
portable computer systems resulting in prolonged 
battery life. 


Besides these special features, the Low Power 
Intel486 microprocessor has an identical feature set 
to the standard Intel486 CPU. This includes: 
• Binary Compatibility-The 
Low Power Intel486 
CPU is binary compatible with the 8086, 8088, 
80186, 80286, i386TMSX, i386™ OX, Intel486™ 
SX and Intel486TMOX CPUs. 


• Full 32-Bit Integer Processor-The 
Low Power 
Intel486 CPU performs a complete set of arith- 
metic and logical operations on 8-, 16-, and 32-bit 
data types using a full-width ALU and eight gener- 
al-purpose registers. 
• Separate 
32-Bit Address 
and Data Paths- 


Four gigabytes of physical memory can be ad- 
dressed directly. 


• Single-Cycle 
Execution-Many 
instructions ex- 
ecute in a single clock cycle. 


• On-Chip Floating Point Unit-This 
is available 
on the Intel486 OX CPU. The 32-, 64-, and 80-bit 
formats specified in IEEE standard 754 are sup- 
ported. The unit is binary compatible with the 
8087,80287, i387™, i387TMSX, and Intel Over- 
Drive Processor and the Intel486™ CPU. 


• On-Chip Memory Management Unit-Address- 
management 
and 
memory-space 
protection 
mechanisms maintain the integrity of memory. 
This is necessarYin multitasking and virtual-mem- 
ory environments, like those implemented by the 
UNIX and OS/2 operating systems. Both memory 
segmentation and paging are supported. 


• On-Chip Cache with Cache Consistency 
Sup- 
port-The 
internal write-through cache can hold 
8 KBytes of data or instructions. Cache hits are 
as fast as read accesses to a processor register. 
Bus activity is tracked to detect alterations in the 
memory which internal cache represents. The in- 
ternal cache can be invalidated or flushed, so 
that an external cache controller can maintain 
cache consistency in multi-processor environ- 
ments. 
• External Cache Control-Write-back 
and flush 
controls over an external cache are provided so 
that the processor can maintain cache consisten- 
cy in multi-processor environments. 


• Instruction 
Pipellning- 
The fetching, decoding, 
execution and address translation of instructions 
are overlaped within the Low Power Intel486 mi- 
croprocessor. This results in a continuous execu- 
tion rate of one clock cycle per instruction, for 
most instructions. 
• Burst Cycles-Burst 
transfers allow a new dou- 


bleword to be read from memory each clock cy- 
cle. With this capability the internal cache and in- 
struction prefetch buffer can be filled very rapidly. 


• Write 
Buffers-The 
processor contains write 
buffers to enhance the performance of consecu- 
tive writes to memory. The Low Power Intel486 
CPU can continue operations internally after a 
write, without waiting for the write to be executed 
on the external bus. 


• Bus Backoff-If 
another bus master needs con- 
trol of the bus during a Low Powerlntel486 micro- 
processor initiated bus cycle, the Low Power 
Intel486 microprocessor will float its bus signals, 
then restart its cycle when the bus becomes 
available again. 


• Instruction 
Restart-Programs 
can continue ex- 
ecution following an exception generated by an 
unsuccessful attempt to access memory. This 
feature is important for supporting demand-paged 
virtual memory applications. 
• Dynamic Bus Sizing-External 
controllers can 
dynamically alter the effective width of the data 
bus. Bus widths of 8, 16 or 32 bits can be used. 
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The low 
Power Intel486™ 
SX microprocessor 
pinout follows 
the same definition 
as the Intel486™ 
SX micro- 
processor 
given in Section 
1.0, except 
for the pins listed in Table 
16.1. 


i486SX 
Low Power 
Pln# PGA Pin# PQFP 
Microprocessor 
1486Microprocessor 


ClK 
ClK2 
C3 
123 
NC 
ClKSEL 
A3 
127 


S 
R 
Q 
P 
N 
M 
l 
K 
J 
H 
G 
F 
E 
0 
C 
B 
A 


1 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
1 
A27 
A31 
02 
Vss 
Vee 
Vss 
Vss 
011 
020 
A28 
DO 
Vss 
Vss 
Vss 
OPl 
09 
019 
2 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
2 
A26 
Vss 
01 
06 
05 
Vee 
Vee 
018 
022 
A25 
A29 
Vee 
Vee 
03 
08 
013 
021 
3 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
3 
A23 
A17 
OPO 
07 
016 
012 
010 
CLK2 
CLKSEL 
Vee 
A30 
04 
014 
OP2 
015 
017 
vss 
4 
0 
0 
0 
0 
0 
0 
4 
NC 
A19 
Vcc 
023 
Vss 
vss 
5 
0 
0 
0 
0 
0 
0 
5 
A14 
A21 
vee 
OP3 
A18 
vss 
6 
0 
0 
0 
0 
0 
0 
6 
vss 
A24 
027 
024 
Vee 
025 
7 
0 
0 
0 
0 
0 
0 
7 
A12 
A22 
026 
Vss 
A15 
vee 
8 
0 
0 
0 
LOW POWER 
0 
0 
0 
8 
vss 
A20 
028 
029 
Vee 
031 
9 
0 
0 
0 
168-PIN 
PGA PINOUT 
0 
0 
0 
9 
vss 
A16 
030 
Vss 
Vee 
Intel486TM 
SX CPU 
vee 
10 
0 
0 
0 
0 
0 
0 
10 
vss 
A13 
TOP SIDE VIEW 
NC 
NC 
Vee 
NC 
11 
0 
0 
0 
0 
0 
0 
11 
vss 
A9 
NC 
vss 
vee 
vee 
12 
0 
0 
0 
0 
0 
0 
12 
vss 
A5 
NC 
NC 
All 
NC 
13 
0 
0 
0 
0 
0 
0 
13 
AIO 
A7 
NC 
NC 
A8 
NC 
14 
0 
0 
0 
0 
0 
0 
14 
vss 
A2 
NC 
NC 
vee 
NC 
15 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
15 
A6 
BREQ 
HLOA LOCK* 
o/c* 
PWT 
BE2* 
NC 
HOLD 
rLUSH* 
NlAl 
A3 
BEO* 
BROY* 
KEN* 
A20lA* 
NC 
16 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
16 
A4 
PLOCK* 
lA/IO* 
vce 
BE1* 
vee 
Vcc 
RESET 
INTR 
BLAST* 
Vee 
Vcc 
Vee 
Vee 
ROY* 
BS8* 
NC 
17 
0 
0 
0 
0 
W?R* 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
17 
AOS* 
PCHK* 
vss 
pco 
vss 
vss 
BS16* 
AHOLO 
NC 
vss 
vss 
vss 
vss 
BE3* 
Borr* 
EADS* 


S 
R 
Q 
P 
N 
M 
l 
K 
J 
H 
G 
F 
E 
0 
C 
B 
A 


240950-87 
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A 
B 
C 
D 
E 
F 
G 
H 
J 
K 
L 
M 
N 
P 
Q 
R 
S 


1 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
1 
020 
011 
VSS 
VSS 
Vee 
VSS 
02 
A31 
A27 
019 
09 
OPI 
VSS 
VSS 
VSS 
DO 
A28 
2 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
2 
022 
018 
Vee 
Vee 
05 
06 
01 
VSS 
A26 
021 
013 
08 
03 
Vee 
Vee 
A29 
A25 
3 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
3 
CLKSEL 
CLK2 
010 
012 
016 
07 
OPO 
A17 
A23 
VSS 
017 
015 
OP2 
014 
04 
A30 
vee 
4 
0 
0 
0 
0 
0 
0 
4 
023 
Vee 
A19 
NC 
VSS 
vss 
5 
0 
0 
0 
0 
0 
0 
5 
OP3 
vee 
A21 
AU 
VSS 
Al8 
6 
0 
0 
0 
0 
0 
0 
6 
024 
027 
A24 
VSS 
025 
vee 
7 
0 
0 
0 
0 
0 
0 
7 
vss 
026 
A22 
A12 
Vee 
A15 
8 
0 
0 
0 
LOW POWER 
0 
0 
0 
8 
029 
028 
A20 
VSS 
031 
vee 
9 
0 
0 
0 
168-PIN 
PGA PINOUT 
0 
0 
0 
9 
vss 
030 
A16 
VSS 
Vee 
Intel486TM 
SX CPU 
vee 
10 
0 
0 
0 
0 
0 
0 
10 
NC 
NC 
PIN SIDE VIEW 
A13 
Vss 
NC 
vee 
11 
0 
0 
0 
0 
0 
0 
11 
vss 
NC 
A9 
vss 
vee 
vee 
12 
0 
0 
0 
0 
0 
0 
12 
NC 
NC 
A5 
vss 
NC 
All 
13 
0 
0 
0 
0 
0 
0 
13 
NC 
NC 
A7 
Al0 
NC 
A8 
14 
0 
0 
0 
0 
0 
0 
14 
NC 
NC 
A2 
vss 
NC 
vee 
15 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
15 
NIAI 
FLUSH# 
HOLD 
NC 
8E2# 
PWT 
O/C# 
LOCK# 
8REQ 
A6 
Ne 
A201A# 
KEN# 
8ROY# 
8EO# 
HLOA 
A3 
16 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
16 
INTR 
RESET 
Vee 
Vee 
8El# 
Vee 
IA/IO# 
PLOCK# 
A4 
NC 
8S8# 
ROY# 
Vee 
Vee 
Vee 
Vee 
8LAST# 
17 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
W?R# 
0 
PC~K# 
,0 
0 
17 
AHOLO 
8S16# 
vss 
vss 
pco 
vss 
AOS# 
EAOS# 
80FF# 
8[3# 
VSS 
vSS 
VSS 
VSS 
NC 


A 
B 
C 
D 
E 
F 
G 
H 
J 
K 
L 
M 
N 
P 
Q 
R 
S 


240950-88 


intel~ 


>8 •• }~ 
~ ~ >~ ; 
"~ 
>8 ; 
>::1 
; 
>8 ; 
,8 •• 
>rJ. 
>YJ. 
~ 
0 
>8 ~ 
: •• ~ . ~ : 
~ 
>~ 
•• 
•• ; •• 
•• 


>VI 
::;; ; 
•• .•• 
. 
•• 
•• ~•• 
•• 
•• 
•• 
; 
~ ~~ ~ 
" . ~~ .~ 
0 ~; ~~ .".~ 
0 ;.; ~; ~~~i! ;. 
- - 
- - - 
- 
- 
- 
- - 
- 
- - 
- - 
- ~ 
- 
'" 
Vcc 
• 
". 
A2 
". 


AM> 
". 
•.••.ST. 


1&! 
PLOCK. 


Low Power 
Intel486TM 
SX CPU 


Plaatlc Quad Flat Package (PQFP) 


(Top View) 


~~~~".~~~:; ~; 
N 
" ;: ~.~: : ~~~~...~~~~; = = ;: = : ~ : 
; = = ;:.~~: 
~. 
~~ 
~~~ 


>Vl S •• 
8" .. ~::os •• 
>8 ~ •• 
>YJ. S •• 
80 
•• •• :a •••• •••• •• ~•• •• •• 
8u 
>'(J, 
~ 
•• •• •••• •• >8" 
" 
"u 
>8 
:;: > :;: 
0 
0 
0 > a 
> 
z 
>VI 
~:II: 


240950-89 


Figure 16·3. Low Power Intel486TM SX CPU 196·Lead 
PQFP Pinout 
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Address 
Data 
Control 
N/C 
Vcc 
Vss 


A2 
014 
Do 
P1 
A20M# 
015 
A10 
B7 
A7 
A3 
R15 
01 
N2 
AOS# 
S17 
A12 
B9 
A9 
A4 
S16 
02 
N1 
AHOLO 
A17 
A13 
B11 
A11 
A5 
012 
03 
H2 
BEO# 
K15 
A14 
C4 
B3 
Ae 
S15 
04 
M3 
BE1# 
J16 
B10 
C5 
B4 
A7 
013 
05 
J2 
BE2# 
J15 
B12 
E2 
B5 
As 
R13 
De 
L2 
BE3# 
F17 
B13 
E16 
E1 
A9 
011 
07 
L3 
BLAST # 
R16 
B14 
G2 
E17 
Al0 
S13 
08 
F2 
BOFF# 
017 
B16 
G16 
G1 
An 
R12 
09 
01 
BROY# 
H15 
C10 
H16 
G17 
A12 
S7 
010 
E3 
BREO 
015 
C11 
J1 
H1 
A13 
010 
On 
C1 
BS8# 
016 
C12 
K2 
H17 
A14 
S5 
012 
G3 
BS16# 
C17 
C13 
K16 
K1 
A15 
R7 
013 
02 
CLK2 
C3 
G15 
L16 
K17 
A16 
09 
014 
K3 
CLKSEL 
A3 
R17 
M2 
L1 
A17 
03 
015 
F3 
O/C# 
M15 
S4 
M16 
L17 
A18 
R5 
016 
J3 
OPO 
N3 
P16 
M1 
A19 
04 
017 
03 
OP1 
F1 
R3 
M17 
A20 
08 
018 
C2 
OP2 
H3 
R6 
P17 
A21 
05 
019 
B1 
OP3 
A5 
R8 
02 
A22 
07 
020 
A1 
EAOS# 
B17 
R9 
R4 
A23 
S3 
021 
B2 
FERR# 
C14 
R10 
S6 
A24 
06 
022 
A2 
FLUSH# 
C15 
R11 
S8 
A25 
R2 
023 
A4 
HLDA 
P15 
R14 
S9 
A26 
S2 
024 
A6 
HOLD 
E15 
S10 
A27 
S1 
025 
B6 
IGNNE# 
A15 
S11 
A28 
R1 
026 
C7 
INTR 
A16 
S12 
A29 
P2 
027 
C6 
KEN# 
F15 
S14 
A30 
P3 
028 
C8 
LOCK# 
N15 
A31 
01 
029 
A8 
M/IO# 
N16 
030 
C9 
NMI 
B15 
031 
B8 
PCO 
J17 
PCHK# 
017 
PWT 
L15 
PLOCK # 
016 
ROY# 
F16 
RESET 
C16 
W/R# 
N17 


Address 
Data 
Control 
N/C 
Vcc 
Vss 


A2 
146 
Do 
17 
A20M# 
104 
15 
6 
1 
A3 
150 
01 
18 
AOS# 
145 
34 
19 
11 
A4 
152 
02 
20 
AHOLO 
129 
52 
24 
21 
A5 
152 
03 
23 
BEO# 
117 
56 
28 
22 
A6 
158 
04 
25 
BE1# 
116 
60 
36 
33 
A7 
159 
05 
26 
BEU 
115 
64 
49 
40 
A8 
161 
06 
27 
BE3# 
113 
68 
54 
50 
A9 
163 
07 
29 
BLAST # 
144 
72 
62 
58 
AlO 
165 
08 
31 
BOFF# 
137 
73 
70 
66 
A11 
172 
09 
32 
BROY# 
138 
75 
84 
86 
A12 
174 
010 
35 
BREQ 
118 
76 
93 
95 
A13 
176 
011 
37 
BS8# 
135 
77 
98 
96 
A14 
178 
012 
38 
BS16# 
136 
78 
107 
99 
A15 
180 
013 
39 
CLK2 
123 
79 
112 
109 
A16 
181 
014 
41 
CLKSEL 
127 
81 
119 
114 
A17 
183 
015 
42 
O/C# 
110 
82 
125 
121 
A18 
189 
016 
44 
OPO 
16 
83 
131 
126 
A19 
191 
017 
.45 
OP1 
30 
85 
147 
141 
A20 
193 
018 
46 
OP2 
43 
87 
164 
148 
A21 
2 
019 
47 
OP3 
57 
88 
170 
167 
A22 
3 
020 
48 
EAOS# 
105 
89 
175 
168 
A23 
4 
021 
51 
FLUSH# 
102 
90 
179 
177 
A24 
5 
022 
53 
HLOA 
122 
91 
184 
182 
A25 
7 
023 
55 
HOLD 
130 
92 
196 
194 
A26 
8 
024 
59 
INTR 
101 
94 
A27 
9 
025 
61 
KEN# 
132 
97 
A28 
10 
026 
63 
LOCK# 
142 
124 
A29 
12 
027 
65 
M/IO# 
111 
134 
A30 
13 
028 
67 
NMI 
100 
140 
A31 
14 
029 
69 
PCO 
106 
149 
030 
71 
PCHK# 
139 
151 
031 
74 
PWT 
108 
153 
PLOCK# 
143 
155 
ROY# 
133 
157 
RESET 
103 
160 
TCK 
128 
162 
TOI 
185 
166 
TOO 
80 
166 
TMS 
187 
169 
UP# 
156 
171 
W/R# 
120 
173 
186 
188 
190 
192 
195 
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All pin descriptions 
for the Low Power Intel486 
SX microprocessor 
follow 
the same definition 
as the Intel486 
SX microprocessor 
with the exception 
of those 
listed in Table 
16.2. 


Symbol 
Type 
Name and Function 


CLK2 
I 
CLK2 provides the fundamental 
timing for the Low Power Intel486 SX microprocessor. 


This is twice the internal frequency 
of the CPU. 


CLKSEL 
I 
CLOCK SELECT 
pin selects the 2X mode required for the Low Power Intel486 SX CPU. 


A well defined 
pulse on this pin establishes 
the phase relationship 
of the 2X clock. With 
the exception 
of a pulse during cold reset. this pin should be driven low at all times and 


must be free of spikes or glitches. 


Table 
16.3 lists all the output 
pins. indicating 
their 


active 
level. and when they are floated. 


Name 
Active 
When Floated 
Level 


BREQ 
HIGH 


HLDA 
HIGH 


BEOill'-BE3il1' 
LOW 
Bus Hold 


PWT, PCD 
HIGH 
Bus Hold 


W/RilI'.D/CilI'. 
HIGHILOW 
Bus Hold 


MilOill' 
LOCK ill' 
LOW 
Bus Hold 


PLOCKill' 
LOW 
Bus Hold 


ADS ill' 
LOW 
Bus Hold 


BLAST ill' 
LOW 
Bus Hold 


PCHKilI' 
LOW 


FERRill'· 
LOW 


A2-A3 
HIGH 
Bus. 
Address 
Hold 


Table 
16.4 lists all input pins, indicating 
their active 


level, 
and whether 
they 
are synchronous 
or asyn- 


chronous 
inputs. 


Name 
Active 
Synchronousl 


Level 
Asynchronous 


CLK2 
CLKSEL 
RESET 
HIGH 
Asynchronous 


HOLD 
HIGH 
Synchronous 


AHOLD 
HIGH 
Synchronous 


EADSilI' 
LOW 
Synchronous 


BOFFilI' 
LOW 
Synchronous 


FLUSH ill' 
LOW 
Asynchronous 


A20MilI' 
LOW 
Asynchronous 


BS16il1'. BS8il1' 
LOW 
Synchronous 


KEN ill' 
LOW 
Synchronous 


ROYill' 
LOW 
Synchronous 


BRDYilI' 
LOW 
Synchronous 


INTR 
HIGH 
Asynchronous 


NMI 
HIGH 
Asynchronous 


IGNNEilI'·(1) 
LOW 
Asynchronous 


UPilI'(2) 
LOW 
Asynchronous 


NOTES: 
1. The IGNNE # pin is present 
on the Intel486 
OX CPU and 


Intel487 
SX MCP only. 


2. The 
UP# 
pin is present 
on the Intel486 
SX CPU PQFP 


package 
only. 


Table 16.5 lists all the input/output pins, indicating 
their active level and when they are floated. 


Name 
Active 
When Floated 
Level 


00-031 
HIGH 
Bus Hold 
OPO-OP3 
HIGH 
Bus Hold 
M-A31 
HIGH 
Bus, Address Hold 


Name 
Input or 
Sampled/ 
Output 
Driven On 


TCK 
Input 
N/A 


TOI 
Input 
Rising Edge of TCK 


TOO 
Output 
Falling Edge of TCK 


TMS 
Input 
RisingEdge of TCK 


1486 SX Microprocessor 
Component 
Revision 
Stepping 
Name 
10 
10 


AO 
04 
20 
BO 
04 
22 


NOTE: 
Table 16.7 shows the Component ID number and Revision 
ID number for the A-O stepping of the Intel486 SX Micro- 
procesor and Intel OverDrive Processor. When an Intel 
OverDrive Processor is installed in the system, the Compo- 
nent ID and Revision ID is provided by the Intel OverDrive 
Processor and not the Intel486 SX Microprocessor. The 
Component ID and Revision ID read by the BIOS/software 
may change when a Performance Upgrade Component, 
such as the Intel OverDrive Processor, is installed in an 
Intel486 SX Microprocessor based system. 


16.6 
Signal Description 


With the exception of CLK2 and CLKSEL, all signals 
follow the same definition as the Intel486 microproc- 
essor. The A.C. timing parameters for all of these 
signals are given in Tables 16.11-16.13. 


CLK2 provides the fundamental timing for the Low 
Power Intel486 microprocessor. It is divided by two 
internally to generate the internal processor clock 
used for instruction execution. The internal clock is 
comprised of two phases, "phase one" and "phase 
two". Each CLK2 period is a phase of the internal 
clock. Figure 15.4 illustrates the relationship. If de- 
sired, the phase of the internal processor clock can 
be synchronized to a known phase by ensuring the 
pulse on the CLKSEL pin meets the applicable tim- 
ings during cold boot (power-up reset). 


All set-up, hold, float-delay and valid delay timings 
are referenced to the phase one of the clock. 


The internal processor clock (CLK) is similar to the • 
clock .signal of the standard Intel486 microproces- 
sor. All I/O signals get sampled on the rising edge of 
this signal, Le. the rising edge of phase one. Thus it 
is important to synchronize the external circuitry with 
the phase one of CLK2. 


Clock Select pin selects the 2X mode required for 
the Low Power Intel486 CPU. This pin should be 
driven low after power-up and during the entire oper- 
ation of the CPU. However, a well defined pulse is 
required on CLKSEL pin during cold boot (power-up 
reset) to establish the phase relationship of the 2X 
clock. The reset pulse width during cold reset should 
be at least 1 ms. As shown in Figure 16.5, the pulse 
on CLKSEL should be asserted by the end of reset 
(approximately 0.9 ms after driving reset active) and 
at least 30 CLK2 periods before the falling edge of 
reset. 


Figure 16.6 shows the detailed timing definition of 
this pulse. The pulse on CLKSEL pin is only required 
during power-up reset. During all other times includ- 
ing warm resets the CLKSEL pin should be driven 
low and must be free of spikes or glitches. After the 
power-up reset, the system must track the phase of 
CLK2 at all times including during warm resets so 
that the input/output signals can be sampled at the 
appropriate clock edge. The phase relationship is 
described in the next section. 


ClK2 
PERIOD 


<1>2 


ClK2 
PERIOD 


<1>1 


INTERNAL 
InteI486Tt.lCPU 
ClK 


(half 
the 
freq. 


of ClK2) 
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The Low Power Intel486 SX microprocessor is archi- 
tecturally similar to the Intel486 SX CPU. Thus all 
bus cycles follow the same definition. The difference 
lies in the fact that the Low Power Intel486 SX CPU 
works with an external 2X clock input (CLK2). As 
shown in Figure 16.4, each of the internal processor 
clock (CLK) cycles is two CLK2 cycles wide. Thus a 
25 MHz Low Power Intel486 microprocessor needs 
a 50 MHz clock input. 


CLK2 provides the fundamental timing for the Low 
Power Intel486 SX CPU, It is divided by two internal- 
ly to generate the internal processor clock (CLK) 
used for instruction execution. The internal clock is 
comprised of two phases, "phase one" and "phase 
two". Each CLK2 period is a phase of the internal 
clock. All Low Power Intel486 SX microprocessor in- 
puts are sampled at the rising edge of phase 1. Each 
bus cycle is comprised of at least two bus states, T1 
and T2. Each bus state in turn consists of two CLK2 
cycles phase 1 and phase 2 of the bus state. The 
bus state diagram in Section 7.2.13 is valid for the 
Low Power Intel486 SX microprocessor. 


NOTE: 
The timing diagrams given for the Intel486 SX can 
be used for the Low Power Intel486 SX microproc- 
essor. Read "CLK" signal as the internal clock of 
the CPU, with "CLK2" (the input clock of the Low 
Power Intel486 CPU) being twice the frequency of 
the internal processor clock as shown in Figure 
16.4. 


The following describes how the input signals are 
sampled and output signals are referenced with re- 
spect to the input clock (CLK2): 


The Low Power Intel486 SX CPU samples all its 
synchronous 
input signals (Le., RDY#, BRDY#, 


BS8#, 
BS16#, 
KEN#, 
EADS#, 
BOFF#, 
HOLD 
and AHOLD) at the rising edge of phase 1, as long 
as proper setup and hold times relative to that clock 
edge are met. 


The Low Power Intel486 SX CPU samples all its 
asynchronous 
input signals (Le., RESET, INTR, 
NMI, A20M# FLUSH#, IGNNE#) at every other ris- 
ing edge of the system clock (Phase 1), as long as 
proper setup and hold times relative to that clock 
edge are met. 


The A.C. timing specifications for output signals (Le., 
valid and float delay timings) are specified with re- 
spect to the rising edge of the Phase 1 of the system 
clock. This holds true for all output signals including 
ADS# and PCHK#. 
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The Low Power Intel486 SX microprocessor allows 
the 
CPU frequency to 
change dynamically. As 
shown in Figures 16.7 and 16.8, the relationship be- 
tween frequency and power consumption is approxi- 
mately linear. Thus lowering the CPU frequency, re- 
duces the power supply current (led consumed by 
the CPU. 


The following must be satisified to change the CPU 
frequency: 
1. Frequency can be changed at least 8 clocks after 


satisfying t4 (see Figure 16.6). The system can be 
started at a lower frequency and after satisfying 
the CLKSEL pulse specifications, it can be oper- 
ated at the required speed. 
2. The change in frequency should satisfy the mini- 


mum specification 
of 
"CLK2 
high time" 
and 


"CLK2 low time". That is, at no time should the 
clock period go below the specified clock high 
and clock low times (see A.C. specifications for 
exact values). 
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Table 16.6 provides the absolute maximum ratings. It is a stress rating only and functional operation at the 
maximums is not guaranteed. Functional operating conditions are given in Section 16.9.1 D.C. Specifications 
and 16.9.3 A.C. Specifications. 


Case Temperature under Bias 
-65°C to + 110°C 


Storage Temperature 
- 65°C to + 150°C 


Voltage on Any Pinwith Respect to Ground 
-O.5V to (Vee + O.5V) 


Supply Voltage with Respect to VSS 
-O.5Vto 
+6.5V 


Table 
16.7 provides 
the 
D.C. operating 
conditions 
for the 
low 
Power 
Intel486 
SX microprocessor 
(PGA 
Version) 
and the Intel487 
SX Math CoProcessor 
installed 
in a low power 
system. 


Symbol 
Parameter 
Mln 
Max 
Unit 
Notes 


VIL 
Input low 
Voltage 
-0.3 
+0.8 
V 


VIH 
Input High Voltage 
2.0 
VCC + 0.3 
V 


VOL 
Output 
low 
Voltage 
0.45 
V 
(Note 1) 


VOH 
Output High Voltage 
2.4 
V 
(Note 2) 


Icc 
Power Supply Current 
ClK2 
= 32 MHz 
525 
mA 
(Note 3) 
= 40 MHz 
600 
= 50 MHz 
700 


ICCF 
Power Supply Current with Intel486 SX 
CPU Tn-stated 
(floating) 
ClK2 
= 32 MHz 
400 
mA 
= 40 MHz 
500 
= 50 MHz 
600 


III 
Input leakage 
Current 
±15 
/LA 
(Note 4) 


IIH 
Input leakage 
Current 
200 
/LA 
(Note 5) 


IlL 
Input leakage 
Current 
-400 
/LA 
(Note 6) 


ILa 
Output leakage 
Current 
±15 
/LA 


CIN 
Input Capacitance 
20 
pF 
FC = 1 MHz(7) 


Co 
I/O or Output Capacitance 
20 
pF 
FC = 1 MHz(7) 


CCLK 
ClK 
Capacitance 
20 
pF 
Fc = 1 MHz(7) 


NOTES: 
1. This parameter is measured at: 


Address, Data BEn 4.0 mA 
Definition, Control 5.0 mA 
2. This parameter is measured at: 


Address, Data BEn -1.0 
mA 
Definition, Control -0.9 
mA 
3. Typical supply current 
Ice 
= 400 @CLK2 = 32 MHz 
(Normal Operation) 
= 475 mA @CLK2 = 40 MHz 
= 500 mA @CLK2 = 50 MHz 
ICCF = 325 mA @CLK2 = 32 MHz 
Intel486 CPU Tri-stated (Floating) 
= 400 mA @CLK2 = 40 MHz 
= 470 mA @CLK2 = 50 MHz 
4. This parameter is for inputs without pullups or pulldowns and 0 ,;; VIN ,;; Vcc. 
5. This parameter is for inputs with pulldowns and VIH = 2.4V. 
6. This parameter is for inputs with pullups and VIL = 0.45V. 
7. Not 100% tested. 
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Table 
16.8 provides 
the 
D.C. Operating 
Conditions 
for the 
Low Power 
Intel486 
SX microprocessor 
(PQFP 
version) 
and the Intel487 
SX Math CoProcessor 
installed 
in a low power system. 


Table 16.8.Low Power Intel486TMSX Microprocessor D.C.Parametric Values (PQFPversion) 


Symbol 
Parameter 
Mln 
Max 
Unit 
Notes 


VIL 
Input Low Voltage 
- 0.3 
V 


VIH 
Input High Voltage 
V 


VOL 
Output Low Voltage 
V 


VOH 
Output High Voltage 
V 


lee 
Power Supply Current 
CLK2 = 32 MHz 
CLK2 = 40 MHz 
CLK2 = 50 MHz 


(Note 1) 


(Note 2) 


(Note 3) 


III 
Input Leakage Current 


IIH 
Input Leakage Current 


IlL 
Input Leakage C 


ILO 
Output Leaka 


CIN 
Input Capacitance 


Co 
I/O or Output Capacitance 


CeLK 
CLK Capacitance 


mA 


mA 
(Note 7) 


±15 
IJ-A 
(Note 4) 


200 
IJ-A 
(Note 5) 


-400 
IJ-A 
(Note 6) 


±15 
IJ-A 


10 
pF 
Fe = 1 MHz(?) 


10 
pF 
Fe = 1 MHz(?) 


6 
pF 
Fe = 1 MHz(?) 


NOTES: 
1. This parameter is measured at: 
Address, Data, BEn 
4.0 mA 
Definition, Control 
5.0 mA 
2. This parameter is measured at: 
Address, Data BEn 
- 1.0 mA 
Definition, Control 
- 0.9 mA 
3. Typical supply current: 


Ice 380 mA @ CLK2 = 32 MHz (Normal Operation) 
440 mA @ CLK2 = 40 MHz 
480 mA @ CLK2 = 50 MHz 
4. This parameter is for inputs without pullups or pulldowns and 0 ,;; VIN ,;; Vee. 
5. This parameter is for inputs with pulldowns and VIH = 2.4V. 
6. This parameter is for inputs with pullups and VIL = 0.45V. 
? Not 100% tested. 
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Following is the power consumption of the Low Power Intel486 microprocessor or Intel487 SX Math CoProc- 
essor installed in a low power system for different frequencies. 


CLK2 Frequency 
Operating Frequency 
ICC(max) 
ICC(typ) 
(MHz) 
(MHz) 
(mA) 
(mA) 


4 
2 
150 
100 


16 
8 
325 
235 


32 
16 
525 
400 


40 
20 
600 
475 


50 
25 
700 
550 


Table 16.10. Power Supply Current (Icc) Values 
over Frequencies of Operation (PQFP Version) 


CLK2 Frequency 
Operating Frequency 
Icc (max) 
Icc (typ) 
(MHz) 
(MHz) 
(mA) 
(mA) 
4 
I 
2 
150 
100 


16 
8 
250 
210 


32 
16 
450 
380 


40 
20 
500 
440 


50 
25 
560 
480 


The following tables provide the A.C. specifications for the Low Power Intel486 SX microprocessors. They 
consist of output delays, input setup requirements and input hold requirements. All A.C. specifications are 
relative to the rising edge of the phase 1 of the input system clock (CLK2), unless otherwise specified. 


Table 16.11. Low Power Intel486™ SX-16 
MHz 
Microprocessorllntel487 
SX Math CoProcessor A.C. Characteristics 
Vcc = 5V ± 10%; Tease= O°Cto + 85°C; CL = 50 pF(2)unless otherwise specified 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


Frequency 
2 
16 
MHz 
Half of CLK2 Frequency 


t1 
CLK2 Period 
31 
250 
ns 
16.9 


t2 
CLK2 High Time 
10 
ns 
16.9 
At2V 


t3 
CLK2 Low Time 
10 
ns 
16.9 
AtO.8V 


t4 
CLK2 Fall Time 
4 
ns 
16.9 
2V to 0.8V 


t5 
CLK2 Rise Time 
4 
ns 
16.9 
0.8Vt02V 


ts 
A2-A31, PWT, PCD, BEO-3#, 
3 
26 
ns 
16.10 
M/IO#, 
D/C#, 
W/R#, ADS#, LOCK#, 
FERR#o, BREQ, HLDA Valid Delay 


t7 
A2-A31, PWT, PCD, BEO-3#, 
42 
ns 
16.10 
After Clock Edge(1) 
M/IO#, 
D/C#, 
W/R#, ADS#, 
LOCK# Float Delay 
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Microprocessor/intel487™ 
SX Math CoProcessor 
A.C. Characteristics 
(Continued) 
Vcc 
= 5V ±10%; 
Tease = O·C to + 85·C; CL = 50 pF(2) unless otherwise 
specified 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


ta 
PCHK# 
Valid Delay 
3 
35 
ns 
16.10 


taa 
BLAST #, PLOCK # Valid Delay 
3 
35 
ns 
16.10 


t9 
BLAST #, PLOCK # Float Delay 
42 
ns 
16.10 
After Clock Edgell) 


tl0 
00-031, 
OPO-3 Write Data Valid Delay 
3 
30 
ns 
16.10 


tll 
00-031, 
OPO-3 Write Data Float Delay 
42 
ns 
16.10 
After Clock Edgell) 


t12 
EAOS # Setup Time 
12 
ns 
16.11 


t13 
EAOS# 
Hold Time 
4 
ns 
16.11 


t14 
KEN#, 
BS16#, 
BS8# 
Setup Time 
12 
ns 
16.11 


t15 
KEN"#, BS16#, 
BS8# 
Hold Time 
4 
ns 
16.11 


t16 
ROY #, BROY # Setup Time 
12 
ns 
16.11 


t17 
ROY #, BROY # Hold Time 
4 
ns 
16.11 


t18 
HOLD, AHOLO, BOFF # Setup Time 
12 
ns 
16.11 


t19 
HOLD, AHOLO, BOFF # Hold Time 
4 
ns 
16.11 


t20 
RESET, FLUSH#, 
A20M#, 
NMI, 
14 
ns 
16.11 
INTR, IGNNE # * Setup Time 


t21 
RESET, FLUSH#, 
A20M#, 
NMI, 
4 
ns 
16.11 
INTR, IGNNE#* 
Hold Time 


t22 
00-031, 
OPO-3, A4-A31 
10 
ns 
16.11 
Read Setup Time 


t23 
00-031, 
DPO-3, 
A4-A31 
4 
ns 
16.11 
Read Hold Time 


CLKSEL 
See Figures 16.5 and 16.6 for details on this signal. 
Figure 16.6 shows minimum timings 
required for the 
proper operation 
of the CPU. The pulse on CLKSEL 
can be of any length as long as the minimums 
are 
satisfied 
and the transitions 
from low to high occurs 
at the clock edge shown. 


NOTES: 
1. Not 100% tested, guaranteed by design characterization. 
2. All timing specifications assume CL = 50 pF. 


Table 16.12. Low Power Intel486TMSX-20 
MHz 
Microprocessorlintel487™ SX Math CoProcessor A.C. Characteristics 
VCC = 5V ± 10 %; Tcase = O°C to + 85°C; CL = 50 pF(2) unless otherwise 
specified 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


Frequency 
2 
20 
MHz 
Half of CLK2 Frequency 


t1 
CLK2 Period 
25 
250 
ns 
16.9 


t2 
CLK2 High Time 
8.5 
ns 
16.9 
At2V 


t3 
CLK2 Low Time 
8.5 
ns 
16.9 
AtO.8V 
4 
CLK2 Fall Time 
3 
ns 
16.9 
2V to 0.8V 


ts 
CLK2 Rise Time 
3 
ns 
16.9 
0.8V to 2V . 


ttl 
A2-A31, 
PWT, PCD, BEO-3#, 
3 
23 
ns 
16.10 
MIlOI', 
D/C#, WIR#, 
ADS#, 
LOCK#, 
FERR#', 
BREQ, HLDA Valid Delay 


t7 
A2-A31, 
PWT, PCD, BEO-3#, 
37 
ns 
16.10 
After Clock Edge(1) 
MIlOI', 
D/C#, WIR#, 
ADS#, 
LOCK # Float Delay 


ta 
PCHK# 
Valid Delay 
3 
28 
ns 
16.10 


taa 
BLAST #, PLOCK # Valid Delay 
3 
28 
ns 
16.10 


t9 
BLAST # , PLOCKI' 
Float Delay 
37 
ns 
16.10 
After Clock Edge(1) 


t10 
00-031, 
DPO-3 Write Data Valid Delay 
3 
26 
ns 
16.10 


t11 
00-031, 
DPO-3 Write Data Float Delay 
37 
ns 
16.10 
After Clock Edge(1) 


t12 
EADS # Setup Time 
10 
ns 
16.11 


t13 
EADS# 
Hold Time 
4 
ns 
16.11 


t14 
KEN#, 
BS16#, 
BS8# 
Setup Time 
10 
ns 
16.11 


t1S 
KEN#, 
BS16#, 
BS8# 
Hold Time 
4 
ns 
16.11 


t16 
ROY #, BRDY # Setup Time 
10 
ns 
16.11 


t17 
ROY #, BRDY # Hold Time 
4 
ns 
16.11 


t1a 
HOLD, AHOLD, 
BOFF # Setup Time 
12 
ns 
16.11 


t19 
HOLD, AHOLD, 
BOFF # Hold Time 
4 
ns 
16.11 


t20 
RESET, FLUSH #, A20M #, NMI, 
12 
ns 
16.11 
INTR, IGNNE#' 
Setup Time 


t21 
RESET, FLUSH#, 
A20M#, 
NMI, 
4 
ns 
16.11 
INTR,IGNNE#' 
Hold Time 


t22 
00-031, 
DPO-3, 
A4-A31 
6 
ns 
16.11 
Read Setup Time 


t23 
00-031, 
DPO-3, 
A4-A31 
4 
ns 
16.11 
Read Hold Time 


NOTES: 
1. Not 100% tested, guaranteed by design characterization. 
2. All timing specifications assume CL = SOpF. 


• 


Table 16.13. Low Power Intel486™ 
SX-25 
MHz 
Microprocessorllntel487™ 
SX Math CoProcessor 
A.C. Characteristics 


Vcc = SV ± 10%; Tcase = O·C to +8S·C; 
CL = SO pF(2) unless otherwise 
specified 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


Frequency 
2 
2S 
MHz 
Half of CLK2 Frequency 


t1 
CLK2 Period 
20 
2S0 
ns 
16.9 


t2 
CLK2 High Time 
7 
ns 
16.9 
At2V 


t3 
CLK2 Low Time 
7 
ns 
16.9 
AtO.8V 


t4 
CLK2 Fall Time 
2 
ns 
16.9 
2V to 0.8V 


t5 
CLK2 Rise Time 
2 
ns 
16.9 
0.8Vt02V 


ts 
A2-A31,PVVT,PCD,BEO-3#, 
3 
19 
ns 
16.10 
M/IO#, 
D/C#, 
W/R#, 
ADS#, 
LOCK#, 
FERR#', 
BREQ, HLDA Valid Delay 


t7 
A2-A31, 
PWT, PCD, BEO-3#, 
28 
ns 
16.10 
After Clock Edge(1) 


M/IO#, 
D/C#, 
W/R#, 
ADS#, 
LOCK # Float Delay 


ta 
PCHK# 
Valid Delay 
3 
24 
ns 
16.10 


taa 
BLAST # , PLOCK# 
Valid Delay 
3 
24 
ns 
16.10 


t9 
BLAST #, PLOCK # Float Delay 
28 
ns 
16.10 
After Clock Edge(1) 


t10 
00-031, 
DPO-3 Write 
3 
20 
ns 
16.10 
Data Valid Delay 


t11 
00-031, 
DPO-3 Write 
28 
ns 
16.10 
After Clock Edge(1) 


Data Float Delay 


t12 
EADS # Setup Time 
9 
ns 
16.11 


t13 
EADS # Hold Time 
4 
ns 
16.11 


t14 
KEN#, 
BS16#, 
BS8# 
Setup Time 
9 
ns 
16.11 


t15 
KEN#, 
BS16#, 
BS8# 
Hold Time 
4 
ns 
16.11 


t16 
RDY#, 
BRDY# 
Setup Time 
9 
ns 
16.11 


t17 
RDY#, 
BRDY# 
Hold Time 
4 
ns 
16.11 


t1a 
HOLD, AHOLD, 
BOFF # Setup Time 
11 
ns 
16.11 


t19 
HOLD, AHOLD, 
BOFF # Kold Time 
4 
ns 
16.11 


t20 
RESET, FLUSH#, 
A20M#, 
NMI, 
11 
ns 
16.11 
INTR,IGNNE#' 
Setup Time 
, 


t21 
RESET, FLUSH#, 
A20M#, 
NMI, 
4 
ns 
16.11 
INTR, IGNNE #. 
Hold Time 


t22 
00-031, 
DPO-3, 
A4-A31 
6 
ns 
16.11 
Read Setup Time 


NOTES: 
1. Not 100% tested, guaranteed by design characterization. 
2. All timing specifications assume CL = 50 pF. 


Table 
16.13. Low Power 
Intel486TM 
SX-25 
MHz 
Microprocessorlintel487™ 
SX Math CoProcessor 
A.C. Characteristics 
(Continued) 


Vcc = SV ± 10%; Tcase = O·C to + 8S·C; CL = 50 pF(2) unless otherwise 
specified 


Symbol 
Parameter 
Min 
Max 
Unit 
Figure 
Notes 


t23 
00-031, 
OPO-3, A4-A31 
4 
ns 
2.3 
Read Hold Time 


CLKSEL 
See Figures 16.5 and 16.6 for details on this signal. 
Figure 16.6 shows minimum timings required for the 
proper operation 
of the CPU. The pulse on CLKSEL 
can be of any length as long as the minimums 
are 
satisfied 
and the transitions 
from low to high occurs at 


the clock edge shown. 


NOTES: 
1. Not 100% tested, guaranteed by design characterization. 
2. All timing specifications assume CL = 50 pF. 
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16.10 
Math Upgrade for Low Power 
Intel486™ 
SX Microprocessor 


The Intel487 SX Math CoProcessor (MCP) is the 
math upgrade for the Low Power Intel486 SX micro- 
processor. The Intel487 SX MCP is designed and 
tested to operate with an external 1X clock input 
(standard mode) when it is installed in an Intel486 
SX CPU system or with an external 2X clock input 
(low power mode) when it is installed in a Low Power 
Intel486 SX CPU based system. The Intel487 SX 
MCP is a super-set of the Intel486 SX CPU, contain- 
ing a floating point unit, in addition to all other on- 
chip units present in the Intel486 SX microproces- 
sor. 


The Floating Point Unit (FPU) performs floating point 
operations on the 32-, 64-, and 80-bit arithmetic for- 
mats specified in IEEE Standard 754. Like the inte- 
ger processing unit, the floating point unit architec- 
ture is binary-compatible with the 8087 and 80287 
Math CoProcessors. The architecture is 100% com- 
patible with the Intel287 and Intel387 Math CoProc- 
essors. 


The on-chip floating point unit of the Intel487 SX 
Math CoProcessor is similar to the FPU of the 


Intel486 OX CPU containing eight data registers, a 
tag word, a control register, a status register, an in- 
struction pointer and a data pointer. (For details on 
FP registers and instructions, see Section 2.1.3). 


Note that the Intel487 SX Math Coprocessor is the 
only upgrade available for the Low Power Intel486 
SX microprocessor based designs. It is fully compat- 
ible with the Low Power mode of the Intel486 SX 
CPU. However, the Intel OverDrive Processor does 
not work in systems based on the Low Power In- 
tel486 CPU. All address, data and control signals, 
including the external CPU clock input (CLK2) and 
the CLKSEL signal of the Low Power Intel486 SX 
CPU, must be tied to the corresponding signals of 
the Intel487 SX MCP (Le. the Intel486 SX CPU's 
CLK2 signal must be connected to the Intel487 SX 
MCP's CLK2 signal, and the Intel486 SX CPU's 
CLKSEL signal must be connected to the CLKSEL 
signal on the Intel487 SX MCP). Additionally, the 
performance upgrade circuit given in Section 6.6 
should be followed to provide the math upgrade ca- 
pability in Low Power Intel486 SX CPU systems. 


The D.C.!A.C. specifications given in Section 16.9 
apply to the Low Power Intel486 SX CPU and the 
Intel487 SX MCP when it is installed in a Low Power 
Intel486 SX CPU system. 
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E 
D 
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0 
0 
0 
0 
0 
1 
"27 
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0 
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NC 
FERR" 
A8 
NC 
14 
0 
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0 
0 
0 
0 
14 
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A2 
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NC 
Vcc 
MP" 
15 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
/0 
0 
0 
0 
0 
0 
15 
A6 
BREa 
LOCK" 
PWT 
BE2" 
NC 
HOLD 
FLUSH- 
IGNNE" 
A3 
HLOA 
o/c" 
BEO" 
BROY" 
KEN" 
A20M" 
NMI 
16 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
16 
A4 
NC 
MilO" 
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RESET 
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BLAST- 
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17 
0 
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A 
B 
C 
D 
E 
F 
G 
H 
J 
K 
L 
M 
N 
P 
Q 
R 
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1 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
1 


020 
011 
VSS 
VSS 
Vee 
Vss 
02 
A31 
A27 


019 
09 
OPl 
Vss 
Vss 
Vss 
DO 
A28 
2 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
2 


022 
018 
Vee 
Vee 
05 
06 
01 
Vss 
A26 


021 
013 
08 
03 
Vee 
Vee 
A29 
A25 
3 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
3 


CLKSEL 
eLK2 
010 
012 
016 
07 
OPO 
All 
A23 


Vss 
017 
015 
OP2 
014 
04 
A30 
vee 
4 
0 
0 
0 
0 
0 
0 
0 
4 


023 
Vee 
KEV 
A19 
NC 
Vss 
vss 
5 
0 
0 
0 
0 
0 
0 
5 


OP3 
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A21 
A14 


vss 
A18 


6 
0 
0 
0 
0 
0 
0 
6 


024 
027 
A24 
Vss 
025 
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7 
0 
0 
0 
0 
0 
0 
7 
vss 
026 
A22 
A12 


Vee 
A15 
8 
0 
0 
0 
0 
0 
0 
8 


029 
028 
A20 
Vss 
031 
Intel487™ 
SX Mep 


vee 


9 
0 
0 
0 
0 
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0 
9 
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A16 
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10 
0 
0 
0 
0 
0 
0 
10 
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0 
11 
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vee 


12 
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0 
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12 
NC 
NC 
A5 
vss 
NC 
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13 
0 
0 
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0 
0 
0 
13 


FERRI' 
NC 
A7 
Al0 


NC 
A8 
14 
0 
0 
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0 
0 
0 
14 


NC 
NC 
A2 
vss 
t.4P- 
Vcc 
15 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
15 
IGNNE- 
FLUSH- 
HOLD 
NC 
BE2- 
PWT 


O/C- 
LOCK- 
BREQ 
A6 


Nt.41 
A20t.4- 
KEN- 
BROV- 
BED- 
HLOA 
A3 
16 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
16 
INTR 
RESET 
Vee 
Vee 
BEll' 
Vee 
t.4/10- 
PLOCKI' 
A4 


NC 
BS8- 
ROV- 
Vee 
Vee 
Vee 
Vee 
BLAST- 


17 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
w1R- 
0 
0 
0 
0 
17 


AHOLO 
BSI6- 
vss 
vss 
pco 
vss 
PCHK- 
AOS# 


EAOS- 
BOFF- 
BE3- 
vss 
vss 
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vss 
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16.10.2 
PIN REFERENCE 
OF Intel487™ 
SX Math CoProcessor 


Table 
16.14. Pin Cross Reference 
by Pin Name 


Address 
Data 
Control 
N/C 
Vcc 
Vss 


A2 
014 
00 
P1 
A20M# 
015 
A10 
87 
A7 
A3 
R15 
01 
N2 
A05# 
517 
A12 
89 
A9 
A4 
516 
02 
N1 
AHOLO 
A17 
A14 
811 
A11 
A5 
012 
03 
H2 
8EO# 
K15 
810 
C4 
83 
~ 
515 
04 
M3 
8E1# 
J16 
812 
C5 
84 
A7 
013 
05 
J2 
8E2# 
J15 
813 
E2 
85 
A8 
R13 
06 
L2 
8E3# 
F17 
816 
E16 
E1 
A9 
011 
07 
L3 
8LA5T# 
R16 
C10 
G2 
E17 
A10 
513 
08 
F2 
80FF# 
017 
C11 
G16 
G1 
A11 
R12 
09 
01 
8ROY# 
H15 
C12 
H16 
G17 
A12 
57 
010 
E3 
8REO 
015 
C13 
J1 
H1 
A13 
010 
011 
C1 
85B# 
016 
C14 
K2 
H17 
A14 
55 
012 
G3 
8516# 
C17 
G15 
K16 
K1 
A15 
R7 
013 
02 
CLK2 
C3 
R17 
L16 
K17 
A16 
09 
014 
K3 
CLK5EL 
A3 
54 
M2 
L1 
A17 
03 
015 
F3 
O/C# 
M15 
04 
M16 
L17 
A18 
R5 
016 
J3 
OPO 
N3 
P16 
M1 


A19 
04 
017 
03 
OP1 
F1 
. 
R3 
M17 
A20 
OB 
018 
C2 
OP2 
H3 
R6 
P17 
A21 
05 
019 
81 
OP3 
A5 
RB 
02 
A22 
07 
020 
A1 
EA05# 
817 
R9 
R4 
A23 
53 
021 
82 
FERR# 
A13 
R10 
56 
A24 
06 
022 
A2 
FLU5H# 
C15 
R11 
5B 
A25 
R2 
023 
A4 
HLOA 
P15 
R14 
59 
A26 
52 
024 
A6 
HOLO 
E15 
510 
A27 
51 
025 
86 
IGNNE# 
A15 
511 


A28 
R1 
026 
C7 
INTR 
A16 
512 
A29 
P2 
027 
C6 
KEN# 
F15 
514 
A30 
P3 
028 
CB 
LOCK# 
N15 
A31 
01 
029 
AB 
M/IO# 
N16 
030 
C9 
MP# 
814 
031 
8B 
NMI 
815 
PCO 
J17 
PCHK# 
017 
PWT 
L15 
PLOCK# 
016 
ROY# 
F16 
RE5ET 
C16 
W/R# 
N17 


• 


The Intel487 
SX Math CoProcessor 
consists 
of all the Intel486 
SX microprocessor 
signals with the following 
additional 
pins. 


NUMERIC 
ERROR REPORTING 


FERR# 
0 
The Floating 
point error pin is driven active when a floating 
point error occurs. 
FERR # is 
similar to the ERROR # pin on the i387 Math CoProcessor. 
FERR # is included 
for 


compatibility 
with systems 
using DOS type floating 
point error reporting. 
FERR # is active 
LOW, and is not floated 
CONT1 g bus hold. 


IGNNE# 
I 
When the Ignore numeric error pin is asserted, 
the Low Power Intel487 SX Math 


CoProcessor 
will ignore a numeric error and continue 
executing 
non-control 
floating 
point 
instructions. 
When IGNNE# 
is deasserted 
the Low Power Intel487 SX Math CoProcessor 
will freeze on a non-control 
floating 
point instruction, 
if a previous floating 
point instruction 
caused an error. IGNNE # has no effect when the NE bit in control 
register 0 is set. IGNNE # 


is active LOW and is provided with a small internal pullup resistor. 
IGNNE # is asynchronous 
but setup and hold times t20 and t21 must be met to insure recognition 
on any specific 
clock. 


Intel487™ 
SX Math CoProcessor 
INTERFACE 


MP# 
0 
The math present 
pin is used to signal the Intel486 SX microprocessor 
to float its outputs 
and get-off the bus. This pin can be used to check the presence 
of the Math CoProcessor 
in 
the two socket math upgrade circuit. It is active low and is never floated. 
MP# 
is driven low 
at power-up 
and remains active for the entire duration 
of the Intel487™ 
SX Math 


CoProcessor 
operation. 


KEY PIN 


KEY 
The KEY pin is an electrically 
non-functional 
pin which is used to insure the correct 
Intel487 
SX Math CoProcessor 
orientation 
in a 169-pin socket. 
KEY pin is located at "04" 
and is the 
169th pin of the Intel487 SX MCP. 


17.0 
SUGGESTED 
SOURCES 
FOR 
Intel486™ 
SX 
MICROPROCESSOR/Intel 
OverDrive™ 
PROCESSOR 
ACCESSORIES 


Following 
are some suggested 
sources 
of accesso- 
ries for the Intel486 
SX microprocessorllntel 
Over- 
Drive 
Processor. 
They 
are not an endorsement 
of 
any kind, nor Ii warranty 
of the performance 
of any 
of the listed products 
and/or 
companies. 


Sockets 


1. McKenzie 
Technology 
44370 
Old Palmspring 
Blvd. 
Fremont, 
CA 
94538 
Tel: (415) 651-2700 


2. 
E-CAM Technology, 
Inc. 
14455 North Hayden 
Rd. 


Suite 208 
Scottsdale, 
AZ 
85260 
Tel: (602) 443-1949 


3. Augat 
Inc. (for sockets 
with decaps) 
Interconnection 
Products 
Group 
33 Perry Ave. 
P.O. Box 779 
Attleboro, 
MA 
02703 
Tel: (508) 222-2202 


OverDrive Processor Socket for the Intel 
OverDrive Processor 


1. Amp Incorporated 
P.O. Box 3608 
Harrisburg, 
PA 17105-3608 
Tel: (800) 522-6752 


2. Thomas 
& Betts 
200 Executive 
Center 
Dr. 
P.O. Box 24901 
Greenville, 
SC 29616-2401 
Tel: (201) 685-1600 


3. Yamaichi 
Electronics 
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Inc. 


2021 West Valley View Lane 
Dallas, TX 
75381-0839 
Tel: (214) 243-4321 


2. E G & G Division 
60 Audubon 
Road 
Wakefield, 
MA 
01880 
Tel: (617) 245-5900 
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• 


Revision -004 of the Intel486 SX Microprocessor 
Data Book contains many updates and improve- 
ments to the original version. A revision summary of 
major changes is listed below: 


The sections significantly revised since version -001 
are: 
Cover Page 


Table 1.1c & 
Table 1.1d 
Quick Pin 
Reference 


Component & 
Revision 10 


Added 
Intel486 
SX 
CPU 
PQFP 
package information. 


Corrected the Intel486 SX CPU pin- 
out. Pin 06 at location F2 has been 
changed to pin 08. 
Corrected the Intel487 SX MCP pin- 
out. Pin 06 at location F2 has been 
changed to pin 08. 
Corrected the Intel486 SX CPU pin- 
out. Pin 06 at location F2 has been 
changed to pin 08. 
Corrected the Intel487 SX MCP pin- 
out. Pin 06 at location F2 has been 
changed to pin 08. 
Added Intel486 SX CPU PQFP pin- 
out. 


Added 
Intel486 
SX 
CPU 
PQFP 
package Pin Cross Reference table. 
The PWT and PCD functional de- 
scription has been clarified. New 
signals for only the Intel486 SX CPU 
PQFP package have been added. 
The MP'" 
pin has been added to 
the input pin list for the Intel487 SX 
MCP. 
The UP", pin has been added to the 
input pin list for the Intel486 SX CPU 
PQFP package. 
Added a new pin table for the In- 
tel486 SX CPU PQFP package test 
pins. 
Added note about Component 10 
and Revision 10 numbers for the In- 
tel486 SX CPU and Intel487 SX 
MCP. 
. 


Section 2.0 
A description of the features of the 
Intel486 SX CPU in the PQFP pack- 
age has been added. 
Section 6.2.9 
Added description of HOLD recogni- 
tion during BOFF"'. 
Section 6.2.12 Added PCD and PWT description 
when paging disabled. 
Section 6.2.15 Corrected explanation of A20M'" 
sampling during reset. 


intel~ 


Section 6.2.16 
Added section on Performance 
Upgrade support signal descrip- 
tion for 
the 
Intel486 SX CPU 
PQFP package. 


Section 6.2.17 
Added section on Boundary Scan 
Test Signals for the Intel486 SX 
CPU PQFP package. 


Figure 6.4 
Added additional details on signal 
sampling during RESET. 
Figure 6.5.1 
Added details about FERR'" sig- 
nal. 


Figure 6.6 
Corrected Intel487 SX MCP sig- 
nal name on the Performance Up- 
grade circuit. The HOLD signal 
from the Intel487 SX MCP should 
be connected to the HOLD signal 
from the Intel486 SX CPU PGA 
package; the DATA signal from 
the Intel487 SX MCP should NOT 
be connected to the HOLD signal 
from the Intel486 SX CPU PGA 
package. 
Figure 6.7 
Added a Performance Upgrade 
circuit for the Intel486 SX CPU in 
the PQFP package. 
Figure 7.30 
Added HOLD to state transition 
between Tb and T1b. 


Section 8.1 
Corrected 
description 
of 
how 
BIST is initiated. 


Section 8.2.2 
Added information about moves 
to TR4 and TR5. 


Section 8.4 
Corrected description of how the 
Tri-State Output Test Mode is ini- 
tiated. 
Section 8.5 
Section 8.5 describes the Bound- 
ary Scan feature of the PQFPver- 
sion of the Intel486 SX CPU. 
Table 13.2 
16 MHz and 25 MHz D.C. specifi- 
cations were added for the In- 
tel486 SX CPU and for the PGA 
package. 
Table 13.3 
Table 12.3 is a new table which 
shows the 16 MHz, 20 MHz, and 
25 MHz D.C. specifications for the 
Intel486 SX CPU for the PQFP 
package. The maximum and typi- 
cal Ice specification for the 20 
MHz Intel487 SX MCP has been 
reduced. 
Table 13.4 
16 MHz and 25 MHz D.C. specifi- 
cations were added for the In- 
tel487 SX MCP. 


Table 13.5 
Added table with 16 MHz A.C. 
specifications for the Intel486 SX 
CPU ~nd Intel487 SX MCP. 


intel~ 


Table 13.6 
Added improved 20 MHz A.C. speci- 
fications for T6 through T11 for the 
Intel486 SX CPU and Intel487 SX 
MCP. 
Minimum 
frequency 
was 
changed from 16 MHz to 8 MHz. 
Table 13.7 
Added table with 25 MHz A.C. spec- 
ifications for the Intel486 SX CPU 
and Intel487 SX MCP. 


Section 13.4.2 Added Derating curves for Intel486 
SX CPU in PQFP. 
Figure 14.3a 
Figures 13.3a, 13.3b, 13.3c and Ta- 
& Figure 14.3b ble 13.2 were added to show the 
& Figure 14.3c specifications for the 196 pin PQFP 
& Table 14.2 
package. 
Table 14.4 
Table 13.4 was added to provide 
the eJA and eJC values for the 196 
pin PQFP package. 
Table 14.5 & 
Corrected maximum TA tables for 
Table 14.6 
the Intel486 SX CPU for the PGA 
package and the Intel487 SX MCP. 
Table 14.7 
Table 13.7 was added to show the 
maximum TA values for the Intel486 
SX CPU for the 196 pin PQFP pack- 
age. 
Section 15.0 
Corrected address for AMP Incorpo- 
rated. 
Appendix B 
A FINIT instruction was added to 
the "restore_EFLAGS" 
section of 
the Intel Recommended CPU Identi- 
fication Code. 


Intel OverDrive Processor information/specifications 
have been added throughout the document, includ- 
ing 33 MHz specifications for the OverDrive Proces- 
sor. Section 8.0 contains OverDrive Processor spe- 
cific information. 


A new section was added containing a summary of 
differences between the Intel486 SX Microprocessor 
in PGA and PQFP packages. 


Intel OverDrive Processor information/specifications 
have been added throughout the document. Section 
13.0 contains OverDrive Processor specific informa- 
tion. 


33 MHz Intel486 SX Microprocessor specifications 
have been added throughout the document. 


Low Power Intel486 OX CPU information/specifica- 
tions have been added. Section 16.0 contains Low 
Power specific information. 
Cover Page 
Listed boundary scan compatibility 
feature for PQFP package. 


Pin Description 
Table 1.1d 


Noted the pinout and pin descrip- 
tion for the Intel487 SX MCP are 
identical to the Intel OverDrive 
Processor except for the MP# pin 
is redefined as the UP# pin on the 
Intel OverDrive Prol,,"M-c;or. 


More clearly defined A20M" 
~ 
Corrected Bus Request instruction 
from BREQ# to BREQ. 
Added BOstepping information for 
Intel486 SX microprocessor along 
with 
OverDrive Processor step- 
ping information. 
Added stepping info for Intel487 
SX MCP and OverDrive Proces- 
sor. 
Added number of clocks required 
for BIST at 25 MHz. 
Removed FERR# as a control pin 
for MISCCTL. 


, New chapter on the differences 
with the Intel486 SX Microproces- 
sor in PGA vs PQFP. 
New chapter on OverDrive Proc- 
essor. 
Added D.C.specs for Intel 33 MHz 
486 SX microprocessor. 
Changed 
Hold 
times 
for 
Intel 
16 MHz 486 SX microprocessor 4 
ns to 3 ns. 
Added A.C. specs for Intel486™ 
SX 33 MHz for PGA and PQFP 
packages. 
Added Boundary Scan Test sig- 
nals and timing diagrams. 
New 
section 
for 
Low 
Power 
Intel486 SX microprocessor. 


The sections significantly revised since version -003 
are: 
Section 6.5 
Added clarification for the built in 
self test (BIST) during reset. 
Added explanation of bus hold 
and hold acknowledge protocol. 
Added figure to illustrate HOLD re- 
quest 
acknowledge 
during 
BOFF#. 


Changed PGA Icc Max specifica- 
tions for 16 MHz and 33 MHz. 
Changed PQFP Icc Max specifica- 
tions for all frequencies. 
Changed Max specification of tg 
BLAST# , PLOCK# Float Delay. 


• 


Table 15.7 and 
Table 15.8 


Changed Min and Max specifica- 
tions for Boundary Scan Test in- 
puts. 


Changed maximum ambient tem- 
perature specifications for 16 MHz, 
and 33 MHz CPUs in PGA pack- 
age. 


Changed maximum ambient tem- 
perature specifications for all fre- 
quencies of CPUs in PQFP pack- 
age. 


OverDrive™ 
PROCESSOR 
INSTAllATION 
MADE EASY 


OEM PC manufacturers can make OverDrive Processor installation easier for the end user and reseller. 
Simple and foolproof OverDrive Processor installation results in fewer broken PC boards and less end user 
and reseller frustration. The following is a brief list of ways to make OverDrive Processor installation simple 
and foolproof for the end user and reseller: 
1. Intel has packaged the Intel487 SX Math CoProcessor in a 169 pin PGA package. The 169th pin insures 
that the Intel487 SX Math CoProcessor fits into a 169 pin socket in only the correct orientation. Supplying a 
169 pin socket as the OverDrive Processor eliminates the possibility of damaging the PC board due to end 
users and resellers powering up the system with the OverDrive Processor in an incorrect orientation(1). 


2. Because of the high pin count of the Intel487 SX Math CoProcessor package, the insertion force required • 
for a standard PGA socket is excessive. Even most Low Insertion Force sockets require approximately 
30 Ibs. (13.5 kg) of insertion force. A Zero Insertion Force socket insures that the OverDrive Processor 
installation insertion force does not damage the PC board. Be sure to allow enough clearance for the 
socket handle when a Zero Insertion Force socket is used. 
3. Make the OverDrive Processor socket easily accessible to the end user. (Le., Do not place the OverDrive 
Processor socket under a disk drive.) 
4. Label the Intel487 SX Math CoProcessor socket and the location of pin 1 by silk screening this information 
on the PC board. See Figure A. 
5. Describe the Intel487 SX Math CoProcessor installation procedure in the PC's User's Manual. 


NOTE: 
1. Many socket 
manufacturers 
use a standard 
grid array for PGA sockets. 
The socket 
cover contains 
more than the required 
number 
of pin holes. 
but contacts 
are only 
loaded 
where 
required 
for electrical 
connection. 
For the 
169th 
pin to assist 
the. 
end user and reseller 
in installing 
the chip with the correct 
orientation. 
it is essential 
that the socket 
have only 
169 pin holes 
in the cover. 
The only sure way to ascertain 
that a socket 
has only 169 pin holes is to order samples 
and do a visual 
check. 


C56 
1i:iJ- 1i:iJ- Cb6 
Intel OverDrive 
TM 
Processor 
Socket 


Intel 


C64 
= 


Pin 1 


---- 


Figure A. Example of PC Board with "'ntel OverDrive™ Processor Socket" 


and "Pin 1" Silk Screened onto PC Board 


APPENDIX 
B 
INTEL RECOMMENDED 
CPU IDENTIFICATION 
CODE 


The CPU identification 
assembly 
code will determine 
for the user which Intel microprocessor 
and if a Intel Math 
CoProcessor 
is installed 
in the system. 
If a 486 microprocessor 
has been installed, 
the program 
will determine 


if the CPU is with/without 
a floating 
point unit. This code should be executed 
so the system can be configured 
for a particular 
application, 
which 
may depend 
on the microprocessor 
and Math CoProcessor 
installed 
in the 


system. 


TITLE 
CPUID 
DOSSEG 
.model 
small 


•data 


fp_status 
dw 
id_mess 
db 


fp_8087 
db 
fp_80287 
db 
fp_80387 
db 
c8086 
db 
c286 
db 
c386 
db 
c486 
db 


c486nfp 
db 
period 
db 
present_86 
dw 
present_286 
dw 
present_386 
dw 
present_486 
dw 


•• 
? 
"This 
system 
has 
a$" 


"and an 8087 
Math 
CoProcessor$" 


"and an 287 Math 
CoProcessor$" 
"and an 387 Math 
CoProcessor$" 
"n8086/8088 
microprocessor$" 
"n80286 
microprocessor$" 
"386 microprocessor$" 
"486 DX microprocesser/487 
SX Math 
"CoProcessor$" 
"486 SX Microprocesser$" 
".$",13,10 
o 
o 
o 
o 


The purpose 
of this 
code 
is to allow 
the user 
the ability 
to 
identify 


the processor 
and 
coprocesor 
that 
is currently 
in the 
system. 
The 


algorithm 
of the program 
is to 
first 
determine 
the processor 
id. 


When 
that 
is accomplished, 
the program 
continues 
to then 
identify 


whether 
a coprocessor 
exists 
in the 
system. 
If a coprocessor 
or 


integrated 
coprocessor 
exists, 
the program 
will 
identify 
the 


coprocessor 
id. If one does not 
exist, 
the program 
then 
terminates. 


mov 
mov 
int 


ax,@data 
ds,ax 


dx,offset 
id_mess 
ah,9h 
21h 


8086 check 
Bits 12-15 
are always set on the 8086 processor. 


pushf 
pop 
mov 
and 
push 
popf 
pushf 
pop 
and 
cmp 
mov 
mov 
je 


bx 
aX,Offfh 
aX,bx 
ax 


ax 
aX,OfOOOh 
aX,OfOOOh 
dX,offset 
c8086 
present_86,l 
check_fpu 


save EFLAGS 
store EFLAGS in BX 
clear bits 12-15 
in EFLAGS 
store new EFLAGS value on stack 
replace current EFLAGS value 
set new EFLAGS 
store new EFLAGS in AX 
if bits 12-15 
are set, then CPU 
is an 8086/8088 
store 8086/8088 message 
turn on 8086/8088 
flag 
if CPU is 8086/8088, 
check for 
8087 


80286 CPU Check 
Bits 12-15 
are always clear on the 80286 processor. 


or 
push 
popf 
pushf 
pop 
and 
mov 
mov 
mov 
jz 


bX,OfOOOh 
bx 


ax 
aX,OfOOOh 
dX,offset 
c286 
present_86,O 
present_286,l 
check_fpu 


386 CPU check 
The AC bit, bit #18, is a new 
on the 486 DX CPU to generate 
on the 486 DX CPU, but not on 


if bits 12-15 
are cleared, then 
CPU is an 80286 
turn off 8086/8088 
flag 
turn on 80286 flag 
if CPU is 80286, check for 80287 


bit introduced 
in the EFLAGS register 
alignment 
faults. This bit can be set 
the 386 CPU. 


mov 
bx,sp 
save current stack pointer 
to 
align it 
and 
sp,not 3 
align stack to avoid AC fault 
db 
66h 
pushf 
push original EFLAGS 
db 
66h 
pop 
ax 
get original EFLAGS 
db 
66h 
mov 
cX,ax 
save original EFLAGS 
db 
66h 
xor EAX,40000h 
xor 
ax,O 
flip AC bit in EFLAGS 
dw 
4 
upper 16-bits of xor constant 
db 
66h 
push 
ax 
save for EFLAGS 
db 
66h 
popf 
copy to EFLAGS 


db 
66h 
pushf 
db 
66h 
pop 
ax 
db 
66h 
xor 
ax,cx 


mov 
dx,offset 
c386 
mov 
present_286,0 
mov 
present_386,1 
je 
check_fpu 


if AC 
bit 
cannot 
be 
changed, 
CPU 
is 
store 
386 message 
turn 
off 
80286 
flag 
turn 
on 386 
flag 
if CPU 
is 386, 
now 
check 
for 
80287/80387 


mov 
mov 
mov 


dx,offset 
c486nfp 
present_386,0 
present_486,1 


store 
486NFP 
message 
turn 
off 
386 
flag 
turn 
on 486 
flag 


Co-processor 
checking 
begins 
here 
for the 
8086/80286/386 
CPUs. 
The 
algorithm 
is to determine 
whether 
or not 
the 
floating-point 
status 
and 
control 
words 
can. be written 
to, 
the 
correct 
coprocessor 
is then 
determined 
depending 
on the 
pro cesser 
ide 
Coprocessor 
checks 
are 
first 
performed 
for 
an 8086, 
80286 
and 
a 486 
DX 
CPU. 
If the 
coprocessor 
id is still 
undetermined, 
the 
system 
must 
contain 
a 386 
CPU. 
The 
386 
CPU may 
work 
with 
either 
an 80287 
or an 80387. 
The 
infinity 
of the 
coprocessor 
must 
be 
checked 
to determine 
the 
correct 
coprocessor 
ide 


check 
for 
8087/80287/80387 
reset 
FP 
status 
word 
initialize 
temp 
word 
to non-zero 
value 
save 
FP 
status 
word 
check 
FP 
status 
word 
see 
if correct 
status 
with 
written 
jump 
if not 
Valid, 
no NPX 
installed 


save 
FP 
control 
word 
check 
FP 
control 
word 
see 
if selected 
parts 
looks 
OK 
check 
that 
ones 
and 
zeroes 
correctly 
read 
jump 
if not 
Valid, 
no NPX 
installed 


check 
if 486 
flag 
is on 
if 
so, 
jump 
to print 
486 
message 
else 
continue 
with 
386 
checking 


fninit 
mov 
fp_status,5a5ah 


fnstsw 
fp_status 
mov 
ax,fp_status 
cmp 
al,O 


fnstcw 
mov 
and 
cmp 


fp_status 
ax,fp_status 
ax,103fh 
ax,3fh 


cmp 
je 
jmp 


present_486,1. 
is_486 
not_486 


mov 
jmp 
dx,offset 
c486 
print_one 


• 


cmp 
jne 
present_386,l 
pr~nt_87_287 


mov 
int 
ah,9h 
2lh 


fldz 
fdiv 
fld 
st 
fchs 
fcompp 


fstsw 
fp_status 
mov 
aX,fp_status 
mov 
dX,offset 
fp_80287 
sahf 
jz 
restore_EFLAGS 
mov 
dX,offset 
fp_80387 


restore_EFLAGS: 
finit 
mov 
ah,9h 
int 
2lh 
db 
66h 
push 
cx 
db 
66h 
popf 
mov 
sp,bx 
jmp 
exit 


print_one: 
mov 
ah,9h 
int 
2lh 
jmp 
exit 


print-87_287: 
mov 
ah,9h 
int 
2lh 
cmp 
present_86,l 
mov 
dX,offset 
fp_8087 
je 
print_fpu 
mov 
dX,offset 
fp_80287 


print_fpu: 
mov 
ah,9h 
int 
2lh 
jmp 
exit 


exit: 
mov 
dX,offset 
period 
mov 
ah,9h 
int 
2lh 


mov 
aX,4cOOh 
int 
2lh 


end 
start 


2-706 


check 
if 386 
flag 
is on 
if 386 
flag 
not 
on, 
check 
NPX 
for 
8086/8088/80286 
print 
out 
386 
CPU 
ID first 


must 
use 
default 
control 
from 
FNINIT 
form 
infinity 
8087/80287 
says 
+inf = inf 
form 
negative 
infinity 
80387 
says + inf 
< > 
-inf 
see 
if they 
are 
the 
same 
and 
remove 
them 
look 
at 
status 
from 
FCOMPP 


store 
80287 
message 
see 
if infinities 
matched 
jump 
if 8087/80287 
is present 
store 
80387 
message 


clear 
any 
pending 
fp exception 
print 
NPX 
message 


restore 
original 
EFLAGS 
register 
restore 
original 
stack 
pointer 


if 8086/8088 
flag 
is on 
store 
8087 
message 


else 
CPU = 80286, 
store 
80287 
message 


APPENDIX 
C 
3.3V Intel486™ 
SX 
MICROPROCESSOR 


• 
Lower Supply Voltage Required 
- 
Operates from 3.0V to 3.6V 
- 
50% Lower Power than 5V CPU 
- 
Lower Power Consumption Produces 
Less Heat 
- 
Longer Battery Life for Portable 
Applications 


• 
Binary Compatible with Large Software 
Base 
-MS-DOS', 
OS/2", 
Windows' 


- 
UNIX'" 
System V/386 
-iRMX®, 
iRMK Kernels 


• 
High Integration Enables On-Chip 
- 
8 Kbyte Code and Data Cache 
- 
Paged, Virtual Memory Management 


• 
Easy To Use 
- 
Built in Self Test 
-Intel 
Software Support 
- 
Extensive Third Party Software 
Support 


• 
196-Lead PQFP Package 


• 
High Performance Design 
-lntel486 
One Clock Instruction Core 
-25 
MHz, 20 MHz, and 16 MHz Clock 
Frequencies 
- 
80 Mbytes/sec Burst Bus at 25 MHz 
- 
CHMOS V Process Technology 
- 
Dynamic Bus Sizing for 8-, 16-, 
32-Bit Busses 


• 
Complete 32-Bit Architecture 
• 
- 
Address and Data Busses 
-Registers 
- 
8-, 16-, 32-Bit Data Types 


• 
IEEE 1149.1 Boundary Scan 
Compatibility 
- 
F:orSurface Mount Production 
Testing 


The 3.3V 
Intel486 
SX microprocessor 
provides 
a low-cost 
entry 
point 
to powerful 
Intel486 
microprocessor 


based 
portable 
computing. 
The Intel486 
SX microprocessor 
is a binary compatible 
derivative 
of the Intel486 
DX microprocessor, 
giving it access 
to the $50 billion 
installed 
software 
base of over 50,000 
MS-DOS, 
Win- 


dows, OS/2, 
and UNIX system 
V/386 
applications. 
The Intel486 
SX microprocessor 
has the same integrated 


RISC integer core, 8 Kbyte cache memory, 
and memory 
management 
unit as the Intel486 
DX microprocessor. 


The high-performance 
RISC integer core of the Intel486 
SX microprocessor 
executes 
frequently-used 
instruc- 
tions in one clock cycle. An 8 Kbyte unified code and data cache allow this performance 
level to be sustained. 


An 80 Mbyte/sec 
burst bus at 25 MHz ensures 
high system 
throughput 
even with inexpensive 
DRAMs. 
The 
25 MHz Intel486 SX microprocessor 
has a Norton SI V5.0 rating of 54.1, a Dhrystone 
MIPS rating of 20.1 and a 


13.3 SPEC Integer rating. It provides 
up to 70% greater 
performance 
than a 33 MHz Intel386TM DX microproc- 


essor with an external 
cache 
(depending 
on the application) 
at a lower system 
cost. 


This documentation 
lists the A.C. and D.C. specifications 
of the 3.3V Intel486 
SX microprocessor 
and is a 


supplement 
to the Intel486TM SX Microprocessor/lntel487™ 
SX Math Coprocessor 
Data Book (Order 
Num- 
ber 240950-002). 


i386, Inte1386, 1387, Inte1387, 1486, Intel486, 
1487, Intel487 
are trademarks 
of Intel Corp. 
'MS-DOS 
and Windows 
are trademarks 
of Microsoft 
Corp. 
"OS/2 
is a trademark 
of IBM. 
'''UNIX 
is a registered 
trademark 
of UNIX Systems 
Labs. 


The 
ratings 
listed 
in this section 
are stress 
ratings 
only, 
and functional 
operation 
at the 
maximums 
is 
not guaranteed. 
Functional 
operating 
conditions 
are 
given in the D.C. Specifications 
and A.C. Specifica- 
tions sections. 


Extended 
exposure 
to the Maximum 
Ratings may af- 
fect device 
reliability. 
Furthermore, 
although 
the In- 
tel486 
SX microprocessor 
contains 
protective 
cir- 
cuitry to resist damage from static electric 
discharge, 
always take precautions 
to avoid high static voltages 
or electric 
fields. 


Case Temperature 
under Bias 


Storage Temperature 


Voltage 
on Any Pin with 
Respect to Ground 


Supply Voltage 
with 
Respect 
to Vss 


D.C. SPECIFICATIONS 
Functional 
operating 
range: VCC = 3.3V 
±0.3V; 
TCASE = O°C to +85°C 


3.3V Intel486™ 
SX Microprocessor 
D.C. Parametric 
Values 


Symbol 
Parameter 
Min 
Max 
Units 


VIL 
Input lOW 
Voltage 
-0.3 
+ 0.8 
V 


VIH 
Input HIGH Voltage 
2.0 
VCC + 0.3 
V 


VOL 
Output 
lOW 
Voltage 
IOL = 4/5 mA 
IOL = 0.1 mA 


VOH 
Output 
HIGH Voltage 
IOH = -1/0.9 
mA 
IOH = -0.1 
mA 


Icc 
Supply Current 
ClK 
= 8 MHz 
elK 
= 16 MHz 
ClK 
= 20 MHz 
ClK 
= 25 MHz 


III 


IIH 


IlL 


ILO 


CIN 


COUT 


CCLK 


NOTES: 
1. This parameter is measured at: 


Address, Data, BEn 
4.0 mA 
Definition, Control 
5.0 mA 
2. This parameter is measured at: 


Address, Data, BEn 
- 1.0 mA 
Definition, Control 
-0.9 mA 


V 
V 


V 
V 


mA 
mA 
mA 
mA 


±15 
J.LA 


200 
J.LA 


-400 
J.LA 


±15 
J.LA 


10 
pF 


10 
pF 


6 
pF 


-65°C 
to + 110°C 


- 65°C to + 150°C 


(Note 4) 


(Note 5) 


(Note 6) 


FC = 1 MHz(7) 


FC = 1 MHz(7) 


FC = 1 MHz(7) 


3. Typical supply current (3.3V): 
ICC @ 
8 MHz = 90 mA 


@ 16 MHz = 180 mA 


@ 20 MHz = 210 mA 


@ 25 MHz = 250 mA 
4. This parameter is for inputs without pull ups or pull 
downs and OV ~ VIN ~ Vcc. 
5. This parameter is for inputs with pull downs and 
VIH = 2.4V. 
6. This parameter is for inputs with pull ups and 
VIL = 0.45V. 
7. Not 100% tested. 


intel~ 


A.C. specifications measurement is defined by the 
timing diagrams in the base Intel486 SX documenta- 
tion (refer to the cover page of this document for the 
order number). The Boundary Scan (JTAG) timing 
waveforms are included in this document following 
the A.C. Specifications section. 


The A.C. Specifications given in the following tables 
consist of output delays, input setup requirements 
and input hold requirementts. All A.C. specifications 
are relative to the rising edge of the ClK signal. 


16 MHz 3.3V Intel486™ 
SX Microprocessor 
A.C. Characteristics 


vcc = 3.3V ± 0.3V; TCASE= O°Cto +85°C; Cl = 50 pF unless otherwise specified. 


Symbol 
Parameter 
Mln 
Max 
Figure 
Units 
Notes 


Frequency 
, 
8 
16 
MHz 
1X Clock 


t1 
ClK Period 
62.5 
125 
12.1 
ns 


t1a 
ClK Period Stability 
0.1% 
12.1 
ns 
Adjacent Clocks 


t2 
ClK High Time 
20 
~ 
ns 
at2V 


"i 


«~t!~1 
t3 
ClKlowTime 
20 
ns 
atO.8V 


t4 
ClK Fall Time 
A ~ 
....C~ 12.1 
ns 
2Vto 0.8V 


t5 
ClK Rise Time 
~~. 
.~~r'l 
12.1 
ns 
0.8Vto 2V 


~ 
A2-A31,PWT,PCO,BEO-3#,M/~~~; 
~ 
.... 
iP 
12.5 
ns 
W/R#, AOS#, lOCK#, 
BREQ, Hl 
illd Oe~y 


t7 
A2-A31, PWT, PCO,BEO-~~~~?~~ 
O~~~ 
r....' 
42 
12.6 
ns 
After Clock Edges(1) 


W/R#, AOS#, lOCK#, 
BR 
DA F_ 
••••~ 


te 
PCHK# Valid Delay 
~ 
~, 
"'.ft"J" 
3 
35 
12.4 
ns 


tea 
BLAST#, PlOCK# VaHdDelay//~V',,>O"" 
3 
35 
12.5 
ns 


t9 
BlASH, 
PLOCK# Float O~~" 
••.~ 
42 
12.6 
ns 
After Clock Edges(1) 


tlO 
00-031, 
OPO-OP3 Wri~D~~Va~elay 
3 
30 
12.5 
ns 


t11 
00-031, 
OPO-OP3 ~i'r5ata 
Float Delay 
42 
12.6 
ns 
After Clock Edges(l) 


t12 
EAOS# Setup Time 
,y 
12 
12.2 
ns 


t13 
EAOS# Hold Time 
3 
12.2 
ns 


t14 
KEN#, BS16#, BS8# Setup Time 
12 
12.2 
ns 


t15 
KEN#, BS16#, BS8# Hold Time 
3 
12.2 
ns 


t16 
ROY#, BROY# Setup Time 
12 
12.2 
ns 


• 


16 MHz 3.3V Intel486™ 
SX Microprocessor 
A.C. Characteristics 
vcc 
= 3.3V 
±0.3V; 
TCASE = O·C to + 85·C; Cl = 50 pF unless otherwise 
specified. 
(Continued) 


Symbol 
Parameter 
Mln 
Max 
Figure 
Unit 
Notes 


t17 
RDY #, BRDY # Hold Time 
3 
12.2 
ns 


t1B 
HOLD, AHOLD, 
BOFF # Setup Time 
12 
12.2 
ns 


t19 
HOLD, AHOLD, 
BOFF # Setup Time 
3 
12.2 
ns 


t20 
RESET, FLUSH #, A20M #, NMI, INTR Setup Time 
14 


"I 
12.2 
ns 


t21 
RESET, FLUSH II, A20M #, NMI, INTR Hold Time 
3 
" 


12.2 
ns 


t22 
DO-D31 
, DPO-DP3, 
M-A31 
Read Setup Tiaf& 
10:' • .~ 
12.2 
ns 


t23 
DO-D31 
, DPO-DP3, 
M-A31 
Read Hold;r~ 
~~ 
12.2 
ns 


t24 
TCK Frequency 
.~~ 
~ i "" 


16 
12.7 
MHz 
1X Clock 


t25 
TCK Period 
,4'-v' 
_V 
6)5 
12.7 
ns 
(Note 3) 


t26 
TCK High Time 
iV" ,..,~ 
.~ 
10 
12.7 
ns 
@2.0V 


t27 
TCKLowTime 
h:..~' 
(t~A.~V" 
10 
12.7 
ns 
@0.8V 


t2B 
TCK Rise Time ~,.~~-~- 
4 
12.7 
ns 
(Note 2) 


t29 
TCK Fall Time 
.A.V.,f~~ 
4 
12.7 
ns 
(Note 2) 


t30 
TDI, TMS Setup Time 
.••..<;;f:> :....~ 
"0 
8 
12.7 
ns 
(Note 4) 


t31 
TDI, TMS Hold Time.6 
v 
~. 


10 
12.7 
ns 
(Note 4) 


t32 
TOO Valid Delay...•<... 
3 
25 
12.7 
ns 
(Note 4) 


t33 
TDO Float Delay •••• 
TBD 
12.7 
ns 
(Note 4) 


t34 
All Outputs 
(Non-Test 
Valid Delay) 
3 
25 
12.7 
ns 
(Note 4) 


t35 
All Outputs 
(Non-Test 
Float Delay) 
36 
12.7 
ns 
(Note 4) 


t36 
All Inputs (Non-Test) 
Setup Time 
8 
12.7 
ns 
(Note 4) 


t37 
All Inputs (Non-Test) 
Hold Time 
10 
12.7 
ns 
(Note 4) 


NOTES: 
1. Not 100% tested, 
guaranteed 
by design 
characterization. 


2. Rise/Fall 
times 
are measured 
between 
O.BVand 2.0V. Rise/Fall 
times 
can be relaxed 
by 1 ns per 10 ns increase 
in TCK 


period. 
3. TCK period 
~ ClK period. 


4. Parameter 
measured 
from TCK. Boundary 
scan tested 
at Vcc greater 
than or equal to 3.3V. 


20 MHz 3.3V Intel486™ 
SX Microprocessor 
A.C. Characteristics 
vcc 
= 3.3V 
±0.3V; 
TCASE = O°C to + 85°C; CL = 50 pF unless otherwise 
specified. 


Symbol 
Parameter 
Mln 
Max 
Figure 
Unit 
Notes 


Frequency 
8 
20 
MHz 
1X Clock 


t1 
CLK Period 
50 
125 
12.1 
ns 


t1a 
CLK Period Stability 
0.1% 
12.1 
ns 
Adjacent 
Clocks 


t2 
CLK High Time 
16 
12.1 
ns 
at2V 


t3 
CLK Low Time 
16 
12.1 
ns 
atO.8V 


t4 
CLK Fall Time 
6 
12.1 
ns 
2V to 0.8V 


t5 
CLK Rise Time 
6 
12.1 
ns 
0.8Vto 
2V 


ts 
A2-A31, 
PWT, PCD, BEO-3#, 
M/IO#, 
D/C#, 
3 
23 
12.5 
ns 
W IR #, ADS#, 
LOCK #, BREQ, HLDA Valid Delay 


t7 
A2-A31, 
PWT, PCD, BEO-3#, 
M/IO#, 
D/C#, 
37 
12.Q 
ns 
After Clock Edges(1) 


W/R#, 
ADS#, 
LOCK#, 
BREQ, HLDA Float Delay 
•. 


t8 
PCHK# 
Valid Delay 
~ 3 
28~ 
12.4 
ns 


t8a 
BLAST #, PLOCK # Valid Delay 
4 
3 
28·' 
12.5 
ns 


t9 
BLAST #, PLOCK # Float Delay 
~? 
'" •..31 
12.6 
ns 
After Clock Edges(1) 


tlO 
00-031 
, DPO-DP3 
Write Data Valid ~. 
'3" • 
~"\. 
12.5 
ns 


t11 
00-031 
, DPO-DP3 
Write Data FloalDelay 
'" 
"37 
12.6 
ns 
After Clock Edges(1) 


t12 
EADS # Setup Time 
/' 
~. 
~.,1'0 
12.2 
ns 


t13 
EADS # Hold Time 
~~ 
~~..s 
:3 
12.2 
ns 


t14 
KEN#, 
BS16#, 
BS8# Setup Timeo'" 
'?A..~ 
10 
12.2 
ns 


t15 
KEN#, 
BS16#, 
BS8# 
Hold Tin{~ 
•••.. 
~'OiJ' 
3 
12.2 
ns 


t16 
ROY #, BRDY # Setup Time 
S->' 
10 
12.2 
ns 


t17 
ROY #, BRDY # HOldJ~ 
3 
12.2 
ns 


t18 
HOLD, AHOLD, 
B~}¥Setup 
Time 
12 
12.2 
ns 


t19 
HOLD, AHOLD, 
BOFF # Setup Time 
3 
12.2 
ns 


t20 
RESET, FLUSH #, A20M #, NMI, INTR Setup Time 
12 
12.2 
ns 


t21 
RESET, FLUSI-t#, 
A20M#, 
NMI, INTR Hold Time 
3 
12.2 
ns 


t22 
00-031 
, DPO-DP3, 
A4-A31 
Read Setup Time 
8 
12.2 
ns 


t23 
00-031 
, DPO-DP3, 
A4-A31 
Read Hold Time 
3 
12.2 
ns 


t24 
TCK Frequency 
20 
12.7 
MHz 
1X Clock 


t25 
TCK Period 
50 
12.7 
ns 
(Note 3) 


t26 
TCK High Time 
10 
12.7 
ns 
@2.0V 


t27 
TCK Low Time 
10 
12.7 
ns 
@0.8V 


t28 
TCK Rise Time 
4 
12.7 
ns 
(Note 2) 


t29 
TCKFaliTime 
4 
12.7 
ns 
(Note 2) 


• 


20 MHz 3.3V Intel486™ 
SX Microprocessor 
A.C. Characteristics 


vcc = 3.3V 
± 0.3V; TCASE = O°C to + 85°C; Cl = 50 pF unless otherwise 
specified. 
(Continued) 


Symbol 


t30 


t31 


t32 


t33 


t34 


t35 


t36 


t37 


Parameter 


TOI, TMS Setup Time 


TOI, TMS Hold Time 


TOO Valid Delay 


TOO Float Delay 


All Outputs 
(Non-Test 
Valid 


All Outputs 
(Non-Test 
F 


All Inputs (Non-Test 


All Inputs (Non-T-Elst ,Hold T 


e 
Unit 
Notes 


ns 
(Note 4) 


12.7 
ns 
(Note 4) 


12.7 
ns 
(Note 4) 


12.7 
ns 
(Note 4) 


12.7 
ns 
(Note 4) 


12.7 
ns 
(Note 4) 


12.7 
ns 
(Note 4) 


12.7 
ns 
(Note 4) 


NOTES: 
1. Not 100% tested, guaranteed by desig 
on. 
2. Rise/Fall times are measured betw 
O. 
.OV.Rise/Fall times can be relaxed by 1 ns per 10 ns increase in TCK 
period. 
3. TCK period ~ ClK period. 
4. Parameter measured from T 
undary scan tested at Vcc greater than or equal to 3.3V. 


25 MHz 3.3V Intel486™ 
SX Microprocessor 
A.C. Characteristics 


vcc = 3.3V 
±0.3V; 
TCASE = O°C to + 85°C; Cl = 50 pF unless otherwise 
specified. 


Symbol 
Parameter 
Mln 
Max 
Figure 
Unit 
Notes 


Frequency 
8 
MHz 
1X Clock 


t1 
ClK 
Period 
40 
ns 


t1a 
ClK 
Period Stability 
ns 
Adjacent 
Clocks 


t2 
ClK 
High Time 
ns 
at2V 


t3 
ClK 
low 
Time 
ns 
at 0.8V 
4 
elK 
Fall Time 
ns 
2VtoO.8V 


t5 
ClK 
Rise Time 
ns 
0.8Vto2V 


t6 
12.5 
ns 


t7 
28 
12.6 
ns 
After Clock Edges(l) 


ta 
3 
24 
12.4 
ns 


tea 
3 
24 
12.5 
ns 


tg 
28 
12.6 
ns 
After Clock Edges(l) 


t10 
3 
20 
12.5 
ns 


tll 
28 
12.6 
ns 
After Clock Edges(l) 


t12 
8 
12.2 
ns 


t13 
3 
12.2 
ns 


25 MHz 3.3V Intel486™ 
SX Microprocessor 
A.C. Characteristics 
vcc = 3.3V ± 0.3V; TCASE = O°C to + 85°C; CL = 50 pF unless otherwise 
specified. 
(Continued) 


Symbol 
Parameter 
Min 
Max 
Figure 
Unit 
Notes 


t14 
KEN#, 
BS16#, 
BS8# 
Setup Time 
8 
12.2 
ns 


t15 
KEN#, 
BS16#, 
BS8# 
Hold Time 
3 
12.2 
ns 


t16 
RDY#, 
BRDY# 
Setup Time 
8 
12.2 
ns 


t17 
RDY #, BRDY # Hold Time 
3 
12.2 
ns 


t1B 
HOLD, AHOLD, 
BOFF # Setup Time 
10 
12.2 
ns 


t19 
HOLD, AHOlD, 
BOFF # Setup Time 
3 
+" 
12.2 
ns 


t20 
RESET, FLUSH #, A20M #, NMI, INTR Setup Time 
10 
12.2 
ns 


t21 
RESET, FlUSH#, 
A20M#, 
NMI, INTR Hold TIme 
3 
12.2 
ns 


t22 
DO-D31, 
DPO-DP3, 
A4-A31 
Read SqttlJt Time 
,. 
:;#8 
12.2 
ns 


t23 
DO-D31, 
DPO-DP3, 
A4-A31 
Rea~ 
Time ~} 
. .~ 
12.2 
ns 


t24 
TCK Frequency 
,~,/ 
.•. 
25 
12.7 
MHz 
1X Clock 


t25 
TCK Period 
_~F 
..fAY At.,. 
40 
12.7 
ns 
(Note 3) 


t26 
TCK High Time 
A~- 
'>L::.W 
10 
12.7 
ns 
@2.0V 


t27 
TCKlowTime 
*"-> 
,*~Vft"""'- 
10 
12.7 
ns 
@0.8V 


t2B 
TCK Rise Time 
-<'-•... 
~- 
4 
12.7 
ns 
(Note 2) 


t29 
TCKFallTime 
~~~ 
4 
12.7 
ns 
(Note 2) 


t30 
TDI, TMS Setup Ti~~ 
8 
12.7 
ns 
(Note 4) 


t31 
TOI, TMS Hold ~ 
10 
12.7 
ns 
(Note 4) 


t32 
TDO Valid Delay" 
3 
25 
12.7 
ns 
(Note 4) 


t33 
TDO Float Delay 
TBD 
12.7 
ns 
(Note 4) 


t34 
All Outputs 
(Non-Test 
Valid Delay) 
3 
25 
12.7 
ns 
(Note 4) 


t35 
All Outputs 
(Non-Test 
Float Delay) 
36 
12.7 
ns 
(Note 4) 


t36 
All Inputs (Non-Test) 
Setup Time 
8 
12.7 
ns 
(Note 4) 


t37 
All Inputs (Non-Test) 
Hold Time 
10 
12.7 
ns 
(Note 4) 


NOTES: 
1. Not 100% tested. guaranteed by design characterization. 
2. Rise/Fall times are measured between O.BVand 2.0V. Rise/Fall times can be relaxed by 1 ns per 10 ns increase in TCK 
period. 
3. TCK period ;;, CLK period. 
4. Parameter measured from TCK. Boundary scan tested at Vcc greater than or equal to 3.3V. 
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Intel is enhancing its entire Intel486™ microprocessor 
family with the energy-efficient technology driving to- 
day's highly integrated Intel486 SL CPUs. The SL En- 
hanced Intel486 microprocessor family enables built-in 
power management while maintaining full compatibil- 
ity with existing Intel architecture processors. The SL 
Enhanced Intel486 CPU family allows system designers 
to design desktop systems that exceed the Environmen- 
tal Protection Agency's (EPA) Energy Star guidelines, 
without sacrificing performance. Bringing SL Technol- 
ogy to the entire Intel486 CPU line increases notebook 
system design flexibility and increases the battery life of 
all high-performance 
Intel486 CPU-based notebooks. 
SL technology allows system designers to differentiate 
their power management schemes with a variety of en- 
ergy-efficient or battery life-preserving features. 


SL Enhanced Intel486 CPU-based systems can maxi- 
mize both energy efficiency and performance. SL Tech- 
nology features like Stop Clock and Auto HALT power 
down allow CPU standby modes, saving systems up to 
an additional 4W over previous Intel486 CPUs while 
providing virtually instantaneous recovery to a full-on 
state. Stop Clock and Auto Idle power down also allow 
active power management of the CPU so systems can 
save energy even when in use. SL Enhanced OverDri- 
ve™ processors provide end-user performance upgrad- 
ability to SL Enhanced Intel486 CPUs while maintain- 
ing their energy efficiency. 


SL Enhanced Intel486 CPUs enable power manage- 
ment features to be built into hardware, allowing ener- 
gy efficiency that is transparent to application and O/S 
software. Stop Clock, Auto HALT power down, and 
Auto Idle power down allow software transparent con- 
trol over CPU power management. Equally important 
is the capability of the processor to manage system 
power consumption. The SL Enhanced Intel486 CPU 
incorporates 
the 
same System 
Management 
Mode 
(SMM) found in the SL CPU family. A non-maskable 
System Management Interrupt (SMI), a corresponding 
Resume (RSM) instruction and a new memory space 
for system management code are the basis of Intel's 
System Management Mode. Intel's SMM ensures seam- 
less power control of the processor core, system logic, 
main memory and one or more peripheral 
devices 
transparent to any application or operating system. 


SL Enhanced Intel486 CPUs are available in a full 
range of speeds (25 MHz to 66 MHz), packages (PGA, 
SQFP, PQFP), and voltages (5V, 3.3V) to meet any 
system design requirement. 


This Data Book Addendum highlights the features of 
Intel's 
SL Enhanced 
Intel486 
microprocessors 
and 


should be used in conjunction with the latest version of 
the existing Intel486 microprocessor 
documents, 
in- 
cluding the following: 


• Intel486™ 
DX Microprocessor Data. Book, Order 
No. 240440 


• Intel486™ 
SX Microprocessor-Intel487™ SX Co- 
ProcessorData Sheet, Order No. 240950 
• Intel486TM Family of Microprocessors Low Power 
Version Data Sheet, Order No. 241199 
• Intel486™ 
Microprocessor Family 
Programmer's 
Reference Manual, Order No. 240486 


• Intel486™ 
Microprocessor Hardware 
Reference 
Manual, Order No. 240552 
• Intel486TM DX2 Microprocessor Data Book, Order 
No. 241245 
• Pentium™ 
OverDrive™ ProcessorData Sheet, Or- 
der No. 290436 
• 
1993 Packaging Handbook, Order No. 240800 


End-user upgradability for the SL Enhanced Intel486 
CPU product family is specified in the OverDrive Proc- 
essor socket section (Section 9) of this document. 


1.1 
SL Enhanced Intel486 
Microprocessor 
Features 


• Intel's System Management Mode - 
A unique Intel 
architecture operating mode with its dedicated spe- 
cial purpose interrupt and address space which can 
be used to implement intelligent power management 
and other enhanced functions in a manner which is 
completely transparent to the operating system and 
applications software. 
• I/O Restart - 
An I/O instruction interrupted by a 
System Management Interrupt 
(SMI#) 
can auto- 
matically be re-started during the execution of an 
RSM instruction. 
• Stop Clock - 
Provides a clock control mechanism 
for the SL Enhanced Intel486 CPUs. This mecha- 
nism provides two low-power states: a fast wake-up 
Stop Grant State (- 
20-55 mA) and a Stop Clock 
state with CLK frequency at O-MHz (-100 fJoA- 
200 fJoA). 


• Auto HALT Power Down - 
After the execution of a 
HALT instruction, the CPU issues a normal HALT 
bus cycle and the clock input to the SL Enhanced 
Intel486 CPU core is automatically stopped, causing 
the CPU to enter the Auto HALT Power Down 
state (- 20 mA-55 mA). 


• Auto Idle Power Down - 
This function which only 
applies to Intel486 DX2 CPUs, allows the CPU to 
reduce its core clock rate to half of its original fre- 
quency, without affecting performance. Auto Idle 
Power Down provides an average power savings of 
10%. 


• 


• UpgradePower Down Mode - 
When an OverDrive 
processor upgrade is installed, this feature detects 
the presence of the upgrade, then powers down the 
core, and three-states 
all outputs 
of the original 
CPU, so the original CPU consumes very low cur- 
rent. 
• Package Options - 
The SL Enhanced 
Intel486 
CPUs are available in 168 lead PGA, 
196 lead 
PQFP, and 208 lead SQFP packages. The SQFP 
package 
provides 
a .40% 
volume 
savings over 
PQFP, making it an ideal solution for mobile sys- 
tems. 
• 3.3 Volt Operation - 
The SL Enhanced Intel486 
CPUs are available with either a 3.3V supply voltage 
or a 5V supply voltage. The 3.3V supply voltage 
provides a 50% power saving over a 5V supply volt- 
age. 
• Clocking Options - 
The SL Enhanced 
Intel486 
CPUs are available with either a IX clock input or a 
2X clock input. The IX clock option, in which the 
phases of the core clock are provided by an internal 
Phase Lock Loop circuit, provides a simpler system 
design with better I/O-timing performance. The IX 
clock option, is required for DX2 'clock-doubled sys- 
tems and OverDrive Processor upgradability. The 
2X clock option, in which both phases of the core 
clock are provided by the system, allows portable 
designs to use existing Intel486 "Lpn CPU clock 
control mechanisms. The 2X clock option is intend- 
ed as a short-term solution for portable applications. 


All of the SL Enhanced Intel486 CPUs are based on an 
Intel486 
CPU core which consists of a 32-bit integer 
processing unit, an 8 Kbyte cache, and a memory man- 
agement unit. This ensures full binary compatibility 
with the 8086, 8088, 80186, 80286, Inte1386TMSX, In- 
tel386 DX, and all versions of Intel486 microproces- 
sors. All of the Intel486 microprocessors described in 
this document offer the following features: 
• Full 32-bit RISC integer processor - 
The Intel486 
performs a complete set of arithmetic and logical 
operations on 8-, 16-, and 32-bit data types using a 
full-width ALU and eight general purpose registers. 
• Single Cycle Execution - 
Many instructions exe- 
cute in a single clock cycle. 
• Instruction Pipe/ining - 
The fetching, decoding, ex- 
ecution, and address translation of instructions is 
overlapped within the Intel486 microprocessor. 


• On-Chip Floating Point Unit - 
(All except Intel486 
SX CPU) Supports 32-, 64-, and 80-bit formats 
specified in IEEE standard 754 are supported. The 
unit is binary compatible with the 8087, 80287, 
Inte1387TMSXand DX, and Intel487™ 
SX math 
coprocessors. 


• On-Chip Cache with Cache Consistency Support - 
An 8 Kbyte internal write-through cache is used for 
both data and instructions. Cache hits provide zero 
wait-state access times for data within the cache. 
Bus activity is tracked to detect alterations in the 
memory which the internal cache represents. The 
internal cache can be invalidated or flushed so that 
an external cache controller 
can maintain 
cache 
consistency in multi-processor environments. 


• External 
Cache Control - 
Write-back and flush 
controls for an external L2 cache are provided so 
the processor can maintain 
cache consistency in 
multiprocessor environments. 


• On-Chip Memory Management 
Unit - 
Address 
management and memory space protection mecha- 
nisms maintain the integrity of memory in a multi- 
tasking and virtual memory environment. Both seg- 
mentation and paging are supported. 


• Separate 32-bit Address and Data Paths - 
Up to 
four gigabytes of physical memory can be addressed 
directly. 
• Burst Cycles - 
Burst transfers allow a new double 
word to be read from memory on each clock cycle. 
This capability is especially useful for instruction 
prefetch and for filling the internal cache. 


• 
Write Buffers - 
The processor contains four write 
buffers to enhance the performance of consecutive 
writes to memory. The CPU can continue internal 
operations after a write, without waiting for the 
write to be executed on the external bus. 
• Bus Backoff - 
If another bus master needs control 
of the bus during a CPU initiated bus cycle, the 
Intel486 CPU will float its bus signals, then restart 
the cycle when the bus becomes available again. 


• Instruction Restart - 
Programs can continue execu- 
tion following an exception generated by an unsuc- 
cessful attempt to access memory. This feature is 
important 
for 
supporting 
demand-paged 
virtual 
memory applications. 


• Dynamic Bus Sizing - 
External controllers can dy- 
namically alter the effective width of the data bus. 
Bus widths of 8, 16, or 32 bits can be used. 
• Boundary Scan (JTAG) - 
'Boundary Scan provides 
for in-circuit testing of components on printed cir- 
cuit boards. The Intel Boundary Scan implementa- 
tion conforms with the IEEE Standard Test Access 
Port and Boundary Scan Architecture. Not all prod- 
ucts are offered with the boundary scan feature. 


1.2 
The SL Enhanced Intel486 CPU 
Product Family 


age. An individual product will have either a IX clock 
or a 2X clock, but not both. Likewise, an individual 
product will have either a 5V supply voltage or a 3.3V 
supply voltage, but not both. Please contact Intel for 
the latest product availability and specifications. 


Table I-I shows the SL Enhanced Intel486 CPUs and 
existing Intel486 microprocessors available by Clock 
Mode, Supply Voltage, Maximum Frequency, and Pack- 


Existing 
SLEnhanced 
Vcc 
CPU 
168 
196 
208 
Intel486 
CPUs 
Intel486 
CPUs 
FREQUENCY 
PGA 
PQFP 
SQFP 


1XCLOCK 


Intel486 
SX CPU 
SlEnhanced 
3.3V 
25 
33 
X 
Intel486 SX CPU 
5V 
25 
33 
X 
X 


Intel486 
OX CPU 
SlEnhanced 
3.3V 
33 
X 
Intel486 
OX CPU 
5V 
33 
X 
X 


50 
X 


Intel486 
OX2 CPU 
Sl Enhanced 
3.3V 
40 
50 
X 
Intel486 
OX2 CPU 
5V 
50 
66 
X 


2XCLOCK 


low 
Power 
Sl 
Enhanced 
3.3V 
25 
33 
X 
Intel486 
SX CPU 
Intel486 
SX CPU 
5V 
25 
33 
X 


low 
Power 
SlEnhanced 
3.3V 
33 
X 
Intel486 
OX CPU 
Intel486 
OX CPU 
5V 
33 
X 


196 Lead PQFP - 
Plastic Quad Flat Pack 


- 
Package Diagram 
- 
Pin Assignment Difference Table 
- 
Pin Assignment Table in numerical order 
The following figures show the pin assignments of each 
package type for the SL Enhanced Inte1486 CPU prod- 
uct family. Tables are provided showing the pin differ- 
ences between the existing Intel486 CPU products and 
the SL Enhanced Intel486 CPU products. 


168 Lead PGA - 
Pin Grid Array 


- 
Package Diagram 
- 
Pin Assignment Difference Table 
- 
Pin Cross Reference by Pin Name 
NOTE: 


NC pins should ALWAYS remain unconnected. 
208 Lead SQFP - 
Quad Flat Pack 


- 
Package Diagram 
- 
Pin Assignment Difference Table 
- 
Pin Assignment Table in numerical order 


• 


intel~ 


Vcc 
llC 
V •• 
V •• 


SRESET 


Vee 
SKIACT' 
llC 
llC 
llC 
llC 
llC 
v •• 
SMIt 


Vee 
llC 
llC 
PERR' 


TOO 
llC 
llC 
IGNlIl!' 
llC 
STPCLJ(' 
DJl 
llC 
llC 
D). 


Vee 
D2' 
llC 


D28 


Figure 2-1. Package 
Diagram 
(196 Lead PQFP Package) 


Table 2-1. Pinout Differences 
for 196 Lead PQFP Package 


Intel486SX 
Low Power 
SL Enhanced 
Intel486 
OX 
Low Power 
SL Enhanced 
Pln# 
CPU 
Intel486 
SX CPU 
Intel486SX 
CPU 
Intel486 
OX 
Intel486 
OX 


CPU 
CPU 
CPU 


75 
NC 
NC 
STPCLK# 
NC 
NC 
STPCLK# 


77 
NC 
NC 
NC 
IGNNE# 
IGNNE# 
IGNNE# 


81 
NC 
NC 
NC 
FERR# 
FERR# 
FERR# 


85 
NC 
NC 
SMI# 
NC 
NC 
SMI# 


92 
NC 
NC 
SMIACT# 
NC 
NC 
SMIACT# 


94 
NC 
NC 
SRESET 
NC 
NC 
SRESET 


127 
NC 
CLKSEL 
NC 
NC 
CLKSEL 
NC 


Pin# 
Description 
Pln# 
Description 
Pln# 
Description 
Pln# 
Description 


1 
Vss 
38 
012 
75 
STPClK# 
112 
Vcc 
2 
A21 
39 
013 
76 
NC 
113 
BE3# 


3 
A22 
40 
Vss 
77 
IGNNE# 
114 
Vss 
4 
A23 
41 
014 
78 
NC 
115 
BE2# 


5 
A24 
42 
015 
79 
NC 
116 
BE1# 


6 
Vcc 
43 
OP2 
80 
TOO 
117 
BEO# 


7 
A25 
44 
016 
81 
FERR# 
118 
BREQ 


8 
A26 
45 
017 
82 
NC 
119 
Vcc 
9 
A27 
46 
018 
83 
NC 
120 
W/R# 


10 
A28 
47 
019 
84 
Vcc 
121 
Vss 
11 
Vss 
48 
020 
85 
SMI# 
122 
HloA 


12 
A29 
49 
Vcc 
86 
Vss 
123 
ClK 


13 
A30 
50 
Vss 
87 
NC 
124 
NC 


14 
A31 
51 
021 
88 
NC 
125 
Vcc 
15 
NC 
52 
NC 
89 
NC 
126 
Vss 
16 
OPO 
53 
022 
90 
NC 
127 
NC 


17 
DO 
54 
Vcc 
91 
NC 
128 
TCK 


18 
01 
55 
023 
92 
SMIACT# 
129 
AHOlo 


19 
Vcc 
56 
NC 
93 
Vcc 
130 
HOLD 


20 
02 
57 
OP3 
94 
SRESET 
131 
Vcc 
21 
Vss 
58 
Vss 
95 
Vss 
132 
KEN# 


22 
Vss 
59 
024 
96 
Vss 
133 
Roy# 


23 
03 
60 
NC 
97 
NC 
134 
NC 


24 
Vcc 
61 
025 
98 
Vcc 
135 
BS8# 


25 
04 
62 
Vcc 
99 
Vss 
136 
BS16# 


26 
05 
63 
026 
100 
NMI 
137 
BOFF# 


27 
06 
64 
NC 
101 
INTR 
138 
BRoy# 


28 
Vcc 
65 
027 
102 
FlUSH# 
139 
PCHK# 


29 
07 
66 
Vss 
103 
RESET 
140 
NC 


30 
OP1 
67 
028 
104 
A20M# 
141 
Vss 
31 
08 
68 
NC 
105 
EAoS# 
142 
lOCK# 


32 
09 
69 
029 
106 
PCO 
143 
PlOCK# 


33 
Vss 
70 
Vcc 
107 
Vcc 
144 
BLAST# 


34 
NC 
71 
030 
108 
PWT 
145 
AoS# 


35 
010 
72 
NC 
109 
Vss 
146 
A2 
36 
Vcc 
73 
NC 
110 
O/C# 
147 
Vcc 
37 
011 
74 
031 
111 
M/IO# 
148 
Vss 


•• 


Pln# 
Description 
Pln# 
Description 
Pln# 
Description 
Pin # 
Description 


149 
NC 
161 
A8 
173 
NC 
185 
TDI 


150 
A3 
162 
NC 
174 
A12 
186 
NC 


151 
NC 
163 
A9 
175 
Vcc 
187 
TMS 


152 
A4 
164 
Vcc 
176 
A13 
188 
NC 


153 
NC 
165 
A10 
177 
Vss 
189 
A18 


154 
A5 
166 
NC 
178 
A14 
190 
NC 


155 
NC 
167 
Vss 
179 
Vcc 
191 
A19 


156 
UP# 
168 
Vss 
180 
A15 
192 
NC 


157 
NC 
169 
NC 
181 
A16 
193 
A20 


158 
A6 
170 
Vcc 
182 
Vss 
194 
Vss 


159 
A7 
171 
NC 
183 
A17 
195 
NC 


160 
NC 
172 
A11 
184 
Vcc 
196 
Vcc 


/ D'.o 


1GB-Lead 
PGA 
PIN 
SIDE 
VIEW 


.02 
A15 
A12 
000 


INTR 
TOO 
USIT' 
asll 
VC'C 
ROY. 
VCC 
VCC 
BElt 
vcc 
vcc 
VCC 
M/IO' 
VCC 
Pt.OCK. 
BLAST. 
.u 
00000000000000000 
• 


Intel486SX 
SL Enhanced 
Intel486 
OX 
SLEnhanced 
Intel486 
OX2 
SLEnhanced 
Pin 
Intel486SX 
Intel486 
OX 
Intel4860X2 
CPU 
CPU 
CPU 
CPU 
CPU 
CPU 


A3 
NC 
NC 
NC(5) 
TCK 
tCK 
TCK 


A10(1) 
NC 
NC 
NC 
NC 
NC 
NC 


A13 
NC 
INC(2) 
NC 
INC(2) 
NC 
INC(2) 


A14 
NC 
NC 
NC(5) 
TOI 
TOI 
TOI 


A15 
NMI 
NMI 
IGNNE# 
IGNNE# 
IGNNE# 
IGNNE# 


810 
NC 
SMI# 
NC 
SMI# 
NC 
SMI# 


814 
NC 
NC 
NC(5) 
TMS 
TMS 
TMS 


815 
NC 
NC 
NMI 
NMI 
NMI 
NMI 


816 
NC 
NC 
NC(5) 
TOO 
TOO 
TOO 


C10 
NC 
SRESET 
NC 
SRESET 
NC 
SRESET 


C11 
NC 
UP#(3) 
NC 
UP#(3) 
UP# 
UP# 


C12 
NC 
SMIACT# 
NC 
SMIACT# 
NC 
SMIACT# 


C14 
NC 
NC 
FERR# 
FERR# 
FERR# 
FERR# 


G15 
NC 
STPCLK# 
NC 
STPCLK# 
NC 
STPCLK# 


J1 
VCC 
VCC<4) 
Vcc 
VCC<4) 
Vcc 
VCC<4) 


NOTES: 
1. Pin A10 is defined as NC. For compatibility with future CPUs, this pin should not be connected. 
2. The INC pin is defined to be an internal no-connect. This means that the pin is not internally connected and may be used 
for the routing of external signals. 
3. The SL Enhanced Intel486 SX and OX CPUs now offer Upgrade Power Down mode through the UP# pin. This is the new 
power down mechanism that should be used when designing in an OverDrive processor. 
4. This pin is a Vcc pin. For compatibility with future 3.3V CPUs that will have 5V safe input buffers, this pin should be 
connected to a Vcc trace, not to the Vcc plane. For a mixed voltage system, where the CPU inputs are to be driven by 5V 
logic, this pin should be connected to 5V to bias the input buffers (the 3.3V CPU will require all of the normal Vcc pins to be 
connected to 3.3V). For systems that have all 3.3V logic, this pin should be connected to 3.3V. 
5. For the Intel486 OX CPU 50 MHz version, pins A3, A14, 814 and 816 are boundary scan pins. 


intel~ 


Address 
Data 
Control 
NC 
Vcc 
Vss 


A2 
Q14 
DO 
P1 
A20M*" 
015 
A10(1) 
87 
A7 


A3 
R15 
01 
N2 
ADS*" 
S17 
A13(2) 
89 
A9 


A4 
S16 
02 
N1 
AHOlO 
A17 
812 
811 
A11 


A5 
Q12 
03 
H2 
8EO*" 
K15 
813 
C4 
83 


A6 
S15 
04 
M3 
8E1*" 
J16 
C13 
C5 
84 


A7 
Q13 
05 
J2 
8EU 
J15 
R17 
E2 
85 


A8 
R13 
06 
l2 
8E3*" 
F17 
S4 
E16 
E1 


A9 
Q11 
07 
l3 
8LAST*" 
R16 
G2 
E17 


A10 
S13 
08 
F2 
80FF*" 
017 
G16 
G1 


A11 
R12 
09 
01 
8ROY*" 
H15 
H16 
G17 


A12 
S7 
010 
E3 
8REQ 
Q15 
J1 
H1 


A13 
Q10 
011 
C1 
8S8*" 
016 
K2 
H17 


A14 
S5 
012 
G3 
8S16*" 
C17 
K16 
K1 


A15 
R7 
013 
02 
ClK 
C3 
L16 
K17 


A16 
Q9 
014 
K3 
O/C*" 
M15 
M2 
l1 


A17 
Q3 
015 
F3 
OPO 
N3 
M16 
l17 


A18 
R5 
016 
J3 
OP1 
F1 
P16 
M1 


A19 
Q4 
017 
03 
OP2 
H3 
R3 
M17 


A20 
Q8 
018 
C2 
OP3 
A5 
R6 
P17 


A21 
Q5 
019 
81 
EAOS*" 
817 
R8 
Q2 


A22 
Q7 
020 
A1 
FERR*" 
C14 
R9 
R4 


A23 
S3 
021 
82 
FLUSH*" 
C15 
R10 
S6 


A24 
Q6 
022 
A2 
HlOA 
P15 
S8 


•• 


Address 
Data 
Control 
MC 
Vcc 
Vss 


A25 
R2 
023 
A4 
HOLO 
E15 
R14 
59 


A26 
52 
024 
A6 
IGNNE# 
A15 
510 


A27 
51 
025 
86 
INTR 
A16 
511 


A28 
R1 
026 
C7 
KEN# 
F15 
512 


A29 
P2 
027 
C6 
LOCK# 
N15 
514 


A30 
P3 
028 
C8 
M/lO# 
N16 


A31 
01 
029 
A8 
NMI 
815 


030 
C9 
PCO 
J17 


031 
88 
PCHK# 
017 


PWT 
L15 


PLOCK# 
016 


ROY# 
F16 


RE5ET 
C16 


SMI# 
810 


5MIACT# 
C12 


UP# 
C11(3) 


W/R# 
N17 


5TPCLK# 
G15 


5RE5ET 
C10 


TCK 
A3(4) 


TOI 
A14(4) 


TOO 
816(4) 


TM5 
814(4) 


NOTES: 
1. Pin A10. See note 1 for Table 2-3. 
2. Pin A13. See note 2 for Table 2-3. 
3. Pin C11. See note 3 for Table 2-3. 
4. Boundary Scan pins are not included on the Intel486 SX CPU in PGA. 


Figure 2·3. Package 
Diagram for 208 Lead SQFP 


Table 2-5. Pinout Differences 
for 208 Lead SQFP Package 


SLEnhanced 
SLEnhanced 
SLEnhanced 
Pin 
Intel486 
Intel486 
Intel486 
II 
SXCPU 
DXCPU 
DX2CPU 


66 
NC 
IGNNEII 
IGNNEII 


72 
NC 
FERRII 
FERRII 


intel~ 


Pln# 
Description 
Pln# 
Description 
Pln# 
Description 
Pln# 
Description 


1 
Vss 
53 
Vss 
105 
Vss 
157 
Vss 
2 
Vcc 
54 
Vcc 
106 
Vcc 
158 
A24 
3 
Vcc(1) 
55 
Vss 
107 
Vss 
159 
A23 
4 
PCHK# 
56 
Vcc 
108 
016 
160 
A22 
5 
BROY# 
57 
Vss 
109 
OP2 
161 
A21 
6 
BOFF# 
58 
SRESET 
110 
Vss 
162 
Vcc 
7 
BS16# 
59 
SMIACT# 
111 
Vcc 
163 
Vcc 
8 
BS8# 
60 
Vcc 
112 
015 
164 
A20 
9 
Vcc 
61 
Vss 
113 
014. 
165 
A19 
10 
Vss 
62 
Vcc 
114 
Vcc 
166 
A18 
11 
NC 
63 
NC 
115 
Vss 
167 
TMS 
12 
ROY# 
64 
NC 
116 
013 
168 
TOI 
13 
KEN# 
65 
SMI# 
117 
012 
169 
Vcc 
14 
Vcc 
66 
FERR# 
118 
011 
170 
Vss 
15 
Vss 
67 
NC 
119 
010 
171 
A17 
16 
HOLD 
68 
TOO 
120 
Vss 
172 
Vcc 
17 
AHOLO 
69 
Vcc 
121 
Vcc 
173 
A16 
18 
TCK 
70 
NC 
122 
Vss 
174 
A15 
19 
Vcc 
71 
NC 
123 
09 
175 
Vss 
20 
Vcc 
72 
IGNNE# 
124 
08 
176 
Vcc 
21 
Vss 
73 
STPCLK# 
125 
OP1 
177 
A14 
22 
Vcc 
74 
031 
126 
07 
178 
A13 
23 
Vcc 
75 
030 
127 
NC 
179 
Vcc 
24 
CLK 
76 
Vss 
128 
Vcc 
180 
A12 
25 
Vcc 
77 
Vcc 
129 
06 
181 
Vss 
26 
HLOA 
78 
029 
130 
05 
182 
A11 
27 
W/R# 
79 
028 
131 
Vcc 
183 
Vcc 
28 
Vss 
80 
Vcc 
132 
Vss 
184 
Vss 
29 
Vcc 
81 
Vss 
133 
Vcc 
185 
Vcc 
30 
BREQ 
82 
Vcc 
134 
Vcc 
186 
A10 
31 
BEO# 
83 
027 
135 
Vss 
187 
A9 
32 
BE1# 
84 
026 
136 
Vcc 
188 
Vcc 
33 
BEU 
85 
025 
137 
Vcc 
189 
Vss 
34 
BE3# 
86 
Vcc 
138 
Vss 
190 
A8 
35 
Vcc 
87 
024 
139 
Vcc 
191 
Vcc 
36 
Vss 
88 
Vss 
140 
04 
192 
A7 
37 
M/IO# 
89 
Vcc 
141 
03 
193 
A6 
38 
Vcc 
90 
OP3 
142 
02 
194 
UP# 
39 
O/C# 
91 
023 
143 
01 
195 
A5 
40 
PWT 
92 
022 
144 
DO 
196 
A4 
41 
PCO 
93 
021 
145 
OPO 
197 
A3 
42 
Vcc 
94 
Vss 
146 
Vss 
198 
Vcc 
43 
Vss 
95 
Vcc 
147 
A31 
199 
Vss 
44 
Vcc 
96 
NC 
148 
A30 
200 
Vcc 
45 
Vcc 
97 
Vss 
149 
A29 
201 
Vss 
46 
EAOS# 
98 
Vcc 
150 
Vcc 
202 
A2 
47 
A20M# 
99 
020 
151 
A28 
203 
AOS# 
48 
RESET 
100 
019 
152 
A27 
204 
BLAST # 
49 
FLUSH# 
101 
018 
153 
A26 
205 
Vcc 
50 
INTR 
102 
Vcc 
154 
A25 
206 
PLOCK# 
51 
NMI 
103 
017 
155 
Vcc 
207 
LOCK# 
52 
Vss 
104 
Vss 
156 
Vss 
208 
Vss 


NOTE: 
1. This 
pin is a Vcc 
pin. For compatibility 
with 
future 
3.3V CPUs 
that 
will 
have 
5V safe 
input 
buffers, 
this 
pin should 
be 
connected 
to a Vcc 
trace, 
not to the Vcc 
plane. 
For a mixed voltage 
system, 
where 
the CPU inputs 
are to be driven 
by 5V 
logic, this pin should 
be connected 
to 5V to bias the input buffers 
(the 3.3V CPU will require 
all of the normal 
Vcc 
pins to be 
connected 
to 3.3V). 
For systems 
that have all 3.3V logic, this pin should 
be connected 
to 3.3V. 
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Symbol 
Type 
Name and Function 
, 


ClK 
I 
CLocK provides 
the fundamental 
timing and the internal operating 
frequency 
for 
the CPU. All external timing parameters 
are specified 
with respect 
to the rising edge 
ofClK. 


For the 1X clock mode the ClK 
frequency 
is the same as the frequency 
of the CPU. 
For the DX2 clock doubled 
products 
the ClK 
frequency 
is half the frequency 
of the 
CPU. 


RESET 
I 
The RESET input forces the CPU to begin execution 
at a known state. Reset is 
asynchronous, 
but must meet setup and hold times t20 and t21 for recognition 
in 
any specific 
clock. The CPU cnnot begin execution 
of instructions 
until at least 1 ms 
after Vcc and ClK 
have reached their proper AC and DC specifications. 
However, 
for soft resets, RESET should remain active for at least 15 ClK 
periods. The 
RESET pin should remain active during this time to ensure proper CPU operation. 
RESET is active HIGH. 
RESET sets the 5MBASE 
descriptor 
to a default address of 30000H. 
If the system 
uses 5MBASE 
relocation, 
then the SRESET pin should be used for soft resets. 


SRESET 
I 
The SRESET pin duplicates 
all the functionality 
of the RESET pin with the following 
two exceptions: 
1. The 5MBASE 
register will retain its previous value. 
2. If UP# 
(I) is asserted, 
SRESET will not have an effect on the host 
microprocessor. 
For soft resets, SRESET should remain active for at least 15 ClK 
periods. SRESET 
is active HIGH. SRESET is asynchronous 
but must meet setup and hold times t20 
and t21 for recognition 
in any specific clock. 


SMI# 
I 
The System 
Management 
Interrupt 
input is used to invoke the System 
Management 
Mode (SMM). SMI # is a falling edge triggered 
signal which forces the 
CPU into SM M at the completion 
of the current instruction. 
SM I# is recognized 
on 
an instruction 
boundary 
and at each iteration for repeat string instructions. 
SMI # 
does not break lOCKed 
bus cycles and cannot 
interrupt 
a currently 
executing 
SMM. The CPU will latch the falling edge of one pending SMI # signal while the 
CPU is executing 
an existing SMI. The nested SMI will not be recognized 
until after 
the execution 
of a Resume 
(RSM) instruction. 


SMIACT# 
0 
The System 
Management 
Interrupt 
ACTive 
is an active low output, indicating 
that 
the processor 
is operating 
in SMM. It is asserted 
when the CPU begins to execute 
the SMI state save sequence 
and will remain active lOW 
until the processor 


executes 
the last state restore cycle out of SMRAM. 


STPClK# 
I 
The SToP CLocK request 
input signal indicates 
a request has been made to turn 
off the ClK 
input. When the CPU recognizes 
a STPClK 
#, the processor 
will stop 
execution 
on the next instruction 
boundary, 
unless superseded 
by a higher priority 
interrupt, 
empty all internal pipelines 
and the write buffers and generate 
a Stop 
Grant acknowledge 
bus cycle. STPClK# 
is active lOW 
and is provided 
with an 
internal 
pull-up resistor. STPClK# 
is an asynchronous 
signal, but must remain 
active until the processor 
issues the Stop Grant bus cycle. STPClK 
# may be de- 
asserted 
at any time after the processor 
has issued the Stop Grant bus cycle. Note 
that STPClK 
# should NOT be de-asserted 
before the CPU has issued the Stop 


. 
Grant bus cycle . 


UP# 
I 
The Upgrade 
Present 
input detects 
the presence 
of the upgrade processor, 
then 
powers down the core, and three-states 
all outputs of the original CPU, so that the 
original CPU consumes 
very low zero current. 
UP# 
is active lOW 
and sampled 
at 
all times, including 
after power-up 
and during reset. 


• 


Name 
Active 
When Floated 
Level 


BREQ 
HIGH 


HLOA 
HIGH 


BE3#-BEO# 
LOW 
Bus Hold 


PWT, PCO 
HIGH 
Bus Hold 


W/R#, 
N/A 
Bus Hold 


M/IO#,O/C# 


LOCK# 
LOW 
Bus Hold 


PLOCK# 
LOW 
Bus Hold 


AOS# 
LOW 
Bus Hold 


BLAST # 
LOW 
Bus Hold 


PCHK# 
LOW 


FERR#' 
LOW 


A3-A2 
N/A 
Bus, Address 
Hold 


SMIACT# 
LOW 


Name 
Active 
Level 
When Floated 


031-00 
N/A 
Bus Hold 


OP3-0PO 
HIGH 
Bus Hold 


A31-A4 
N/A 
Bus, Address 
Hold 


NOTES: 
All input/output 
signals 
are floated 
when 
UP# 
is asserted. 


• Present 
on the Intel486 
DX and DX2 CPUs only. 


Name 
Input or Output 
sampled/Driven 
On 


TCK 
Input 
N/A 
TOI 
Input 
Rising Edge of TCK 


TOO 
Output 
Falling Edge of TCK 


TMS 
Input 
Rising Edge of TCK 


Name 
Active 
Level 
Synchronous/ 
Internal 
Pull-up/ 


Asynchronous 
Pull-Down 


CLK, CLK2 
RESET 
HIGH 
Asynchronous 


SRESET 
HIGH 
Asynchronous 
Pull-down 
HOLD 
HIGH 
Synchronous 


AHOLO 
HIGH 
Synchronous 
Pull-down 
EAOS# 
LOW 
Synchronous 
Pull-up 


BOFF# 
LOW 
Synchronous 
Pull-up 


FLUSH # 
LOW 
Asynchronous 
Pull-up 
A20M# 
LOW 
Asynchronous 
Pull-up 


BS16#, 
BS8# 
LOW 
Synchronous 
Pull-up 


KEN# 
LOW 
Synchronous 
Pull-up 


ROY# 
LOW 
Synchronous 


BROY# 
LOW 
Synchronous 
Pull-up 


INTR 
HIGH 
Asynchronous 


NMI 
HIGH 
Asynchronous 


IGNNE#' 
LOW 
Asynchronous 
Pull-up 
SMI# 
LOW 
Asynchronous 
Pull-up 


STPCLK# 
LOW 
Asynchronous 
Pull-up 


UP# 
LOW 
Pull-up 


TCK 
HIGH 
Pull-up 


TOI 
HIGH 
Pull-up 


TMS 
HIGH 
Pull-up 
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The standard Inte1486 SX and Intel486 OX CPUs are 
driven by a IX clock as opposed to the Intel386 CPUs 
which use a 2X clock input. The SL Enhanced Inte1486 
CPUs are available with both the IX and 2X clocking 
options. The IX CLK input frequency is the same as 
the internal frequency of the CPU. The IX clock allows 
simpler system design by cutting in half the clock speed 
required in the external system. The CLK2 input fre- 
quency for 2X CLK CPUs is twice the frequency of the 
CPU. Clock control for CPUs with the 2X clock option 
is described in section 7.3. The remainder of Section 3 
discusses clock control for IX clock CPUs. 


The IX clock relies on an internal Phase Lock Loop 
(PLL) to generate the two internal clock phases, "phase 
one" and "phase two". The rising edge of CLK corre- 
sponds to the start of phase one (Phi). All external 
timing parameters are specified with respect to the ris- 
ing edge of CLK. The PLL requires a constant frequen- 
cy CLK input (to within 0.1%), and therefore the CLK 
input cannot be changed dynamically. 


On processors which use an internal clock doubling 
(OX2), CLK provides the fundamental timing refer- 
ence for the bus interface unit. The CLK input is dou- 
bled internally so that the CPU core will operate at 
twice the CLK input frequency, hence twice the bus 
frequency. The internal clock doubler enhances all op- 
erations operating out of the internal cache and/or not 
blocked by external bus accesses. This mode also uses a 
Phase Lock Loop (PLL) and therefore the CLK input 
must be maintained at a constant frequency. 


The SL Enhanced Intel486 CPU provides an interrupt 
mechanism, STPCLK #, that allows system hardware 
to control the power consumption of the CPU by stop- 
ping the internal clock (output of the PLL) to the CPU 
core in a controlled manner. This low-power state is 
called the Stop Grant state. In addition, the STPCLK # 
interrupt allows the system to change the input fre- 
quency within the specified range or completely stop 
the CLK input frequency (input to the PLL). If the 
CLK input is completely stopped, the CPU enters into 
the Stop Clock state-the 
lowest power state. 


STPCLK # is active LOW and is provided with an in- 
ternal pull-up resistor. STPCLK # is an asynchronous 
signal, but must remain active until the processor issues 
the Stop Grant but cycle. STPCLK # may be de-assert- 
ed at any time after the processor has issued the Stop 
Grant bus cycle. Note that STPCLK # should NOT be 
de-asserted before the CPU has issued the Stop Grant 
bus cycle. When the CPU enters the Stop Grant state, 
the internal pull-up resistor is disabled so that the CPU 
power consumption is reduced. The STPCLK # input 
must be driven high (not floated) in order to exit the 
Stop Grant state. STPCLK # must be deasserted for a 
minimum of 5 clocks after ROY # or BROY # is re- 
turned active for the Stop Grant bus cycle before being 
asserted again (see Figure 3-1). 


There are two targets for the low-power mode supply 
current: 


- 20 mA-55 
mA in the Stop Grant state (fast wake- 
up, frequency- and voltage-dependent). 
and 
- 100 /LA-2oo /LA in the full Stop Clock state (slow 
wake-up, voltage-dependent). 


These are explained in further detail in Sections 3.5.2 
and 3.5.3. 


When the CPU recognizes a STPCLK # interrupt, the 
processor will stop execution on the next instruction 
boundary (unless superseded by a higher priority inter- 
rupt), stop the pre-fetch unit, empty all internal pipe- 
lines and the write buffers, generate a Stop Grant bus 
cycle, and then stop the internal clock. At this point the 
CPU is in the Stop Grant state. 


The CPU cannot respond to a STPCLK # request from 
an HLOA state because it cannot empty the write buff- 
ers and, therefore, cannot generate a Stop Grant cycle. 


The rising edge of STPCLK # will tell the CPU that it 
can return to program execution at the instruction fol- 
lowing the interrupted instruction. 


Unlike the normal interrupts, 
INTR 
and NMI, the 
STPCLK # 
interrupt 
does not initiate interrupt 
ac- 
knowledge cycles or interrupt table reads. Among ex- 
ternal 
interrupts, 
STPCLK # 's order of priority 
is 
shown as follows: 


1) RESET/SRESET 
2) FLUSH# 
3) SMI# 
4) NMI 
5) INTR 
6) STPCLK# 


STPCLK # will be recognized while in an interrupt 
service routine or an SMM handler. 


• 


A special Stop Grant bus cycle will be driven to the bus 
after the CPU recognizes the STPCLK # 
interrupt. 


The definition of this bus cycle is the same as the 
HALT cycle defmition for the standard Intel486 micro- 
processor, with the exception that the Stop Grant bus 
cycle drives the valueססoo OOIOHon the address pins. 
The system hardware must acknowledge this cycle by 
returning RDY # or BRDY #. The CPU will not enter 
the Stop Grant state until either RDY # or BRDY # 
has been returned. 


M/IO# 
= 0, D/C# 
= 0, W!R# = I, Address Bus 


=ססoo OOIOH(A4 = 
I), BE3#-BEO# 
= 
lOll, 


Data bus = undefined 


The latency between a STPCLK # request and the Stop 
Grant bus cycle is dependent on the current instruc- 
tion, the amount of data in the CPU write buffers, and 
the system memory performance. 


During the Stop Grant state, most output and input/ 
output signals of the microprocessor will maintain their 
previous condition (the level they held when entering 
the Stop Grant state). The data and data parity signals 


will be three-stated. In response to HOLD being driven 
active during the Stop Grant state (when the CLK in- 
put is running), the CPU will generate HLDA 
and 


three-state all output and input/output 
signals that are 


three-stated 
during 
the HOLD/HLDA 
state. After 


HOLD is de-asserted all signals will return to their pri- 
or state before the HOLD/HLDA 
sequence. 


Signal 
Type 
State 


A3-A2 
0 
Previous State 


A31-A4 
1/0 
Previous State 


D31-DO 
1/0 
Floated 


BE3#-BEO# 
0 
Previous State 


DP3-DPO 
1/0 
Floated 


W/R#, 
D/C#, 
M/IO# 
0 
Previous State 


ADS# 
0 
Inactive 


LOCK#, PLOCK# 
0 
Inactive 


BREQ 
·0 
Previous State 


HLDA 
0 
As per HOLD 


BLAST# 
0 
Previous State 


FERR# 
0 
Previous State 


PWT/PCD 
\ 
0 
Previous State 


PCHK# 
0 
Previous State 


SMIACT# 
0 
Previous State 


In order 
to achieve 
the lowest possible 
power consump- 


tion 
during 
the Stop 
Grant 
state, 
the system 
designer 


must ensure 
the input 
signals with pull-up 
resistors 
are 


not driven 
LOW 
and the input 
signals 
with pull-down 


resistors 
are not driven 
HIGH. 
(See Table 
2-10 in the 


Quick 
Pin 
Reference 
section 
for signals 
with 
internal 


pull-up 
and pull-down 
resistors). 


All inputs, 
except 
data 
bus pins must be driven 
to the 


power supply 
rails to ensure 
the lowest possible 
current 


consumption 
during 
Stop Grant 
or Stop Clock 
modes. 


For 
compatibility 
with 
future 
processors, 
data 
pins 


must be driven 
low to achieve the lowest possible power 


consumption. 
' 


The following 
state descriptions 
and diagram 
show the 


state transitions 
during 
a Stop Clock cycle for IX clock 


mode 
(including 
the Intel486 
DX2 
CPU 
clock mode). 
Refer to Figure 
3-2 for the Stop Clock state diagram. 


3.5.2 
STOP GRANT 
STATE 


The 
Stop 
Grant 
state 
(- 
20-55 mA) 
provides 
a fast 


wake-up 
state 
that 
can be entered 
by simply 
asserting 


the external 
STPCLK 
II interrupt 
pin. 
Once 
the Stop 


Grant 
bus cycle has been placed 
on the bus, and either 


ROY II or BROY II is returned, 
the 
CPU 
is in this 


- 20-55 mA state 
(depending 
on the CLK 
input 
fre- 
quency). 
The 
CPU 
returns 
to the 
normal 
execution 


state 
10-20 
clock 
periods 
after 
STPCLKII 
has been 


de-asserted. 


While 
in the Stop Grant 
state, 
the pull-up 
resistors 
on 
2 


STPCLKII 
and UPII 
are disabled 
internally. 
The sys- 


tem 
must 
continue 
to drive 
these 
inputs 
to the 
state 


4 Halt Auto Powerdown 
State 
HALT bus cycle 
generated 
1 Normal State 
~ 


CLKRunning 
- 
Normal Execution 
.. 
INTR.NMI. 
SMI#.RESEf 
STPCLK# asserted & 
STPCLK# 
SRESEf 
Stop Grant bus cycle 
de-asserted 
generated 
orRESEf. 
STPCLK# de-asserted 
, 


SRESET 
EADS# 
STPCLK# asserted & Stop Grant 
bus cycle generated 


1 
, 


5 Stop Clock Snoop State 
~ 
EADS# 
2 Stop Grant State 
~ 


One clock Powerup 
- 
CLKRunning 


Perform cache invalidation 
Icc - 20-55 mA 


Stop CLKl 
fStart CLK 
+ PLL Startup Latency 


3 Stop Clock State 


Internal Powerdown 
CLK Stopped" 
Icc - 100-200 uA 


241696-5 


'The 
system 
can change 
the input frequency 
within the specified 
range or completely 
stop the elK input frequency 
(input 


to Pll). 


they were in immediately before the CPU entered the 
Stop Grant state. For minimum CPU power consump- 
tion, all other input pins should be driven to their inac- 
tive level while the CPU is in the Stop Grant state. 


A RESET or SRESET will bring the CPU from the 
Stop Grant state to the Normal state. The CPU will 
recognize the inputs required for cache invalidation's 
(HOLD, AHOLD, BOFF# 
and EADS#) 
as explained 
later in this section. The CPU will not recognize any 
other inputs while in the Stop Grant state. Input signals 
to the CPU will not be recognized until I CLK after 
STPCLK # is de-asserted (see Figure 3-3). 


While in the Stop Grant state, the CPU will not recog- 
nize transitions on the interrupt signals (SMI #, NMI, 
and INTR). Driving an active edge on either SMI # or 
NMI will not guarantee recognition and service of the 
interrupt request following exit from the Stop Grant 
state. However, if one of the interrupt signals (SMI #, 
NMI, or INTR) is driven active while the CPU is in the 
Stop Grant state, and held active for at least one CLK 
after STPCLK # is de-asserted, the corresponding in- 
terrupt 
will be serviced. The SL Enhanced Intel486 
CPU product family requires INTR to be held active 
until the CPU issues an interrupt acknowledge cycle in 
order to guarantee recognition. This condition also ap- 
plies to the existing Intel486 CPUs (see Figure 3-3). 


When the CPU is in the Stop Grant state, the system is 
allowed to stop or change the CLK input. If the CPU 
clock input frequency is changed, the CPU will not 
return to the Stop Grant state until the CLK input has 
been running at a constant frequency for the time peri- 
od necessary for the PLL to stabilize. This constant 
frequency must be within the specified operating fre- 
quency range of the CPU. 


When the CLK 
input to the CPU is stopped (or 
changed), 
current 
members 
of the 
SL 
Enhanced 
Intel486 CPU family require the CLK input to be held 
at a constant frequency for a minimum of I ms before 


de-asserting STPCLK #. This I ms time period is nec- 
essary so that the PLL can stabilize, and it must be met 
before the CPU will return to the Stop Grant state. 
Future versions of the SL Enhanced Intel486 CPU fam- 
ily may require substantially less than I ms for the PLL 
to stabilize. In order to take advantage of a shorter PLL 
stabilization period, a system design should provide for 
a programmable 
time period between restarting 
the 
CPU clock and de-asserting STPCLK # . 


The CPU will generate a Stop Grant bus cycle only 
when entering that state from the Normal or the Auto 
HALT Power Down state. When the CPU enters the 
Stop Grant state from the Stop Clock state or the Stop 
Clock Snoop state, the CPU will not generate a Stop 
Grant bus cycle. 


Stop Clock state (~100-200 
/LA) is entered from the 
Stop Grant state by stopping the CLK input (either 
logic high or logic low). None of the CPU input signals 
should change state while the CLK input is stopped. 
Any transition on an input signal (with the exception of 
INTR, NMI and SMI) before the CPU has returned to 
the Stop Grant state will result in unpredictable behav- 
ior. If INTR is driven active while the CLK input is 
stopped, and held active until the CPU issues an inter- 
rupt acknowledge bus cycle, it will be serviced in the 
normal manner. The system design must ensure the 
CPU is in the correct state prior to asserting cache in- 
validation or interrupt signals to the CPU. 


The CPU will return to the - 20-55 mA Stop Grant 
state after the CLK input has been running at a con- 
stant frequency for a period of time equal to the PLL 
startup latency (see section 3.5.2). The CLK input can 
be restarted to any frequency between the minimum 
and maximum frequency listed in the A.C. timing spec- 
ifications. 
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The execution of a HALT instruction will also cause 
the CPU to automatically enter a ~ 20-55 mA state, 
called the Auto HALT Power Down state. The CPU 
will issue a normal HALT bus cycle when entering this 
state. The CPU will transition to the Normal state on 
the 
occurrence 
of 
INTR, 
NMI, 
SMI # , 
RESET, or SRESET. 


The system can generate a STPCLK # while the CPU 
is in the Auto HALT Power Down state. The CPU will 
generate a Stop Grant bus cycle when it enters the Stop 
Grant State from the HALT state. When the system 
de-asserts the STPCLK # interrupt, the CPU will re- 
turn execution to the HALT state. The CPU will gener- 
ate a new HALT bus cycle when it re-enters the HALT 
state from the Stop Grant state. 


3.5.5 
STOP CLOCK 
SNOOP 
STATE 
(CACHE 
INVALIDATIONS) 


When the CPU is in the Stop Grant state or the Auto 
HALT 
Power Down state, the CPU will recognize 
HOLD, AHOLD, BOFF# 
and EADS# 
for cache in- 
validation. When the system asserts HOLD, AHOLD, 
or BOFF #, the CPU will float the bus accordingly. 
When the system then asserts EADS #, the CPU will 
transparently enter the Stop Clock Snoop state and will 
power up for 1 full core clock in order to perform the 


required cache snoop cycle. It will then re-freeze the 
clock to the CPU core and return to the previous state. 
The CPU does not generate a bus cycle when it returns 
to the previous state. 


A FLUSH # event during the Stop Grant state or the 
Auto HALT Power Down state will be latched and 
acted upon by asserting the internal FLUSH # signal 
for one clock upon re-entering the Normal state. 


When the chip is known to be truly idle and waiting for 
a ready from a memory or I/O bus cycle read, the 
Inte1486 DX2 CPU will reduce its core clock rate to 
half of its original frequency without affecting perform- 
ance. When any ready is asserted, the part will return 
2 


to clocking the core at the doubled DX2 clock. This 
functionality is transparent 
to software and external 
hardware. This feature applies only to those members 
of the SL Enhanced Intel486 CPU family that have a 
doubled core clock rate. Auto Idle Power Down pro- 
vides an average power savings of 10%. 


3.6 
Supply Current Model for Stop 
Clock Modes and Transitions 


The following diagram illustrates the effect of different 
Stop Clock state transitions on the supply current. 


Ice 
A 
STPCLKlasserted. 
CLK not changed. 
- 
Normal State 


A' 
STPCLKI de•••• rted. CLK not changed 
• 
Stop Grant State 
~ort 
(IOCLK) latency 
o Stop Clock State 


B 
CLK changed. requires PLL startup 
25/f'iJ 
latency to re-enter Stop Grant state 
- 
~------. 
C 
Stop CLK (from Stop Grant to Stop Clock) 


20/40 
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4.0 
INTEL'S 
SYSTEM 
MANAGEMENT 
MODE ARCHITECTURE 


The SL Enhanced Intel486 microprocessor supports 
four modes: Real, Virtual-86, Protected and System 
Management Mode (SMM). As an operating mode, 
SMM has its distinct processor environment, interface 
and hardware/software features. 


SMM provides the system designer with a means of 
adding new software controlled features to their com- 
puter products which always operate transparent to the 
Operating System (OS) and software applications. 
SMM is intended for use only by system firmware, not 
by applications software or general purpose systems 
software. 


The SMM architectural extension consists of the fol- 
lowing elements: 
1. A System Management Interrupt (SMI#) hardware 
interface. 


2. A dedicated and secure memory space (SMRAM) 
for SMI handler code and CPU state (context) data 
with a status signal for the system to decode access 
to that memory space, SMIACT#. 
3. Resume (RSM) instruction, for exiting the System 
Management Mode. 
4. Special Features such as I/O Restart, for transpar- 


ent power management of I/O 
peripherals, and 
Auto HALT Restart. 
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The followingterms are used throughout the discussion 
of System Management Mode. 
SMM: 
System Management Mode. This is 
the operating environment that the 
processor (system) enters when the 
System Management Interrupt is be- 
ing serviced. 
SMI: 
System Management Interrupt. This 
is part of the SMM interface. When 
SMI# is asserted (SMI# pin asserted 
low) it causes the processor to invoke 
SMM. The SMI# 
pin is the only 
means of entering SMM. 


SMM handler: 
System Management Mode handler. 
This is the code that will be executed 
when the processor is in SMM. An 
example application that this code 
might implement is a power manage- 
ment control or a system control 
function. 
RSM: 
Resume instruction. This instruction 
is used by the SMM handler to exit 
the SMM and return to the interrupt- 
ed OS or Application process. 


SMRAM: 
This is the physical memory dedicat- 
ed to SMM. The SMM handler code 
and related data reside in this memo- 
ry. This memory is also used by the 
processor to store its context before 
executing the SMM handler. The op- 
erating system and applications do 
not have access to this memory space. 


Context: 
This term refers to the processor 
state. The SMM discussion refers to 
the context, or processor state, just 
before the processor invokes SMM. 
The context normally consists of the 
CPU registers that fully represent the 
processor state. 
Context Switch: A context switch is the process of ei- 
ther saving or restoring the context. 
The SMM discussion refers to the 
context switch as the process of sav- 
ing/restoring the context while invok- 
ing/exiting SMM, respectively. 


4.3 
System Management 
Interrupt 
Processing 


The system interrupts the normal program execution 
and invokes SMM by generating a System Management 
Interrupt (SMI#) 
to the CPU. The CPU will service 
the SMI # by executing the following sequence. 
1. The CPU asserts the SMIACT# 
signal, indicating 
to the system that it should enable the SMRAM. 
2. The CPU saves its state (context) to SMRAM, start- 
ing at address location 3FFFFH, proceeding down- 
ward in a stack-like fashion. 
3. The 
CPU 
switches to the System Management 
Mode processor environment (a pseudo-real mode). 
4. The CPU will then jump to the absolute address of 
38000H in SMRAM to execute the SMI handler. 
This SMI handler performs the system management 
activities. 
5. The SMI handler will then execute the RSM in- 
struction which restores the CPU's context from 
SMRAM, 
de-asserts the SMIACT# 
signal, and 
then returns control to the previously interrupted 
program execution. 


The System Management Interrupt hardware interface 
consists of the SMI # interrupt request input and the 
SMIACT # output used by the system to decode the 
SMRAM. 


4.3.1 SYSTEM MANAGEMENT INTERRUPT 
(SMI#) 


8MI # is a falling-edge triggered, non-maskable inter- 
rupt request signal. SMI # is an asynchronous signal, 
but setup and hold times, t20 and t21, must be met in 
order to guarantee recognition in a specific clock. The 
SMI # input need not remain active until the interrupt 
is actually serviced. The SMI # input only:needs to re- 
main active for a single clock if the required setup and 
hold times are met. SMI # will also work correctly if it 
is held active for an arbitrary number of clocks. 


The SMI # input must be held inactive for at least four 
clocks after it is asserted to reset the edge triggered 
logic. 
• 


CPU 
SMIACf#, 


'/ 
SMI# 
/ 
} 
SMllnterface 
I' 


241696-9 


SMI#, like NMI, is not affected by the IF bit in the 
EFLAGS register and is recognized on an instruction 
boundary. An SMI# will not break locked bus cycles. 
The SMI# has a higher priority than NMI and is not 
masked during an NMI. 


After the SMI# interrupt is recognized, the SMI# sig- 
nal will be masked internally until the RSM instruction 
is executed and the interrupt service routine is com- 
plete. Masking the SMI# prevents recursive SMI# 
calls. If another SMI# occurs while the SMI# 
is 
masked, the pending SMI# will be recognized and exe- 
cuted on the next instruction boundary after the cur- 
rent SMI# completes. This instruction boundary oc- 
curs before execution of the next instruction in the in- 
terrupted application code, resulting in back to back 
SMM handlers. Only one SMI# can be pending while 
SMI# is masked. 


The SMI# signal is synchronized internally and must 
be asserted at least three (3) eLK periods prior to as- 
serting the RDY # signal in order to guarantee recogni- 
tion on a specific instruction boundary. This is impor- 
tant for servicing an I/O trap with an SMI# handler. 
In order for SMI# to be recognized with respect to 
SRESET, SMI# should not be asserted until two (2) 
clocks after SRESET becomes inactive. 
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SMIACT if indicates that the CPU is operating in Sys- 
tem Management Mode. The CPU asserts SMIACT if 
in response to an SMI interrupt request on the SMI if 
pin. SMIACT if 
is driven active after the CPU has 


completed all pending write cycles (including emptying 
the 
write buffers), 
and 
before the 
first access to 


SMRAM when the CPU saves (writes) its state (or con- 
text) to SMRAM. SMIACTiF remains active until the 
last access to SMRAM when the CPU restores (reads) 
its state from SMRAM. The SMIACTiF signal does 
not float in response to HOLD. The SMIACT if signal 
is used by the system logic to decode SMRAM. 


NOTE: 


The number of CLKs required to complete the SMM 
state save and restore is very dependent on system 
memory performance. The values shown in Figure 4-4 


assume 0 wait-state memory writes (2 CLK cycles), 2- 
1-1-1 burst read cycles, and 0 wait-state non-burst 
reads (2 CLK cycles). Additionally, it is assumed that 
the data read during the SMM state restore sequence 
is not cacheable. 


As shown in Figure 4-4, the minimum time required to 
enter an SMI handler routine for the Intel486 OX CPU 
(from the completion of the interrupted instruction) is 
given by: 


Latency to beginning of SMI handler = A + B + C = 
306 CLKs 


and the minimum time required to return to the inter- 
rupted application (following the final SMM instruc- 
tion before RSM) is given by: 


Latency to continue interrupted application 
+G = 327 CLKs 


Gp' 
, 
, 
, 
, 
, 
, 
, 


A: Last ROY fI from non-SMM 
transfer 
to SMIACT fI assertion 
B: SMIACTfi 
assertion 
to first AOSfi 
for SMM state save 
C: SMM state save (dependent 
on memory 
performance) 
0: SMM handler 
E: SMM state restore 
(dependent 
on memory 
performance) 


F: last 
ROY fI from SMM transfer 
to de-assertion 
of SMIACT fI 
G: SMIACT fI de-assertion 
to first non-SMM 
AOSfi 


Intel486 
OX CPU 
2 CLK minimum 
40 CLK minimum 
Approximately 
264 ClKs 
User determined 
Approximately 
303 CLKs 
4 CLK minimum 
20 CLK minimum 


Intel486 
OX2 CPU 
2 CLK minimum 
20 CLK minimum 
Approximately 
139 CLKs 


User determined 
Approximately 
236 CLKs 


2 CLK minimum 
20 CLK minimum 


The CPU uses the SMRAM space for state save and 
state restore operations during an SMI. The SMI han- 
dIer, which also resides in SMRAM, uses the SMRAM 
space to store code, data and stacks. In addition, the 
SMI handler can use the SMRAM for system manage- 
ment information such as the system configuration, 
configuration of a powered-down device, and system 
designer-specific information. 


The CPU asserts the SMIACT# 
output to indicate to 
the memory controller that it is operating in System 
Management 
Mode. The system logic should ensure 
that only the CPU has access to this area. Alternate bus 
masters or DMA devices trying to access the SMRAM 
space when SMIACT # is active should be directed to 
system RAM in the respective area. 


The system logic is minimally required to decode the 
physical 
memory 
address 
range 
from 
38000H- 
3FFFFH as SMRAM area. The CPU will save its state 
to the state save area from 3FFFFH 
downward to 
3FEOOH. After saving its state the CPU will jump to 
the address location 38000H to begin executing the 
SMI handler. The system logic can choose to decode a 
larger area of SMRAM as needed. The size of this 
SMRAM can be between 32 KBytes and 4 Gbytes. 


The system logic should provide a manual method for 
switching the SMRAM 
into system memory space 
when the CPU is not in SMM. This will enable initiali- 
zation of the SMRAM space (i.e., loading SMI handler) 
before executing the SMI handler during SMM. See 
Figure 4-5. 


CPU accessesto 
systemaddress 
spaceusedfor 
loadingSMRAM 
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When 
the SMI # 
is recognized 
on an instruction 
boundary, the CPU core first sets the SMIACT# 
sig- 
nal LOW indicating to the system logic that accesses 
are now being made to the system-defined SMRAM 
areas. The CPU then writes its state to the state save 
area in the SMRAM. The state save area starts at CS 
Base + [8000H + 7FFFH1. The default CS Base is 
30000H, therefore the default state save area is at 
3FFFFH. In this case, the CS Base can also be referred 
to as the 5MBASE. 


If the 5MBASE Relocation feature is enabled, then the 
SMRAM addresses can change. The following formula 
is used to determine the relocated addresses where the 
context is saved. The context will reside at CS Base + 
[8000H + Register Offset], where the default initial CS 
Base is 30000H and the Register Offset is listed in the 
SMRAM state save map (Table 4-1). Reserved spaces 
will be used to accommodate new registers in future 
CPUs. The state save area starts at 7FFFH and contin- 
ues downward in a stack-like fashion. 


Some of the registers in the SMRAM state save area 
may be read and changed by the SMI handler, with the 
changed values restored to the processor registers by 
the RSM instruction. Some register images are read- 
only, and must not be modified (modifying these regis- 
ters will result in unpredictabl,f; behavior). The values 
stored in the areas marked reserved may change in fu- 
ture CPUs. An SMM handler should not rely on any 
values stored in an area that is marked as reserved. 


Systemmemory 
accessesredirected 
toSMRAM 


Systemmemory 
accessesnot 
redirected 
toSMRAM 


Normal 
memory 
space 


Register 
Offset 
Register 
Writeable? 


7FFC 
CRO 
NO 


7FF8 
CR3 
NO 


7FF4 
EFLAGS 
YES 


7FFO 
EIP 
YES 


7FEC 
EOI 
YES 


7EE8 
ESI 
YES 


7FE4 
EBP 
YES 


7FEO 
ESP 
YES 


7FOC 
EBX 
YES 


7F08 
EOX 
YES 


7F04 
ECX 
YES 


7FOO 
EAX 
YES 


7FCC 
OR6 
NO 


7FC8 
OR7 
NO 


7FC4 
TR' 
NO 


7FCO 
LOTR' 
NO 


7FBC 
GS' 
NO 


7FB8 
FS' 
NO 


7FB4 
OS' 
NO 


7FBO 
SS' 
NO 


7FAC 
CS' 
NO 


7FA8 
ES' 
NO 


7FA7-7F98 
Reserved 
NO 


7F94 
lOT Base 
NO 


7F93-7F8C 
Reserved 
NO 


7F88 
GOT Base 
NO 


7F87-7F04 
Reserved 
NO 


7F02 
Auto HALT RestartSlot(Word) 
YES 


7FOO 
I/OTrap RestartSlot(Word) 
YES 


7EFC 
SMM 
RevisionIdentifier 
(Oword) 
NO 


7EF8 
5MBASE 
Slot(Oword) 
YES 


7EF7-7EOO 
Reserved 
NO 


NOTES: 
'Upper 
two bytes are reserved 
Modifying 
a value 
that is marked 
as not writeable 
will result 
in unpredictable 
behavior. 


Words 
are stored 
in 2 consecutive 
bytes 
in memory 
with the low-order 
byte at the lowest 
address 
and the high-order 
byte in 
the high address. 


• 


The following registers are saved and restored (in areas 
of the state save that are marked reserved) but are not 
visible to the system software programmer: 


CRI, CR2 and CR4, hidden descriptor registers for CS, 
DS, ES, FS, GS. 


If an SMI request is issued for the purpose of powering 
down the CPU, the values of all reserved locations in 
the SMM state save must be saved to non-volatile mem- 
ory. 


The following registers are not automatically saved and 
restored by SMI # and RSM: 


DR5-DRO, 
TR7-TR3, 
FPU 
registers: STn, FCS, 
FSW, tag word, FP instruction pointer, FP op code, 
and operand pointer. 


For all SMI requests except for suspend/resume, 
these 


registers do not have to be saved as their contents will 
not change. During a power down suspend/resume 
however, a resume reset will clear these registers back 
to their default values. In this case, the suspend SMI 
handler should read these registers directly to save 
them and restore them during the power up resume. 
Anytime the SMI handler changes these registers in the 
CPU, it must also save and restore them. 


The RSM instruction, is only available to the SMI han- 
dier. The op code of the instruction is OFAAH. Execu- 
tion of this instruction while the CPU is executing out- 
side of SMM will cause an invalid op-code error. The 
last instruction of the SMI handler will be the RSM 
instruction. 


The RSM instruction restores the state save image from 
SMRAM back to the CPU, then returns control back 
to the interrupted program execution. There are three 
SMM features that can be enabled by writing to control 
"slots" in the SMRAM state save area. 


Auto HALT Restart. It is possible for the SMI request 
to interrupt the HALT state. The SMI handler can tell 
the RSM instruction to return control to the HALT 


instruction or to return control to the instruction fol- 
lowing the HALT instruction by appropriately setting 
the Auto HALT Restart slot. The default operation is 
to restart the HALT instruction. 


I/O Trap Restart. If the SMI # interrupt was generat- 
ed on an I/O access to a powered-down device, the 
SMI handler can tell the RSM instruction to re-execute 
that I/O instruction by setting the I/O Trap Restart 
slot. 


5MBASE 
Relocation. 
The system can relocate the 


SMRAM by setting the 5MBASE Relocation slot in 
the state save area. The RSM instruction will set the 
5MBASE in the processor based on the value in the 
5MBASE relocation slot. The 5MBASE must be 32K 
aligned. 


For 
further 
details 
on 
these 
SMM 
features, 
see 


Section 4.5. 


If the processor detects invalid state information, it en- 
ters the shutdown state; this happens only in the follow- 
ing situations: 
• The value stored in the 5MBASE slot is not a 32 


Kbyte-aligned address. 


• A reserved bit of CR4 is set to 1. 
• A combination of bits in CRO is illegal; namely, 


(PG= I and PE=O) or (NW= I and CD=O). 


In shutdown mode, the processor stops executing in- 
structions until an NMI interrupt is received or reset 
initialization is invoked. The processor generates a spe- 
cial bus cycle to indicate it has entered shutdown mode. 


4.4 
System Management 
Mode 
Programming 
Model 


4.4.1 
ENTERING 
SYSTEM 
MANAGEMENT 
MODE 


SMM is one of the major operating modes, on a level 
with Protected mode, Real address mode or virtual-86 
mode. Figure 4-6 shows how the processor can enter 
SMM from any of the three modes and then return. 


Reset 
OR 
PE=O 


Reset 
OR 
RSM 


The external signal SMI # 
causes the processor to 
switch to SMM. The RSM instruction 
exits SMM. 
SMM is transparent to applications programs and oper- 
ating systems because of the following: 


• The only way to enter SMM is via a type of non- 
maskable interrupt triggered by an external signal. 


• The processor begins executing SMM code from a 
separate address space, referred to earlier as system 
management RAM (SMRAM). 
• Upon entry into SMM, the processor saves the regis- 
ter state of the interrupted 
program in a part of 
SMRAM called the SMM context save space. 


• All interrupts 
normally handled by the operating 
system or by applications are disabled upon entry 
into SMM. 


• A special instruction, RSM, restores processor regis- 
ters from the SMM context save space and returns 
control to the interrupted program. 


SMM is similar to Real address mode in that there are 
no privilege levels or address mapping. An SMM pro- 
gram can execute all I/O and other system instructions 
and can address up to four Gbytes of m~mory. 


When an SMI # signal is recognized on an instruction 
execution boundary, the processor waits for all ~tores to 
complete, including emptying of the write buffers. The 
final write cycle is complete when the system returns 
RDY # 
or 
BRDY #. 
The 
processor 
then 
drives 
SMIACT # active, saves its register state to SMRAM 
space, and begins to execute the SMM handler. 


SMI # has greater priority than debug exceptions and 
external interrupts. This means that if more than one of 
these conditions occur at an instruction boundary, only 
the SMI # processing occurs, not a debug exception or 
external interrupt. Subsequent SMI # requests are not 
acknowledged while the processor is in SMM. The first 
SMI # interrupt request that occurs while the processor 
is in SMM is latched, and serviced when the processor 
exits SMM with the RSM instruction. Only one SMI # 
will be latched by the CPU while it is in SMM. 


When the CPU invokes SMM, the CPU core registers 
are initialized as follows: 


• 
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Register 
Contents 


General 
purpose 
registers 
Unpredictable 


EFLAGS 
00000002H 


EIP 
00008000H 


CS selector 
3000H 


CS base 
SMM Base (default 30000H) 


OS, ES, FS, GS, SS Selectors 
OOOOH 


OS, ES, FS, GS, SS Bases 
OOOOOOOOOH 


OS, ES, FS, GS, SS limits 
OFFFFFFFFH 


CRO 
Bits 0,2,3 & 31 cleared 
(PE, EM, TS & PG); others unmodified 


OR6 
Unpredictable 


OR? 
OOOOOOOOH 


The following is a summary of the key features in the 
SMM environment: 
I. Real mode style address calculation 
2. Gbyte limit checking 
3. IF flag is cleared 
4. NMI is disabled 
5. TF flag in EFLAGS is cleared; single step traps are 
disabled 
6. DR7 is cleared; debug traps are disabled 
7. The RSM instruction no longer generates an invalid 
op code error 
8. Default 16-bitop code, register and stack use. 


All bus arbitration (HOLD, AHOLD, BOFF#) inputs 
and bus sizing (BS8#, BSI6#) inputs operate normally 
while the CPU is in SMM. 


4.4.3 
EXECUTING 
SYSTEM 
MANAGEMENT 
MODE 
HANDLER 


The processor begins execution of the SMM handler at 
offset 8000H in the CS segment. The CS Base is initial- 
ly 30000H. The CS Base can be changed, however, us- 
ing the SMM Base relocationfeature. 


When the SMM handler is invoked, the CPU's PE and 
PG bits in CRO are reset to O.The processor is in an 
environment similar to Real mode, but without the 64 
Kbyte limit checking. However, the default operand 
size and the default address size are set to 16bits. 


The EM bit is cleared so that no exceptionsare generat- 
ed. (If the SMM was entered from Protected mode, the 
Real mode interrupt and exception support is not avail- 
able.) The SMI handler should not use floating point 
unit instructions until the FPU is properly detected 
(within the SMI handler) and the exception support is 
initialized. 


Because the segment bases (other than CS) are cleared 
to 0 and the segment limits are set to 4 Gbytes, the 
address space may be treated as a single flat 4 Gbyte 
linear space that is unsegmented. The CPU is still in 
Real mode and when a segment selector is loaded with 
a 16-bit value, that value is then shifted left by 4 bits 
and loaded into the segment base cache. The limits and 
attributes are not modified. 


In SMM, the CPU can access or jump anywhere within 
the 4 Gbyte logical address space. The CPU can also 
indirectly access or perform a near jump anywhere 
within the 4 Gbyte logical address space. 


4.4.3.1 
Exceptions 
and Interrupts 
within 
System 
Management 
Mode 


When the CPU enters SMM, it disables INTR inter- 
rupts, debug and single step traps by clearing the 
EFLAGS, DR6 and DR7 registers. This is done to pre- 
vent a debug application from accidentally breaking 
into an SMM handler. This is necessary because the 
SMM handler operates from a distinct address space 
(SMRAM) and hence the debug trap will not represent 
the normal system memory space. 


If an SMM handler wishesto use the debug trap feature 
of the processor to debug SMM handler code, it must 
first ensure that an SMM compliant debug handler is 
available. The SMM handler must also ensure DRO- 
DR3 is saved to be restored later. The debug registers 
DRO-DR3 and DR7 must then be initialized with the 
appropriate values. 


If the processor wishes to use the single step feature of 
the processor, it must ensure that an SMM compliant 
singlestep handler is availableand then set the trap flag 
in the EFLAGS register. 
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If the system design requires the processor to respond 
to hardware INTR requests while in SMM, it must en- 
sure that an SMM compliant interrupt handler is avail- 
able and then set the interrupt flag in the EFLAGS 
register (using the STI instruction). Software interrupts 
are not blocked upon entry to SMM, and the system 
software designer must provide an SMM compliant in- 
terrupt handler before attempting to execute any soft- 
ware interrupt instructions. Note that in SMM mode 
the interrupt vector table has the same properties and 
location as the Real mode vector table. 


NMI interrupts are blocked upon entry to the SMM 
handler. If an NMI request occurs during the SMM 
handler, it is latched and serviced after the processor 
exits SMM. Only one NMI request will be latched dur- 
ing the SMM handler. If an NMI request is pending 
when the processor executes the RSM instruction 
the 
NMI is serviced before the next instruction of the i~ter- 
rupted code sequence. 


Although NMI requests are blocked when the CPU 
enters .SMM, they may be enabled through software by 
executmg an lRET instruction. If the SMM handler 
requires the use of NMI interrupts, it should invoke a 
dummy interrupt service routine for the purpose of exe- 
cuting an IRET instruction. Once an IRET instruction 
is executed, NMI interrupt requests are serviced in the 
same "real mode" manner in which they are handled 
outside of SMM. 


The SMM revision identifier is used to indicate the ver- 
sion of SMM and the SMM extensions that are sup- 
ported by the processor. The SMM revision identifier is 
written during SMM entry and can be examined in 
SMRAM space at Register Offset 7EFCH. The lower 
word of the SMM revision identifier refers to the ver- 
sion of the base SMM architecture. The upper word of 
the SMM revision identifier refers to the extensions 
available. 


Bit 16 of the SMM revision identifier is used to indicate 
to the SMM handler that this processor supports the 
SMM I/O trap extension. If this bit is high, then this 2 
processor supports the SMM I/O trap extension. If this 
bit is low, then this processor does not support I/O 
trapping using the I/O trap slot mechanism. 


Bit 17 of this. slot indicates whether the processor sup- 
ports relocatIOn of the SMM jump vector and the 
SMRAM base address. 


All members of the SL Enhanced Intel486 CPU family 
support both the 110 Trap Restart and the 5MBASE 
relocation features. 


17 
16 


~ 
Register offset 7EFCH 


5MBASE Relocation 
. ~ 
I_I-------' 
SMM Revision 
10 trap with restart 
~---- 
Level 


• 
- 
Reserved 


Bits 
Value 
Comments 


16 
0 
Processor 
does not support 
I/O Trap Restart 
16 
1 
Processor 
supports 
I/O Trap Restart 
17 
0 
Processor 
does not support 5MBASE 
Relocation 
17 
1 
Processor 
supports 
5MBASE 
Relocation 


15 
1 
0 
~RegisterOffset7F02H 


HALTAutoRestart 


Value of 
Value of 
Comments 
Bit 0 at Entry 
Bit 0 at Exit 


0 
0 
Returns to next 
instruction 
in 
interrupted 
program. 
0 
1 
Unpredictable 
1 
0 
Returns to next 
instruction 
after 
HALT 
1 
1 
Returns to HALT 
state 


The Auto HALT Restart slot at register offset·(word 
location) 7F02H in SMRAM indicates to the SMM 
handler that the SMI interrupted 
the CPU during a 
HALT state (bit 0 of slot 7F02H is set to 1 if the previ- 
ous instruction was a HALT). If the SMI did not inter- 
rupt the CPU in a HALT state, then the SMI micro 
code will set bit 0 of the Auto HALT Restart slot to a 
value ofO. If the previous instruction was a HALT, the 
SMM handler can choose to either set or reset bit O.If 
this bit is set to I, the RSM micro code execution will 
force the processor to re-enter the HALT state. If this 
bit is set to 0 when the RSM instruction is executed, the 
processor will continue execution with the instruction 
just after the interrupted HALT instruction. Note that 
if the interrupted instruction was not a HALT instruc- 
tion (bit 0 is set to 0 in the Auto HALT Restart slot 
upon SMM entry), setting bit 0 to 1 will cause unpre- 
dictable behavior when the RSM instruction is execut- 
ed. 


If the HALT instruction is restarted, the CPU will gen- 
erate a memory access to fetch the HALT instruction 
(if it is not in the internal cache), and execute a HALT 
bus cycle. 


15 
0 
~--------IRegisteroffset7FOOH 
~L· 
110instructionrestartslot 
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Value 
Value 
Comments 
at Entry 
at Exit 


OOH 
OOH 
Do not restart trapped 
I/O 
instruction 
OOH 
OFFH 
Restart trapped 
I/O 
instruction 


The I/O instruction restart slot (register offset 7FOOH 
in SMRAM) gives the SMM handler the option of 
causing 
the 
RSM 
instruction 
to automatically 
re- 
execute the interrupted 
I/O 
instruction. 
When the 
RSM instruction is executed, if the I/O instruction re- 
start slot contains the value OFFH, then the CPU will 
automatically re-execute the I/O instruction that the 
SMI trapped. If the I/O instruction restart slot con- 
tains the value COHwhen the RSM instruction is exe- 
cuted, 
then 
the CPU will not re-execute the I/O 
instruction. The CPU automatically initializes the I/O 
instruction restart slot to OOHduring SMM entry. The 
I/O instruction restart slot should be written only when 
the processor has generated an SMI on an I/O instruc- 
tion boundary. 
Processor operation 
is unpredictable 
when the I/O instruction restart slot is set when the 
processor is servicing an SMI # that originated on a 
non-I/O instruction boundary. 


If the system executes back-to-back SMI requests, the 
second SMM handler must not set the I/O instruction 
restart slot (see Section 4.6.6). 


The SL Enhanced Intel486 CPU family provides a new 
control register, 5MBASE. The address space used as 
SMRAM can be modified by changing the 5MBASE 
register 
before 
exiting 
an 
SMI 
handler 
routine. 


5MBASE can be changed to any 32K aligned value 
(values that are not 32K aligned will cause the CPU to 
enter the shutdown state when executing the RSM in- 
struction). 
5MBASE is set to the default value of 
30000H on RESET, but is not changed on SRESET. If 
the 5MBASE register is changed during an SMM han- 
dier, all following SMI # requests will initiate a state 
save at the new 5MBASE. 


o 
I 
RegisterOffset7EF8H 
I 


SMMBase 
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The 5MBASE slot in the SMM state save area is a 
feature used to indicate and change the SMI jump vec- 
tor location and the SMRAM save area. When bit 17 of 
the SMM Revision Identifier is set then this feature 
exists and the SMRAM 
base and consequently the 
jump vector are as indicated by the SMM Base slot. 
During the execution of the RSM instruction, the CPU 
will read this slot and initialize the CPU to use the new 
5MBASE during the next SMI. During an SMI, the 
CPU will do its context save to the new SMRAM area 
pointed to by the 5MBASE, store the current 5MBASE 
in the SMM Base slot (offset 7EF8H), and then start 
execution of the new jump vector based on the current 
5MBASE. 


The 5MBASE must be a 32 Kbyte aligned, 32-bit inte- 
ger that indicates a base address for the SMRAM con- 
text save area and the SMI jump vector. For example 
when the processor first powers up, the minimum 
SMRAM area is from 38000H-3FFFFH. 
The default 
5MBASE is 30000H. Hence the starting address of the 
jump vector is calculated by: 


While the starting address for the SMRAM state save 
area is calculated by: 


Hence when this feature is enabled the SMRAM regis- 
ter map is addressed according to the above formulae. 


To change the SMRAM base address and SMI jump 
vector location, the SMM handler should modify the 
5MBASE slot. Upon executing an RSM instruction, 
the processor will read the 5MBASE slot and store it 
internally. Upon recognition of the next SMI request, 
the processor will use the new 5MBASE slot for the 
SMRAM dump and SMI jump vector. 


If the modified 5MBASE slot does not 
contain 
a 
32 KByte-aligned value, the RSM micro code will 
cause the CPU to enter the shutdown state. 


4.6 
SMM - System Design 
Considerations 


The hardware designed to control the SMRAM space 
must follow these guidelines: 
I. A provision should be made to allow for initializa- 
tion of SMRAM space during system boot up. This 
initialization of SMRAM space must happen before 
the first occurrence of an SMI II interrupt. Initializ- 
ing the SMRAM space must include installation of 
an SMM handler, and may include installation of 
related 
data 
structures 
necessary 
for 
particular 
SMM applications. The memory controller provid- 
ing the interface to the SMRAM should provide a 
means for the initialization code to manually open 
the SMRAM space. 
2. A 
minimum 
initial 
SMRAM 
address 
space of 
38000H-3FFFFH should be decoded by the memo- 
ry controller. 
3. Alternate bus masters (such as DMA controllers) 
should not be allowed to access SMRAM space. 
Only the CPU, either through SMI or during initial- 
ization, should be allowed access to SMRAM .. 


4. In order to implement a zero-volt suspend function, 
the system must have access to all of normal system . 
memory from within an SMM handler routine. If 
the SMRAM is going to overlay normal system 
memory, there must be a method of accessing any 
system memory that is located underneath SMRAM 
(see Section A.2.3). 


There 
are two potential 
schemes for locating 
the 
SMRAM, either overlaid to an address space on top of 
normal system memory, or placed in a distinct address 
space. See Figure 4-8. When SMRAM is overlaid on 
the top of normal system memory, the CPU output 
signal 
SMIACTII 
must 
be 
used 
to 
distinguish 
SMRAM 
from main system memory. Additionally, 
both the CPU internal cache and any second level 
caches must be empty before the first read of an SMM 
handler routine and before the first read of normal 
memory following an SMM handler routine. This is 
done by flushing the caches, and is required to maintain 
cache coherency. When the default SMRAM location 


• 


is used, SMRAM is overlaid on top of system main 
memory (at 38000H through 3FFFFH). 


If SMRAM is located in its own distinct memory space, 
which can be completely decoded with only the CPU 
address signals, it is said to be non-overlaid. In this 
case, there are no new requirements for maintaining 
cache coherency. 


The CPU does not unconditionally flush its cache be- 
fore entering SMM (this option is left to the system 
designer). If SMRAM is shadowed in a memory area 


Non-overlaid 
(no need to 
flush caches) 


Overlaid 
(caches must 
be flushed) 
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Cache must 
be empty 


that is visible to the application or operating system, it 
is necessary for the system to empty both the CPU 
cache and any second level cache before entering and 
after leaving SMM. That is, if SMRAM is in the same 
physical address 
location as the normal 
cacheable 


memory space, then an SMM read may HIT the cache 
which would contain normal memory space code/data. 
Likewise the normal read cycles after SMM may HIT 
the cache which may contain SMM code/data. In this 
case the cache should be empty before the first memory 
read cycle during SMM and before the first normal 
cycle after exiting SMM. 


The FLUSH # and KEN # signals can be used to en- 
sure cache coherency when switching between normal 
and SMM modes. Cache flushing during SMM entry is 
accomplished by asserting the FLUSH # 
pin when 


SMIACT # 
is driven active. Cache flushing during 


SMM exit is accomplished by asserting the FLUSH # 
pin after the SMIACT# 
pin is de-asserted (within I 


CLK). To guarantee this behavior, the constraints on 
setup and hold timings on the interaction of FLUSH # 
and SMIACT # as specified for a processor should be 
followed. 


If the SMRAM area is overlaid over normal memory 
and if the system designer does not want to flush the 
caches upon 
leaving SMM then references to the 


SMRAM area should not be cached. It is the obligation 
of the system designer to ensure that the KEN # pin is 
sampled inactive during all references to the SMRAM 
area. 


Instr Instr 


#4 
#5 


Cache must 
be empty 


Norma! 
Cycle 


RSM;-.: 


Nanna! 
Cycle 


RSM 


"" 4- 


Systems based on the MS-DOS operating system con- 
tain a feature that enables the CPU address bit A20 to 
be forced to O. This limits physical memory to a maxi- 
mum of 1 Mbyte, and is provided to ensure compatibil- 
ity with those programs that relied on the physical ad- 
dress wrap around functionality of the original IBM 
PC. The A20M # pin on SL Enhanced Intel486 CPUs 
provides this function. When A20M # is active, all ex- 
ternal bus cycles will drive A20 low, and all internal 
cache accesses will be performed with A20 low. 


The A20M # pin is recognized while the CPU is in 
SMM. The functionality of the A20M # input must be 
recognized in two instances: 
1. If the SMM handler needs to access system memory 
space above 1 Mbyte (for example, when saving 
memory 
to 
disk 
for a zero-volt 
suspend), 
the 
A20M # pin must be deasserted before the memory 
above 1 Mbyte is addressed. 


2. If SMRAM 
has been relocated to address space 
above 1 Mbyte, and A20M # is active upon entering 
SMM, the CPU will attempt to access SMRAM at 
the relocated address, but with A20 low. This could 
cause the system to crash, sincethere would beno valid 
SMM interrupt handler at the accessed location. 


In order to account for these two situations, the system 
designer must ensure that A20M # is deasserted on en- 
try to SMM. A20M # must be driven inactive before 
the first cycle of the SMM state save, and must be re- 
turned to its original level after the last cycle of the 
SMM state restore. This can be done by blocking the 
assertion of A20M# 
whenever SMIACT# 
is active. 


The system designer should take into account the fol- 
lowing restrictions while implementing the CPU Reset 
logic. 


\. When running software written for the 80286 CPU, 
a CPU RESET is used to switch the CPU from Pro- 
tected mode to Real mode. If we look at the relative 
interrupt priorities, RESET has higher priority than 
SMI #. When the CPU is in SMM, the SRESET to 
the CPU during SMM should be blocked until the 
CPU exits SMM. SRESET must be blocked begin- 
ning from the time when SMI # is driven active. 
Care should be taken not to block the global system 
RESET, which may be necessary to recover from a 
system crash. 


• 


2. During execution of the RSM instruction to exit 
SMM, there is a small time window between the de- 
assertion of SMIACT # and the completion of the 
RSM micro code. If a Protected mode to Real mode 
SRESET is asserted during this window it is possible 
that the SMRAM space will be violated. The system 
designer must guarantee that SRESET is blocked 
until at least 20 CPU clock cycles after SMIACT # 
has been driven inactive. 
3. Any request for a CPU RESET for the purpose of 
switching the CPU from Protected mode to Real 
mode must be acknowledged after the CPU has exit- 
ed SMM. In order to maintain software transparen- 
cy, the system logic must latch any SRESET signals 
which are blocked during SMM. 


For these reasons, the SRESET signal should be used 
for any soft resets, and the RESET signal should be 
used for all hard resets. 


4.6.5 
SMM AND SECOND 
LEVEL 
WRITE 
BUFFERS 


Before an SL Enhanced Intel486 CPU enters SMM, it 
empties its internal write buffers. This is necessary so 
that the data in the write buffers is written to normal 
memory space, not SMM space. Once the CPU is ready 
to begin writing an SMM state save to SMRAM, it 
asserts the SMIACT# 
signal. SMIACT# 
may be driv- 
en active by the CPU before the system memory con- 
troller has had an opportunity to empty the second lev- 
el write buffers. 


To prevent the data from these second level write buff- 
ers from being written to the wrong location, the sys- 
tem memory controller needs to direct the memory 
write cycles to either SMM space or normal memory 
space. This can be accomplished by saving the status of 
SMIACT# 
along with the address for each word in the 
write buffers. 


Special care must be taken when executing an SMM 
handler for the purpose of restarting an I/O instruc- 
tion. When the CPU executes a RSM instruction with 
the I/O restart slot set, the restored EIP is modified to 
point to the instruction 
immediately preceding the 
SMI # request, so that the I/O instruction can be re- 
executed. If a new SMI # request is received while the 
CPU is executing an SMM handler, the CPU will serv- 
ice this SMI # request before restarting the original I/O 
instruction. If the I/O restart slot is set when the CPU 
executes the RSM instruction for the second SMM han- 
dier, the RSM micro code will decrement the restored 
EIP again. EIP now points to an address different from 
the originally interrupted instruction, and the CPU will 
begin execution of the interrupted application code at 
an incorrect entry point. 


To prevent this from occurring, the SMM handler rou- 
tine must not set the I/O restart slot during the second 
of two consecutive SMM handlers. 


The default operand size and the default address size 
are 16 bits; however, operand-size override and ad- 
dress-size override prefixes can be used as needed to 
directly access data anywhere within the four Gbyte 
logical address space. 


With operand-size override prefixes, the SMM handler 
can use jumps, calls, and returns, to transfer control to 
any location within the four Gbyte space. Note, howev- 
er, the following restrictions: 


• Any control transfer that does not have an operand- 
size override prefix truncates EIP to 16 low-order 
bits. 


• Due to the Real mode style of base-address forma- 
tion, a long jump or call cannot transfer control to a 
segment with a base address of more than 20 bits 
(one megabyte). 


Upon entry into SMM, external interrupts that require 
handlers are disabled (the IF bit in the EFLAGS is 
cleared). This is necessary because, while the processor 
is in SMM, it is running in a separate memory space. 
Consequently the vectors stored in the interrupt 
de- 
scriptor table (IDT) for the prior mode are not applica- 
ble. Before allowing exception handling (or software in- 
terrupts), the SMM program must initialize new inter- 
rupt and exception vectors. The interrupt vector table 
for SMM has the same format as for Real mode. Until 
the interrupt 
vector table is correctly initialized, the 
SMM handler must not generate an exception (or soft- 
ware interrupt). Even though hardware interrupts are 
disabled, exceptions and software interrupts can still 
occur. Only a correctly written SMM handler can pre- 
vent internal exceptions. When new exception vectors 
are initialized, internal exceptions can be serviced. The 
following are the restrictions: 
I. Due to the Real mode style of base address forma- 
tion, an interrupt or exception cannot transfer con- 
trol to a segment with a base address of more than 
20 bits. 


2. An interrupt or exception cannot transfer control to 
a segment offset of more than 16 bits (64 KBytes). 


3. If exceptions or interrupts are allowed to occur, only 
the low order 16bits of the return address (EIP) are 
pushed onto the stack. If the offset of the interrupt- 
ed procedure is greater than 64 KBytes, it is not 


possible for the interrupt/exception 
handler to re- 
turn control to that procedure. (One work-around 
could be to perform software adjustment of the re- 
turn address on the stack). 
4. The 5MBASE Relocation feature affects the way the 
CPU will return from an interrupt or exception dur- 
ing an SMI handler. 


HALT should not be executed during SMM, unless in- 
terrupts have been enabled (see section 4.7.2). Inter- 
rupts are disabled in SMM and INTR, 
NMI, 
and 


SMI II are the only events that take the CPU out of 
HALT. 


4.7.4 
RELOCATING 
SMRAM 
TO AN ADDRESS 
ABOVE 
ONE MEGABYTE 


Within SMM (or Real mode), the segment base regis- 
ters can only be updated by changing the segment regis- 
ter. The segment registers contain only 16 bits, which 
allows only 20 bits to be used for a segment base ad- 
dress (the segment register is shifted left four bits to 
determine the segment base address). If SMRAM is re- 
located to an address above one megabyte, the segment 
registers 
can 
no longer be initialized 
to point 
to 


SMRAM. 


These areas can still be accessed by using address over- 
ride prefixes to generate an offset to the correct address. 
For example, if the 5MBASE has been relocated imme- 
diately below 16M, the DS and ES registers are still 
initialized toסס oo ooooH. We can still access data in 
SMRAM by using 32-bit displacement registers: 


mov esi,OOFFxxxxH ;64K segment immediately 
;below16M 


The RESET input must be used at power-up to initial- 
ize the CPU. The Reset input forces the CPU to begin 
execution at a known state. The microprocessor cannot 
begin execution of instructions until at least I ms after 
VCC and CLK have reached their proper D.C. and 
A.C. specifications. The RESET pin should remain ac- 
tive during this time to ensure proper microprocessor 
operation. However, for warm boot-ups RESET should 
remain active for at least 15 CLK periods. RESET is 


active HIGH. RESET is asynchronous but must meet 
setup and hold times t20 and t21 for recognition in any 
specific clock. The RESET signal is the same as the 
standard Intel486 CPU RESET signal with the follow- 
ing exceptions. 


RESET has a special function of resetting 5MBASE to 
the default value of 3OOOOH.If 5MBASE relocation is 
not used, the RESET signal can be used as the only 
reset. 


The microprocessor will be placed in the Power Down 
mode if UPII is sampled active at the falling edge of 
RESET. 


The SRESET (Soft RESET) input, has the same func- 
tions as RESET, but does not change the 5MBASE, 
and 
UP II 
is not sampled 
on the falling edge of 


SRESET. If 5MBASE relocation is used by the system, 
the soft resets should be handled using the SRESET 
input. The SRESET signal should not be used for the 
cold boot-up power-on reset. 


The SRESET input pin is provided to save the status of 
5MBASE during i286 CPU-compatible mode change. 
SRESET leaves the status of the on-chip FPU and 
5MBASE intact while resetting other units including 
the on-chip cache. For compatibility with future gener- 
ation Intel486 CPUs, the system should not rely on 
flushing the on-chip cache through the SRESET input 
pin. The FLUSH II input pin is provided for this pur- 
pose and should be used to flush the on-chip cache. 


SRESET should not be used to initiate test modes (this 
behavior is subject to change in future versions). 


While in reset, the SL Enhanced Intel486 microproces- 
sor bus is in the state shown in Figure 5-1, if the 
HOLD, AHOLD and BOFFII 
requests are inactive. 


Note that the address (A31-A2, BE311-BEOII) and cy- 
cle definition (MIlO 
II , D/CII, 
WIRII) 
pins are unde- 


fined from the time reset is asserted up to the start of 
the first bus cycle. All defined pins (except FERRII) 
assume known values at the beginning of the first bus 
cycle. The first bus cycle is always a code fetch to ad- 
dress OFFFFFFFOH. FERR II reflects the state of the 
ES (error summary status) bit in the floating point unit 
status word. The ES bit is initialized whenever the 
floating point unit state is initialized. 


Figures 5-1 and 5-2 show the bus state for the SL En- 
hanced Intel486 DX and SX CPUs when UPII is not 
asserted and for the SL Enhanced Intel486 SX CPU 
when UP II is asserted. 
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Figure 5·1. SL Enhanced Intel486TM DX CPU or SL Enhanced Intel486 SX CPU without UP", Asserted 
During SRESET or RESET 


NOTES: 
1. RESET 
is an asynchronous 
input. t20 must be met only to guarantee 
recognition 
on a specific 
clock 
edge. 


2a. When 
A20M'" 
is driven 
synchronously, 
it must be driven 
high (inactive) 
for the ClK 
edge prior to and the falling 
edge of 
RESET 
to ensure 
proper 
operation. 
A20M'" 
setup and hold times 
must be met. 


2b. When 
A20M", 
is driven 
asynchronously, 
it must be driven 
high (inactive) 
for two ClKs 
prior to and two ClKs 
after the 
falling 
edge of RESET 
to ensure 
proper 
operation. 


3a. When 
FLUSH'" 
is driven 
synchronously, 
it should 
be driven 
low (active) 
for the ClK 
edge 
prior 
to the falling 
edge 
of 
RESET 
to invoke 
the Three-State 
Output 
Test 
Mode. 
All outputs 
are guaranteed 
three-stated 
within 
10 ClKs 
of RESET 
being deasserted. 
FLUSH'" 
setup and hold times 
must be met. 


3b. When 
FLUSH", 
is driven 
asynchronously, 
it must be driven 
low (active) 
for two 
ClKs 
prior to and two ClKs 
after 
the 
falling 
edge of RESET 
to invoke 
tbe Three-State 
Output 
Test 
Mode. All outputs 
are guaranteed 
three-stated 
within 
10 ClKs 
of RESET 
being deasserted. 


4. AHOlD 
should 
be driven 
high (active) 
for the ClK 
edge prior to the falling 
edge of RESET to invoke 
the Suild-In-Self-Test 
(SIST). AHOlD 
setup and hold times 
must be met. 


5. Hold is recognized 
normally 
during 
RESET. 


6. 15 ClKs 
RESET 
pulse width 
for warm 
resets. 
Power-up 
resets 
require 
RESET 
to be asserted 
for at least 
1 ms after Vcc 
and ClK 
are stable. 


intel~ 


CLK2 


RESET 


OUTPUT 
SIGNALS 


INPUT! 
OUTPUT 
SIGNALS 


1. 15 ClKs reset pulse widthfor warm reset. Power-upresets require reset to be asserted for at least 1 ms after Vcc 
and ClK are stable. 
2. When UP# is active. the fallingedge of RESET willdisable all the internalinputbuffersexcept UP#. RESET and the 
internalinputbuffersof ClK willalso be disabled. 


The OX register always contains a component identifi- 
er at the conclusion of RESET. The upper byte of OX 
(DR) will contain 04 and the lower byte of OX (DL) 
will contain a CPU type/stepping identifier. 


Table 5·1. CPU ID Codes 


SLEnhanced 
Component 
Revision 
Intel486TM CPU 
ID (DH) 
ID (DL) 


Intel486 SX CPU 
04 
2x 
Intel486 
OX CPU 
04 
1x 
Intel486 
OX2 CPU 
04 
3x 


The SL Enhanced Inte1486 microprocessor family im- 
plements a new instruction 
that makes information 
available to software about the family, model and step- 
ping of the microprocessor on which it is executing. 
Support of this instruction is indicated by the presence 
of a user-modifiable bit in position EFLAGS.2I, 
re- 
ferred to as the EFLAGS.IO bit. The actual state of the 
EFLAGS.IO 
bit is irrelevant and provides no signifi- 
cance to the hardware. This bit is reset to zero upon 
device reset (RESET or SRESET) for compatibility 
with existing Intel486 processor designs. 


CPU 
EAX 
OPCODE 
Instruction 
Core 
Input 
Description 
Clocks 
Value 


14 
1 
CPU 
Identification 
OFA2 
CPUIO 


9 
o or> 
1 Intel string/ 
null registers 


The CPUIO instruction requires the user to pass an 
input parameter to the CPU in the EAX register. The 
CPU response is returned to the user in registers EAX, 
EBX, ECX and EOX. 
• When the parameter passed in EAX is zero, the reg- 


ister values returned upon instruction execution are: 


EAX[31:0] 
- 
I 
EBX[31:0] 
- 
756E6547 
ECX[31:0] 
- 
6C65746E 
EOX[31:0] 
- 
49656E69 


The values in EBX, ECX, and EOX indicate an Intel 
microprocessor. When taken in the proper order, they 
decode to the string "GenuineIntel". 


• 


• When the parameter passed in EAX is one, the reg- 


ister values returned are as follows: 
EAX[3:0] 
EAX[7:4] 


EAX[lS:12] 


EAX[3l:16] 
EBX[3l:0] 
ECX[31:0] 
EDX[3l:0] 


- 
Stepping IDo 
-Model 
i486 DX CPU = I 
i486 SX CPU = 2 
i486 DX2 CPU = 3 


- 
Family 
i486 CPU = 4 
-ססoo 
-RESERVED 
-0ס0ooooo 
-0ס0ooooo 
-ס0ooooo1 
bit 0 = FPU 


The value returned in EAX after CPUID instruction 
execution is identical to the value loaded into EDX 
upon device reset. Software must avoid any dependency 
upon the state of reserved processor bits. 
• When the parameter passed in EAX is greater than 


one, the register values returned upon instruction 
execution are: 
EAX[3l:0] 
EBX[3l:0] 
ECX[3l:0] 
EDX[3l:0] 


-0ס0ooooo 
-0ס0ooooo 
-0ס0ooooo 
-0ס0ooooo 


6.1.1 
3.3V D.C. CHARACTERISTICS 


Table 
6·1. 3.3V D.C. Specifications 
Functional operating range: Vcc = 3.3V ±0.3V; TCASE = O"Cto +85°C 


Symbol 
Parameter 
Mln 
Typ 
Max 
Unit 
TestCond. 


VIL 
Input LOW Voltage 
-0.3 
+0.8 
V 


VIH 
Input HIGH Voltage 
2.0 
Vcc+0.3 
V 
Note 4 


VIHC 
Input HIGH Voltage 
of CLK, CLK2 
Vcc-0.6 
Vcc+0.3 
V 


VOL 
Output 
LOW Voltage 
IOL = 2.0mA 
0.4 
V 


IOL = 100 uA 
0.2 
V 


VOH 
Output HIGH Voltage 
IOH = -2.0 
mA 
2.4 
V 
IOH = -100 
uA 
Vcc-0.2 
V 


Iccu 
UP# 
Active Supply Current 
15 
35 
mA 
Note 5 


III 
Input Leakage Current 
-15 
15 
,...A 
Note 1 


IIH 
Input Leakage Current 
200 
,...A 
Note 2 


IlL 
Input Leakage 
Current 
-400 
,...A 
Note 3 


ILO 
Output 
Leakage, Current 
-15 
15 
,...A 


CIN 
Input Capacitance 
10 
pF 
Note 6 


COUT 
Output or I/O Capacitance 
10 
pF 
Note 6 


CCLK 
CLK Capacitance 
6 
pF 
Note 6 


NOTES: 
1. This parameter is for inputs without pull-ups or pull downs and OV < VIN < Vcc. 
2. This parameter is for inputs with Pull downs and VIH = 2.4V. 
3. This parameter is for inputs with pull-ups and VIL = OAV. 
4. All inputs except CLK, CLK2. 
5. When the CPU is in Stop Grant state, the Iccu of the host CPU is less than 2 mA. 
6. Fc= 1 MHz; Not 100% tested. 


• 
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Vcc 
Parameter 
Operating 
Frequency 
Typical 
Maximum 


3.3V 
Icc Active 
25 MHz 
250mA 
315 mA 


33 MHz 
300mA 
385mA 


lec Stop Grant 
25 MHz 
20mA 
40mA 


33 MHz 
25mA 
50mA 


Ice Stop Clock(1) 
OMHz 
100 IJoA 
1 mA 


Vcc 
Parameter 
Operating 
Frequency 
Typical 
Maximum 


3.3V 
Ice Active 
33 MHz 
330mA 
415 mA 


lec Stop Grant 
33 MHz 
25mA 
50mA 


lec Stop Clock(1) 
OMHz 
100 IJoA 
1 mA 


Vcc 
Parameter 
Operating 
Frequency 
Typical 
Maximum 


3.3V 
lecActive 
40 MHz 
375 mA 
450 mA 


50 MHz 
460mA 
550mA 


Ice Stop Grant 
40 MHz 
20mA. 
40mA 


50 MHz 
23mA 
50mA 


Ice Stop Clock(1) 
OMHz 
100 IJoA 
1 mA 


NOTE: 
1. The VIH and VIL levels 
must be equal to Vcc 
and OV, respectively, 
in order to meet the Ice Stop Clock 
specifications. 


Table 6·5. 5V D.C. Specifications 


Functional operating range: VCC = 5V ± 5%; TCASE 
= O"Cto + 85°C 


Symbol 
Parameter 
Min 
Typical 
Max 
Unit 
Test Condition 


VIL 
Input lOW 
Voltage 
-0.3 
+0.8 
V 


VIH 
Input HIGH Voltage 
2.0 
Vcc+0.3 
V 


VOL 
Output 
lOW 
Voltage 
0.45 
V 
Note 1 


VOH 
Output 
HIGH Voltage 
2.4 
V 
Note 2 


Iccu 
UP# 
Active Supply Current 
25 
50 
mA 
Note 6 


III 
Input leakage 
Current 
-15 
15 
/LA 
Note 3 


IIH 
Input leakage 
Current 
200 
/LA 
Note 4 


IlL 
Input leakage 
Current 
-400 
/LA 
Note 5 


ILO 
Output leakage 
Current 
-15 
15 
/LA 


CIN 
Input Capacitance 
PGA 
20 
pF 
Note 7 
PQFP 
10 
pF 


COUT 
Output or I/O Capacitance 
PGA 
20 
pF 
Note 7 
PQFP 
10 
pF 


CCLK 
ClK 
Capacitance 
PGA 
20 
pF 
Note 7 
PQFP 
6 
pF 


NOTES: 
1. This parameter is measured at: 
Address, Data, BEn 
4.0 mA 
Definition, Control 
5.0 mA 
2. This parameter is measured at: 
Address, Data, BEn 
-1.0 mA 
Definition, Control 
- 0.9 mA 


3. This parameter is for inputs without pull-ups or pull-downs and OV " 
VIN " 
Vcc. 
4. This parameter is for inputs with pull-downs and VIH = 2.4V. 
5. This parameter is for inputs with pull-ups and VIL = 0.45V. 
6. When the CPU is in Stop Grant state, the Iccu of the host CPU is less than 2 mA. 
7. Fc= 1 MHz; Not 100% tested. 
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Vcc 
Parameter 
Operating 
Frequency 
Typical 
Maximum 


5V 
lee Active 
25 MHz 
430 mA 
560mA 


33 MHz 
590 mA 
685mA 


lee Stop Grant 
25 MHz 
35mA 
65mA 


33 MHz 
40mA 
80mA 


lee Stop Clock(1) 
OMHz 
200,...A 
2mA 


Vcc 
Parameter 
Operating 
Frequency 
Typical 
Maximum 


5V 
lee Active 
33 MHz 
500mA 
630 mA 


50 MHz 
775mA 
950mA 


lee Stop Grant 
33 MHz 
40mA 
80mA 


50 MHz 
55mA 
100 mA 


lee Stop Clock(1) 
OmHz 
200,...A 
2mA 


Vcc 
Parameter 
Operating 
Frequency 
Typical 
Maximum 


5V 
lee Active 
50 MHz 
775mA 
950 mA 


66 MHz 
975 mA 
1200 mA 


lee Stop Grant 
50 MHz 
35mA 
70mA 


66 MHz 
45mA 
90mA 


lee Stop Clock(1) 
OMHz 
200,...A 
2mA 


NOTE: 
1. The VIH and VIL levels must be equal to Vcc and av, respectively, in order to meet the Ice Stop elock specifications. 


5000 


4500 


4000 


3500 


3000 
Typical Power 2500 
(mWATIS) 
2000 


1500 


1000 


500 
o 
o 


.........•• 


~ 
~ 
,/ 
.....---./ 
-+- 5V ;486 SX CPU 
¥/- 
_ 
5V ;486 OX, OX2 


~ 
-A-- 
3.3V ;486 SX, OX, OX2 


~ 
....,j~ 
~ 
7 
---- 
- 
./"~ - 


Figure 6·1. Frequency vs Power (Typ) in 1X ClK Mode for Sl 
Enhanced Intel486 SX, OX and OX2 CPUs 


The A.C. specifications given in the tables of this section consist of output delays, input setup requirements and input 
hold requirements. All A.C. specifications are relative to the rising edge of the input system clock (CLK) unless 
otherwise specified. 


Table 6·9. 5V A.C. Characteristics 
(1X Clock) 
Frequency = 25 and 33 MHz (Preliminary Information) 
Functional operating range: VCC = SV ± S%; TCASE = O"Cto + 8SoC;CL = SOpF unless otherwise specified. 


Symbol 
Parameter 
Mln 
Max 
Mln 
Max 
Unit 
Notes 


Frequency 
8 
25 
8 
33 
MHz 
Note 1 


t1 
ClK Period 


, 


40 
125 
30 
125 
ns 


t1a 
ClK Period Stability 
0.1 
0.1 
% 
Adjacent clocks 


t2 
ClK High Time 
14 
11 
ns 
at2V 


t3 
ClKlowTime 
14 
11 
ns 
atO.8V 
4 
ClK Fall Time 
4 
3 
ns 
2VtoO.8V 


ts 
ClK Rise Time 
4 
3 
ns 
0.8Vto2V 


tt> 
A2-A31, PWT, PCD, BEO-3#<,M/IO#, 
3 
19 
3 
16 
ns 
D/C#, 
W/R#, 
ADS#, lOCK#, 
BREQ, 


HlDA, SMIACT#, FERR#· Valid Delay 


t7 
A2-A31, PWT, PCD, BEO-3#, M/IO#, 
28 
20 
ns 
Note 2 


D/C#, 
W/R#, 
ADS#, lOCK#, 
BREQ, 


HlDA Float Delay 


ta 
PCHK# Valid Delay 
3 
24 
3 
22 
ns 


tea 
BLAST#, PlOCK# Valid Delay 
3 
24 
3 
20 
ns 


tg 
BLAST#, PLOCK# Float Delay 
28 
20 
ns 
Note 2 


t10 
00-031, 
DPO-DP3 Write Data Valid Delay 
3 
20 
3 
18 
ns 


• 
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Table 6·9. 5V A.C. Characteristics 
(1X Clock) 
Frequency 
= 25 and 33 MHz (Preliminary 
Information) 
(Continued) 
Functional 
operating 
range: 
V CC = SV ± S%; T CASE = O"C to + 8SoC; CL = SO pF unless otherwise 
specified. 


Symbol 
Parameter 
Min 
Max 
Min 
Max 
Unit 
Notes 


tll 
00-031, 
DPO-DP3 
Write Data Float Delay 
28 
20 
ns 
Note 2 


t12 
EADS # Setup Time 
8 
5 
ns 


t13 
EADS# 
Hold Time 
3 
3 
ns 


t14 
KEN#, 
BS16#, 
BS8# 
Setup Time 
8 
5 
ns 


t15 
KEN#, 
BS16#, 
BS8# 
Hold Time 
3 
3 
ns 


t16 
RDY#, 
BRDY# 
Setup Time 
8 
5 
ns 


t17 
RDY#, 
BRDY# 
Hold Time 
3 
3 
ns 


t16 
HOLD, AHOLD Setup Time 
10 
6 
ns 


t16a 
BOFF # Setup Time 
10 
8 
ns 


t19 
HOLD, AHOLD, 
BOFF # Hold Time 
3 
3 
ns 


t20 
FLUSH#, 
A20M#, 
NMI,INTR, 
SMI#, 
10 
5 
ns 
STPCLK#, 
SRESET, 
RESET,IGNNE#* 
Setup Time 


t21 
FLUSH#, 
A20M#, 
NMI,INTR, 
SMI#, 
3 
3 
ns 
STPCLK#, 
SRESET, 
RESET,IGNNE#* 
Holdime 


t22 
DO-D31, 
DPO-DP3, 
A4-A31 
Read Setup Time 
5 
5 
ns 


t23 
DO-D31, 
DPO-DP3, 
A4-A31 
Read Hold Time 
3 
3 
ns 


NOTES: 
·Present only in the SL Enhanced Intel466 OX and OX2 CPUs. 
1. 0 MHz operation is guaranteed when the STPCLK# and STOP GRANT bus cycle protocol is used. 
2. Not 100% tested, guaranteed by design characterization. 
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Table 
6-10. 5V A.C. Characteristics 
(1X Clock) 
Frequency 
= 50 MHz (Inte1486 
OX CPU) 


(Preliminary 
Information) 
Functional operating range: VCC = 5V ± 5%; TCASE = O"Cto + 85°C; CL= 0 pF unless otherwise specified. 


Symbol 
Parameter 
Min 
Max 
Unit 
Notes 


Frequency 
16 
50 
MHz 
Note 1 


t1 
ClK 
Period 
20 
62.5 
ns 


t1a 
ClK 
Period Stability 
0.1 
% 
Adjacent 
clocks 


t2 
ClK 
High Time 
7 
ns 
at2V 


t3 
ClKlowTime 
7 
ns 
at 0.8V 


~ 
ClK 
Fall Time 
2 
ns 
2Vto 
0.8V 


t5 
ClK 
Rise Time 
2 
ns 
0.8Vto 
2V 


ts 
A2-A31, 
PWT, PCD, BEO-3#, 
M/IO#, 
D/C#, 
3 
12 
ns 
W/R#, 
ADS#, 
lOCK#, 
BREQ, HlDA, 


SMIACT #, FERR # Valid Delay 


t] 
A2-A31, 
PWT, pca, BEO-3#, 
M/IO#, 
O/C#, 
18 
ns 
Note 2 
W/R#, 
ADS#, 
lOCK#, 
BREQ, HlDA 
Float 
Delay 


ta 
PCHK# 
Valid Delay 
3 
14 
ns 


t8a 
BLAST #, PLOCK # Valid Delay 
3 
12 
ns 


t9 
BLAST #, PLOCK # Float Delay 
18 
ns 
Note 2 


t10 
DO-031, 
DPO-DP3 
Write Data Valid Delay 
3 
12 
ns 


t11 
DO-031, 
DPO-DP3 
Write Data Float Delay 
18 
ns 
Note 2 


t12 
EADS # Setup Time 
5 
ns 


t13 
EADS# 
Hold Time 
2 
ns 


t14 
KEN#, 
BS16#, 
BS8# 
Setup Time 
5 
ns 


t15 
KEN#, 
BS16#, 
BS8# 
Hold Time 
2 
ns 


t16 
RDY #, BRDY # Setup Time 
5 
ns 


t17 
RDY #, BROY # Hold Time 
2 
ns 


t18 
HOLD, AHOlD, 
BOFF # Setup Time 
5 
ns 


t19 
HOLD, AHOlD, 
BOFF # Hold Time 
2 
ns 


t20 
FlUSH#, 
A20M#, 
NMI, INTR, SMI#, 
5 
ns 
STPClK#, 
SRESET, 
RESET,IGNNE# 
Setup 
Time 


t21 
FlUSH#, 
A20M#, 
NMI, INTR, SMI#, 
2 
ns 
STPClK#, 
SRESET, 
RESET,IGNNE# 
Hold 
Time 


t22 
DO-D31, 
DPO-OP3, 
A4-A31 
Read Setup Time 
4 
ns 


t23 
DO-D31, 
DPO-DP3, 
A4-A31 
Read Hold Time 
2 
ns 


NOTES: 
1.0 MHz operation is guaranteed when the STPCLK# and STOP GRANT bus cycle protocol is used. 
2. Not 100% tested, guaranteed by design characterization. 
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The following 
specifications 
are different 
for existing 
Intel486 
DX2 CPU products. 
A system board 
that will support 
all of the SL Enhanced 
Intel486 
CPU products 
should 
be designed 
to the worst-case 
specifications 
of the 2S and 33 


MHz local bus timings. 


Table 6-11. A.C. Characteristics 
(1X Clock) 
Frequency = 50 and 66 MHz (Inte1486 DX2 CPU) 
(Preliminary Information) 


Functional 
operating 
range: VCC = SV ± S%; TCASE = O"C to + 8SoC; CL = SO pF unless otherwise 
specified. 


DX2-50 
DX2-66 


Symbol 
Parameter 
Mln 
Max 
Mln 
Max 
Unit 


Frequency 
50 
66 
MHz 


ClK Frequency 
8 
25 
8 
33 
MHz 


ts 
A2-A31, PWT, PCD, BEO-3#, M/IO#, 
14 
ns 
D/C#, 
W/R#, 
ADS#, lOCK#, 
BREQ, 
HlDA, SMIACT#, FERR# Valid Delay 


t8 
PCHK# Valid Delay 
14 
ns 


t8a 
BLAST#, PLOCK# Valid Delay 
14 
ns 


tlO 
00-031, 
DPO-DP3 Write Data Valid Delay 
14 
ns 


t18 
HOLD, AHOlD Setup Time 
8 
ns 


t18a 
BOFF# Setup Time 
8 
7 
ns 


t20 
FlUSH#, 
A20M#, NMI,INTR, SMI#, 
8 
ns 
STPClK#, 
SRESET, RESET,IGNNE# 
Setup Time 


NOTES: 
1.0 MHz operation is guaranteed 
when the STPCLK# 
and STOP GRANT 
bus cycle protocol is used. 


2. Not 100% tested, 
guaranteed 
by design characterization. 


6.2.2 3.3V A.C. CHARACTERISTICS 


Table 6·12. 3.3V A.C. Characteristics 
(1X Clock Option) 
Frequency = 20 (Intel486 DX2-40 CPU), 25, (Intel486 
DX2-50 and SX·25 CPUs) and 33 MHz. (Preliminary Information) 


Functional 
operating 
range: Vcc 
= 3.3V ±0.3V; 
TCASE = O"C to + 85°C; CL = 50 pF unless otherwise 
specified. 


Symbol 
Parameter 
Mln Max Min Max Mln Max Figure 
Unit 
Notes 


Frequency 
8 
20 
8 
25 
8 
33 
MHz 
Note 1 


tl 
ClK Period 
50 
125 
40 
125 
30 
125 
ns 


tla 
ClK Period Stability 
0.1 
0.1 
0.1 
% 
Adjacent clocks 


t2 
ClK High Time 
16 
14 
11 
ns 
at2V 


t3 
ClKlowTime 
16 
14 
11 
ns 
at 0.8V 
4 
ClK Fall Time 
6 
4 
3 
ns 
2VtoO.8V 


t5 
ClK Rise Time 
6 
4 
3 
ns 
0.8Vto2V 


ts 
A2-A31, PWT, PCD, BEO-3#, 
3 
23 
3 
19 
3 
16 
ns 
M/IO#, 
D/C#, 
W/R#, 
ADS#, 


lOCK#, 
BREQ, HlDA, SMIACT#, 


FERR#. Valid Delay 


t7 
A2-A31, PWT, PCD, BEO-3#, 
37 
28 
20 
ns 
Note 2 
M/IO#, 
D/C#, 
W/R#, 
ADS#, 


lOCK#, 
BREQ, HlDA Float Delay 


t8 
PCHK# Valid Delay 
3 
28 
3 
24 
3 
22 
ns 


tea 
BLAST#, PLOCK# Valid Delay 
3 
28 
3 
24 
3 
20 
ns 


t9 
BLAST#, PLOCK# Float Delay 
37 
28 
20 
ns 
Note 2 


tlO 
DO-D31, DPO-DP3 
3 
26 
3 
20 
3 
19 
ns 
Write Data Valid Delay 


tll 
DO-D31, DPO-DP3 
37 
28 
20 
ns 
Note 2 
Write DAta Float Delay 


t12 
EADS# Setup Time 
10 
8 
6 
ns 


t13 
EADS# Hold Time 
3 
3 
3 
ns 


t14 
KEN#, BS16#, BS8# Setup Time 
10 
8 
6 
ns 


t15 
KEN#, BS16#, BS8# Hold Time 
3 
3 
3 
ns 


t16 
RDY#, BRDY# Setup Time 
10 
8 
6 
ns 


t17 
RDY#, BRDY# Hold Time 
3 
3 
3 
ns 


t18 
HOLD, AHOlD Setup Time 
12 
10 
6 
ns 


t18a 
BOFF# Setup Time 
12 
10 
9 
ns 


t19 
HOLD, AHOlD, BOFF# Hold Time 
3 
3 
3 
ns 


• 


Table 
6-12. 3.3V A.C. Characteristics 
(1X Clock 
Option) 
Frequency 
= 20, (Inte1486 
DX2-40 CPU), 25, (Inte1486 
DX2-50 and SX·25 CPUs) and 33 MHz. (Preliminary 
Information) 


Functional 
operating 
range: VCC = 3.3V ± 0.3V; T CASE = O°C to + 85°C; CL = 50 pF unless otherwise 
specified. 


Symbol 
Parameter 
Mln 
Max 
Mln 
Max 
Mln 
Max 
Figure 
Unit 
Notes 


t20 
FLUSH#, 
A20M#, 
NMI, INTR, SMI#, 
12 
10 
6 
ns 
STPCLK#, 
SRESET, 
RESET, IGNNE#' 
Setup 
Time 


t21 
FLUSH#, 
A20M#, 
NMI,INTR, 
SMI#, 
3 
3 
3 
ns 
STPCLK#, 
SRESET, 
RESET, IGNNE#' 
Hold 
Time 


t22 
00-031, 
OPO-OP3, 
A4-A31 
Read Setup Time 
6 
6 
6 
ns 


t23 
00-031, 
OPO-OP3, 
A4-A31 
Read Hold Time 
3 
3 
3 
ns 


NOTES: 
• Present only in the SL Enhanced Intel486 OX and DX2 CPUs. 
1. 0 MHz operation is guaranteed when the STPCLK# and STOP GRANT bus cycle protocol is used. 
2. Not 100% tested, guaranteed by design characterization. 


INPUT_S 
f_IX_V_AU_~_I: __ t_,t 


Ix = t12, t14, t16, t18, t22, t18a 
ty = t13, t15, t17, t19, t21, t23 


Output Valid Delay 
Ix = t6, t8a, t10 
Output Float Delay 
ty = t7, t9, t11 


Figure 
6·4. Output 
Valid and Float 
Delay Timing 


~~ 
VALID X 
_ 


PCH_K# 
__ 
18 _-J:-8=V=ALIo=>e 


1 14 ref 
+ 
t 
O· Mill 
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7· Max 
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3t008 
f 
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-.jI-0.13 Min 
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Un_.: mm 
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Tolerance 
Window 
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Lead Skew lrom Theoretical 
True PO'Kion 


•• 


The SL Enhanced Intel486 microprocessors are speci- 
fied for operation when TC (the case temperature) is 
within the range of 0 ·C - 
85 ·C. Tc may be measured 
in 
any 
environment 
to 
determine 
whether 
the 
Inte1486 microprocessor is within the specified operat- 
ing range. The case temperature, with and without heat 
sink should be measured using a 0.005" 
diameter 
(AWG # 36) thermocouple with a 90· angle adhesive 
bond at the center of the package top surface, opposite 
the pins. Figures 6-7 and 6-8 illustrate this methodol- 
ogy. 
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The ambient temperature can be calculated from 8JC 
and 8JA from the following equations. 


TJ = Tc + P • 8JC 
TA = TJ - 
P • 8JA 
TC = TA + P' 
[8JA-8Jd 


Where TJ. TA. Tc 
= Junction, 
Ambient and Case 
Temperature. 
respectively. 8JC, 8JA = 
Junction-to- 
Case and Junction-to-Ambient 
thermal Resistance, re- 
spectively. 


The values for 8JA and 8JC are given in the following 
tables for a variety of packages and operating frequen- 
cies. 


Refer to the OverDrive Processor socket section for 
OverDrive processor 8JA and 8JC values. 


Figure 6·7. Case Temperature Measurement without Heat Sink (0.005" Dia. Thermocouple 
on the 
Center of the Package Top Surface with a 90· Angle Adhesive Bond) 


Figure 6·8. Case Temperature Measurement with Heat Sink (0.005" Dla. 
Thermocouple 
on the Center of the Package Top Surface with a 90· Angle 
Adhesive Bond through a Hole Drilled at the Heat Sink Base) 


8JC 
8JA Ys.Airflow· ft/min. (m/sec) 


0 
200 
400 
600 
800 
1000 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 
(5.07) 
I 
With Heat Sink 
1.5 
13 
8.0 
6.0 
5.0 
4.5 
4.25 
I 
Without 
Heat Sink 
1.5 
17 
14.5 
12.5 
11.0 
10.0 
9.5 


8JC 
8JA Ys.Airflow· ft/min. (m/sec) 


0 
200 
400 
600 
(0) 
(1.01) 
(2.03) 
(3.04) 


I 
With Heat Sink 
3.5 
17.0 
10.5 
8.5 
8.0 


I 
Without 
Heat Sink 
3.5 
20.5 
16.5 
14.0 
12.5 


8JA Ys.Airflow· ft/mln. (m/sec) 


0 
200 
400 
600 
(0) 
(1.01) 
(2.03) 
(3.04) 


Intel486 
SX CPU Without 
Heat Sink 
36.0 
27.5 
25.0 
22.5 


Intel486 
OX CPU Without 
Heat Sink 
25.0 
17.5 
15.0 
13.0 


Intel486 
OX2 CPU Without 
Heat Sink 
24.0 
17.0 
15.0 
13.0 


8JC Ys.Airflow· ft/min. (m/sec) 


0 
200 
400 
600 
(0) 
(1.01) 
(2.03) 
(3.04) 


I 
Intel486 
SX CPU 
4.0 
7.5 
8.0 
8.5 


I 
Intel486 
OX and OX2 CPUs 
3.5 
6.0 
6.0 
6.0 


The following tables show maximum ambient temperatures of SL Enhanced Intel486 CPUs for each package and 
operating frequency. The maximum ambient temperatures listed below are not valid for the OverDrive Processor. 


Freq. 
Airflow· ft/min. (m/sec) 


0 
200 
400 
600 
800 
1000 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 
(5.07) 


TambientOC 
33 
49 
65 
71 
74 
76 
76 
With Heat Sink 
50 
30 
54 
64 
68 
71 
72 


66 
16 
46 
58 
64 
67 
69 


TambientOC 
33 
36 
44 
50 
55 
58 
60 
Without 
Heat Sink 
50 
11 
23 
33 
40 
45 
47 


66 
-8 
7 
19 
28 
34 
37 
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Freq. 
Airflow 
- ft/min. 
(m/sec) 


0 
200 
400 
600 
800 
1000 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 
(5.07) 


Tambient·C 
25 
53 
67 
72 
75 
77 
77 
With Heat Sink 
33 
46 
63 
70 
73 
75 
76 


Tambient·C 
25 
42 
49 
54 
58 
61 
63 
Without 
Heat Sink 
33 
32 
40 
47 
52 
56 
58 


Freq. 
Airflow 
- ft/min. 
(m/sec) 


0 
200 
400 
600 
(0) 
(1.01) 
(2.03) 
(3.04) 


Tambient·C 
33 
42 
63 
69 
71 
With Heat Sink 


Tambient·C 
33 
31 
44 
52 
57 
Without 
Heat Sink 


Freq. 
Airflow· 
ft/min. 
(m/sec) 


0 
200 
400 
600 
(0) 
(1.01) 
(2.03) 
(3.04) 


Tambient·C 
25 
47 
65 
71 
72 
With Heat Sink 
33 
39 
61 
68 
70 


Tambient·C 
25 
37 
49 
56 
60 
Without 
Heat Sink 
33 
27 
40 
49 
54 


Freq. 
Airflow 
- ft/mln. 
(m/sec) 


0 
200 
400 
600 
(0) 
(1.01) 
(2.03) 
(3.04) 


I 


Tambient·C 
25 
51.1 
64.0 
67.0 
70.0 
Without 
Heat Sink 
33 
44.5 
59.5 
63.5 
67.0 
Intel486 
SX CPU 


Freq. 
Airflow· 
ft/min. 
(m/sec) 


0 
200 
400 
600 
(0) 
(1.01) 
(2.03) 
(3.04) 


I 


Tambient·C 
33 
55.5 
69.0 
72.5 
75.5 
Without 
Heat Sink 
Intel486 
OX CPU 


Freq. 
Airflow 
- ft/min. 
(m/sec) 


0 
200 
400 
600 
(0) 
(1.01) 
(2.03) 
(3.04) 


I 


Tambient·C 
40 
54.5 
68.5 
71.5 
74.5 
Without 
Heat Sink 
50 
48.0 
65.0 
68.5 
72.0 
Intel486 
OX2 CPU 


For further details about thermal and mechanical package specifications and methodologies, refer to the 1994 
Packaging Handbook (order number 240800). 


SL Enhanced Intel486 SX, OX, and OX2 CPUs. Fig- 
ures 6-11 and 6-12 apply to 5V SL Enhanced Inte1486 
OX and OX2 CPUs. Figures 6-13 and 6-14 apply to 5V 
SL Enhanced Intel486 SX CPUs. The figures apply to 
all frequencies specified for each corresponding prod- 
uct. 


The capacitive derating curves illustrate output delay 
versus capacitive load for 3.3V and 5V SL Enhanced 
Inte1486 microprocessors. 
The derating curves show 
the delays for the rising and fal1ingedges under worst- 
case conditions. Figures 6-9 and 6-10 apply to all 3.3V 


NOTE: 
This graph willnot be linearoutside of the capacitive range shown. 
nom = nominalvalue fromthe A.C.Characteristicstable. 


Figure 6-9. Typical 
Loading 
Delay versus Load Capacitance 
under 
Worst-Case 
Conditions 
for a Low-to-Hlgh 
Transition 


NOTE: 
This graph willnot be linearoutside of the capacitive range shown. 
nom = nominalvalue fromthe A.C.Characteristicstable. 


Figure 6·10. Typical 
Loading 
Delay versus Load Capacitance 
under 
Worst-Case 
Conditions 
for a Hlgh-to-Low 
Transition 
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nom+4 


nom+3 


nom+2 


Delay (ns) 


nom+l 


nom 


nom-l 


25 
50 
75 
100 


NOTE: 
This graph 
will not be linear outside 
of the capacitive 
range shown. 
nom = nominal 
value from the A.C. Characteristics 
table. 


Figure 6-11. Typical 
Loading 
Delay versus Load Capacitance 
under 
Worst-Case 
Condltons 
for a Low-ta-High 
Transition 


nom+? 


nom+6 


nom+5 


nom+. 


nom+3 
Delay (ns) 
nom+2 


nom+l 


nom 


nom-l 


nom-2 


25 
50 
75 
100 


NOTE: 
This graph 
will not be linear outside' of the capacitive 
range 
shown. 


nom = nominal 
value from the A.C. Characteristics 
table. 


Figure 6-12. Typical 
Loading 
Delay versus Load Capacitance 
under 
Worst-Case 
Conditions 
for a High-to-Low 
Transition 


NOTE: 
This graph 
will not be linear outside 
of the capacitive 
range shown. 
nom 
= nominal 
value from the A.C. Characteristics 
table. 


Figure 6-13. Typical 
Loading 
Delay versus Load Capacitance 
under 


Worst-Case 
Conditions 
for a Low-to-Hlgh 
Transition 


nom+7 


nom+6 


nom+5 


nom+4 


nom+3 
Delay (ns) 
nom+2 


nom+l 


nom 


nom-1 


nom-2 


25 
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NOTE: 
This graph 
will not be linear outside 
of the capacitive 
range shown. 
nom 
= nominal 
value from the A.C. Characteristics 
table. 


Figure 6-14. Typical 
Loading 
Delay versus Load Capacitance 
under 


Worst-Case 
Conditions 
for a Hlgh-to-Low 
Transition 
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The SL Enhanced Intel486 CPU offers 2X clock mode 
for systems that reiy on dynamic frequency scaling for 
CPU power management. This product is not intended 
for the desktop computer. This 2X clock CPU differs 
from the IX clock CPU in the following ways: 


Pin AssignmentlFunction: The 2X clock product has a 
CLK2 input, rather than the IX clock product's CLK 
input. The CLK2 input must be synchronized to the 
system phase using the falling edge of RESET. (For 
reference, the pinout change from the existing Low 
Power Intei486 DX and SX CPUs is also shown. The 
CLKSEL pin is not used on the SL Enhanced Intei486 
CPUs, as it is on the existing Low Power Intel486 DX 
and SX CPUs.) 


Clock Control: The CLK2 input can be changed dy- 
namically. The Stop Clock interrupt is handled in a 
different manner. 


AC Specifications: In general, the AC specifications for 
the 2X clock device will have slightly longer setups, 


holds, and maximum valid delays. This is consistent 
with the existing Low Power Intei486 DX and SX CPU 
products. 


Upgrades: There are no end user upgrade products 
planned for the 2X clock mode product. The UP # 
function is still provided for use by system designers 
that offer SX to DX CPU upgrade cards. 


This section will explain the differences between the 
CPU with the 2X clock mode and the CPU with the IX 
clock mode. 


The SL Enhanced Intei486 CPU with the 2X clock 
option is available in the 196 Lead PQFP package. The 
pinout is identical to the SL Enhanced Intel486 CPU 
with the IX clock option with the exception of the 
name of the clock input. The IX clock input is called 
CLK and the 2X clock input is called CLK2. 


Table 7-1 shows the change between the existing prod- 
ucts and new products. 


Pin 
Low Power 
SLEnhanced 
Low Power 
SL Enhanced 
Intel486 
SX CPU 
Intel486 
SX CPU 
Intel486 
OX CPU 
Intel486 
OX CPU 


75 
NC 
STPCLK# 
NC 
STPCLK# 


77 
NC 
NC 
IGNNE# 
IGNNE# 


81 
NC 
NC 
FERR# 
FERR# 


85 
NC 
SMI# 
NC 
SMI# 


92 
NC 
SMIACT# 
NC 
SMIACT# 


94 
NC 
SRESET 
NC 
SRESET 


127 
CLKSEL 
NC 
CLKSEL 
NC 


Symbol 
Type 
Name and Function 


ClK2 
I 
CLK2 provides 
the fundamental 
timing for the CPU. Both of the internal timing 
phases, phase-1 
(ph1) and phase-2 
(ph2), are provided 
by the external ClK2 
input. 


All external timing parameters 
are specified 
with respect to the phase-1 
rising edge of 
ClK2. 


For the 2X clock mode the ClK 
frequency 
is twice the frequency 
of the CPU. 


RESET 
I 
The RESET input forces the CPU to begin execution 
at a known state. The CPU 
cannot 
begin execution 
of instructions 
until at least 1 ms after Vcc and ClK2 
have 
reached 
their proper AC and DC specifications. 
However, 
for soft resets, RESET 
should remain active for at least 30 ClK2 
periods (equal to 15 internal CPU ClK). 


The RESET pin should remain active during this time to insure proper CPU operation. 
RESET is active HIGH. Reset is asynchronous, 
but must meet setup and hold times 
t20, t20a and t21 for recognition 
in any specific 
clock. 


RESET sets the 5MBASE 
descriptor 
to default address of 30000H. 
If the system 
uses 5MBASE 
relocation, 
then the SRESET pin should be used for soft resets. 


For the 2X clock mode, the falling edge of RESET synchronizes 
the CPU internal 
clock phase. RESET must be used at power up and anytime the phase of the CPU 
clock must be re-synchronized 
to the system phase. 


SRESET 
I 
The SRESET pin duplicates 
all the functionality 
of the RESET pin with the following 
two exceptions: 


1. The 5MBASE 
register will retain its previous value. 


2. If UP# 
is asserted, 
SRESET will not have an effect on the host microprocessor. 


For soft resets, SRESET should remain active for at least 30 ClK2 
periods 
(equal to 
15 internal CPU ClK). 
SRESET is active HIGH. SRESET is asynchronous 
but must 
meet setup and hold times t20 and t21 for recognition 
in any specific 
clock. 
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15 CLKmin 


~S 
~ 
This edge has no relationship J' --------------- 


to Internal clock phase 


The frequency of CLK2 is twice the internal frequency 
of the CPU. The internal clock is comprised of two 
phases, "PHI" 
and "PH2". 
Each CLK2 period is a 
phase of the internal clock. Figure 7-1 illustrates the 
relationship between the CLK2 input and the internal 
phases. All set-up, hold, float-delay and valid delay 
timings are referenced to the rising edge of phase 1 of 
CLK2. Thus it is important to synchronize the external 
circuitry with the phase of the CLK2 input. The inter- 
nal processor clock phase is determined at the falling 
edge of the RESET input. RESET must meet the speci- 
fied setup and hold times to correctly synchronize the 
internal clock phase. See Figure 7-2. 


The CPU with the 2X clock option does not rely on an 
internal 
Phase Lock Loop to generate the internal 
phase clocks. Therefore, the frequency of the CLK2 
input can be changed dynamically or "on-the-fly". 


The 2X clock mode SL Enhanced Intel486 CPU pro- 
vides an interrupt mechanism, STPCLK "', that places 
the CPU into a known state. Although the frequency of 
the CLK2 input can be dynamically changed between 0 
MHz and the maximum operating frequency of the 


CPU, operation between 0 MHz and 8 MHz is not 
tested. Stopping the CLK2 input with the CPU in a 
known state requires use of the STPCLK'" 
mechanism. 


When the CPU recognizes a STPCLK'" 
request, the 


processor will stop execution on the next instruction 
boundary, stop the pre-fetcher, empty all internal pipe- 
lines and the write buffers, and then generate a Stop 
Grant bus cycle. At this point the CPU is in the Stop 
Grant state. 


The rising edge of STPCLK '" will tell the CPU that it 
can return to program execution at the instruction fol- 
lowing the interrupted instruction. 


Unlike the normal interrupts, 
INTR 
and NMI, the 


STPCLK'" 
interrupt 
does not initiate interrupt 
ac- 


knowledge cycles or interrupt vector table reads. 


STPCLK'" 
is active LOW and is provided with an in- 


ternal pull-up resistor. STPCLK'" 
is an asynchronous 


signal, but must remain active until the processor issues 
the Stop Grant bus cycle. STPCLK'" 
may be de-assert- 


ed at any time after the processor has issued the Stop 
Grant bus cycle. Note that STPCLK'" 
should NOT be 


de-asserted before the CPU has issued the Stop Grant 
bus cycle. STPCLK'" 
must be deasserted for a mini- 


mum of 5 clocks after RDY'" is returned active for the 
Stop Grant bus cycle before being asserted again. 


4 Halt State 
1 Normal Execution 
HALT 


CLK2 is running 
Frequencycan be changed 
dynamically 


INTR, NMI,SMI# 
j~ 
RESET,SRESET 
STPCLK#asserted 


& StopGrant Bus 
STPCLK# deasserted 
Cyclegenerated 
or RESET, SRESET 


I 


STPCLK#deasserted 
2 Stop Grant State 


HOLDIBOFF/AHOLD,EADS#respond 


•.. 
ICC depends on CLK2 Frequency 


STPCLK#asserted 
!' 


& StopGrant Bus Cycle generated 
StopCLK2 input 
Start CLK2 input 


, 


3 Stop Clock State 


GLK2input is stopped 


CPU will not respond to inputs 


This is the Normal operating state of the CPU. During 
this state, the CLK2 input frequency can be changed 
dynamically or "on-the-fly" 
for power consumption 
control with no clock control latency. This capability 
provides a wide range of performance/power consump- 
tion options. Operation of the processor is tested be- 
tween 8 MHz and the maximum operating frequency of 
the CPU. Operation below 8 MHz is guaranteed by 
design, though is not 100% tested. Operation 
at 0 
MHz 
is 
tested 
when 
the 
stop 
clock 
protocol 
(STPCLK II) is used. 


The CPU enters the Stop Grant state in response to a 
STPCLK II interrupt. The CPU will generate a Stop 
Grant bus cycle when it enters this state from the Nor- 
mal state or the HALT state. The CPU will not gener- 
ate a Stop Grant bus cycle when it enters the Stop 
Grant state from the Stop Clock state. 


While in the Stop Grant state, the pull-up resistors on 
STPCLKII and UPII are disabled internally. The sys- 
tem must continue to drive these inputs to the state 
they were in immediately before the CPU entered the 
Stop Grant state. For minimum CPU power consump- 
tion, all other input pins should be driven to their inac- 
tive level while the CPU is in the Stop Grant state. 


• 


During the Stop Grant state, the CPU will respond to 
HOLD, AHOLD and BOFF# 
normally and can per- 
form cache invalidates. An active edge on either the 
SMI # or NMI interrupts will be latched and will be 
serviced after the rising edge of STPCLK #. An INTR 
request will be serviced after the CPU returns to the 
normal state as long as INTR is held active until the 
CPU issues an interrupt acknowledge bus cycle. 


The CPU enters the Stop Clock state when the system 
stops the CLK2 input. The system can stop the CLK2 
input on either a logic high or a logic low. The CLK2 
input must be restarted in the same state as when it was 
stopped. In other words, any CLK2 input state can be 
stretched. (See Figure 7-4 for details). CPU operation 
at 0 MHz is tested only when the CPU is in the Stop 
Clock state. 


In the Stop Clock state, the CPU does not respond to 
any stimulus. The CPU must re-enter the Stop Grant 
state (CLK2 input must be restarted) in order to per- 
form any bus actions such as HOLD/HLDA 
cycles, 
invalidates (AHOLDIEADS# 
or FLUSH# 
cycles), 
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and BOFF #. It is recommended that CLK2 be restart- 
ed 2 clocks before and continue until 2 clocks after the 
transition 
of 
the 
HOLD, 
AHOLD, 
EADS#, 


FLUSH#, 
or BOFF# 
signals. 


The interrupt signals (SMI #, NMI, and INTR) will be 
recognized and serviced corre,ctly if the input is held in 
the active state until the CPU returns to the Stop Grant 
state. The SL Enhanced Inte1486 CPU Family requires 
that INTR be held active until the CPU issues an inter- 
rupt acknowledge cycle in order to guarantee recogni- 
tion. This condition also applies to the existing Intel486 
CPUs. 


The CPU enters the HALT state from the Normal state 
on a HLT instruction. The system can place the CPU 
into the Stop Grant state from the HALT state by as- 
serting the STPCLK # input. The CPU will generate a 
Stop Grant bus cycle when it enters the Stop Grant 
state. If the CPU entered the Stop Grant state from the 
HALT state then it will return to the HALT state when 
the STPCLK # interrupt is de-asserted. When the CPU 
re-enters the HALT state it will generate a HALT bus 
cycle. 


• Normal State 
o Stop Clock State 
Ice max = 1 mA @ 3.3V 
Ice max = 2 mA @ 5V 


C Stop and Restart CLK2 within Stop Grant State 
(Fully tested) 


33MHz 
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Vcc 
Parameter 
Operating 
Frequency 
Typical 
Maximum 


3.3V 
Ice Active 
25 MHz 
250 mA 
315 mA 


33 MHz 
330 mA 
415 mA 


Ice Stop Clock 
OMHz 
100 J.loA 
1 mA 


Vcc 
Parameter 
Operating 
Frequency 
Typical 
Maximum 


3.3V 
Ice Active 
33 MHz 
330 mA 
415 mA 


Ice Stop Clock 
OMHz 
100 J.loA 
1 mA 
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Vcc 
Parameter 
Operating Frequency 
Typical 
Maximum 


5V 
lee Active 
25 MHz 
430mA 
560 mA 


33 MHz 
590 mA 
685 mA 


lee Stop Clock 
OMHz 
200p.A 
2mA 


Vcc 
Parameter 
Operating Frequency 
Typical 
Maximum 


5V 
lee Active 
33 MHz 
500mA 
630mA 


lee Stop Clock 
OMHz 
200p.A 
2mA 


-+- i486 SX SV 
-II- i486 DX SV 


-.- 
i486 SX, DX 3.3V 
2000 


Typical Power IS00 
(mWATIS) 


Figure 7-6. Frequency YS. Power (Typ) in 2X ClK Mode for 3.3V and 5V, 


Sl Enhanced Intel486 SX and OX CPUs 


7.4 D.C.Specifications for 2X Clock 
Option 
7.5 A.C. Specifications for 2X Clock 
Option 


For 2X clock D.C. specifications, refer to the IX clock 
D.C. specifications (Section 6.1). 


The A.C. specifications given in the tables of this sec- 
tion consist of output delays, input setup requirements 
and input hold requirements. All A.C. specifications 
are relative to the rising edge of the phase I of the input 
system clock (CLK2), unless otherwise specified. 


7.5.1 
5V A.C. CHARACTERISTICS 


Table 
7·7. 5V A.C. Characteristics 
(2X Clock) 
Frequency 
= 25 and 33 MHz (Preliminary 
Information) 
Functional operating range: VCC = 5V ± 5%; TCASE = O·C + 85 ·C; CL = 50 pF unless otherwise specified. 


Symbol 
Parameter 
Mln 
Max 
Min 
Max 
Unit 
Notes 


Frequency 
25 
33 
MHz 
Note 1 


CLK2 Frequency 
50 
66 
MHz 
Note 1 


tl 
CLK2 Period 
20 
15 
ns 


t2 
CLK2 High Time 
7 
5 
ns 
at2V 


t3 
CLK2 Low Time 
7 
5 
ns 
at 0.8V 


4 
CLK2 Fall Time 
2 
2 
ns 
2V to 0.8V (2) 


t5 
CLK2 Rise Time 
2 
2 
ns 
0.8V to 2V (2) 


t6 
A2-A31, 
PWT, PCD, BEO-3#, 
M/IO#, 
3 
19 
3 
17 
ns 
D/C#, 
W/R#, 
ADS#, 
LOCK#, 
BREQ, 
HLDA, SMIACT#, 
FERR#' 
Valid Delay 


t7 
A2-A31, 
PWT, PCD, BEO-3#, 
M/IO#, 
28 
21 
ns 
Note 2 
D/C#, 
W/R#, 
ADS#, 
LOCK#, 
BREQ, 
HLDA Float Delay 


t8 
PCHK# 
Valid Delay 
3 
24 
3 
23 
ns 


t8a 
BLAST #, PLOCK # Valid Delay 
3 
24 
3 
21 
ns 


t9 
BLAST #, PLOCK # Float Delay 
28 
21 
ns 
Note 2 


tl0 
00-031, 
DPO-DP3 
Write Data Valid Delay 
3 
20 
3 
19 
ns 


t11 
00-031, 
DPO-DP3 
Write Data Float Delay 
28 
21 
ns 
Note 2 


t12 
EADS# 
Setup Time 
9 
6 
ns 


t13 
EADS# 
Hold Time 
4 
4 
ns 


t14 
KEN #, BS16 #, BS8 # Setup Time 
9 
6 
ns 


t15 
KEN#, 
BS16#, 
BS8# 
Hold Time 
4 
4 
ns 


t16 
ROY #, BRDY # Setup Time 
9 
6 
ns 


t17 
RDY#, 
BRDY# 
Hold Time 
4 
4 
ns 


t18 
HOLD, AHOLD Setup Time 
11 
7 
ns 


t18a 
BOFF # Setup Time 
11 
9 
ns 


t19 
HOLD, AHOLD, 
BOFF # Hold Time 
4 
4 
ns 


t20 
FLUSH#, 
A20M#, 
NMI,INTR, 
SMI#, 
11 
6 
ns 
STPCLK#, 
SRESET, 
RESET, IGNNE#' 
Setup Time 


t20a 
RESET Falling Edge Setup Time 
9 
5 
ns 


t21 
FLUSH#, 
A20M#, 
NMI, INTR, SMI#, 
4 
4 
ns 
STPCLK#, 
SRESET, 
RESET,IGNNE#' 
Hold Time 
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Table 
7·7. 5V A.C. Characteristics 
(2X Clock) 
Frequency 
= 25 and 33 MHz (Preliminary 
Information) 
(Continued) 
Functional 
operating range: V CC = 5V ± 5%; T CASE = DoC to + 85°C; CL = 50 pF unless otherwise specified. 


Symbol 
Parameter 
Mln 
Max 
Mln 
Max 
Unit 
Notes 


t22 
DO-D31, 
DPO-DP3, 
A4-A31 
Read Setup Time 
6 
6 
ns 


t23 
DO-D31, 
DPO-DP3, 
A4-A31 
Read Hold Time 
4 
4 
ns 


NOTES: 
, Present only in the SL Enhanced Intel486TMDX microprocessor. 
1.0 MHz operation is tested using the STPCLK# and STOP GRANT bus cycle protocol. Operation between 0 MHz < CLK2 
< 8 MHz is guaranteed by design characterization, but is not 100% tested. 
2. Not 100% tested. guaranteed by design characterization. 


Table 7-8. 3.3V A.C. Characteristics 
(2X Clock) 
Frequency 
= 25 and 33 MHz (Preliminary 
Information) 
Functional 
operating range: V CC = 3.3V ±0.3V; TCASE = O°C to + 85°C; CL = 50 pF unless otherwise specified 


Symbol 
Parameter 
Min 
Max 
Min 
Max 
Unit 
Notes 


Frequency 
25 
33 
MHz 
Note 1 


CLK2 Frequency 
50 
66 
MHz 
Note 1 


t1 
CLK2 Period 
20 
15 
ns 


t2 
CLK2 High Time 
7 
5 
ns 
at2V 


t3 
CLK2 Low Time 
7 
5 
ns 
atO.8V 


t4 
CLK2 Fall Time 
2 
2 
ns 
2V to 0.8V(2) 


t5 
CLK2 Rise Time 
2 
2 
ns 
0.8V to 2V(2) 


ta 
A2-A31, 
PWT, PCD, BEO-3#, 
M/IO#, 
3 
19 
3 
17 
ns 


D/C#, 
W/R#, 
ADS#, 
LOCK#, 
BREQ. 
HLDA, SMIACT#, 
FERR#' 
Valid Delay 


t7 
A2-A31, 
PWT, PCD, BEO-3#, 
M/IO#, 
28 
21 
ns 
Note 2 


D/C#, 
W/R#, 
ADS#, 
LOCK#, 
BREQ, 


HLDA Float Delay 


t8 
PCHK # Valid Delay 
3 
24 
3 
23 
ns 


t8a 
BLAST #, PLOCK # Valid Delay 
3 
24 
3 
21 
ns 


t9 
BLAST #, PLOCK # Float Delay 
28 
21 
ns 
Note 2 


tlO 
DO-D31, 
DPO-DP3 
Write Data Valid Delay 
3 
20 
3 
19 
ns 


tl1 
DO-D31, 
DPO-DP3 
Write Data Float Delay 
28 
21 
ns 
Note 2 


t12 
EADS# 
Setup Time 
9 
6 
ns 


t13 
EADS# 
Hold Time 
4 
4 
ns 


t14 
KEN#, 
BS16#, 
BS8# 
Setup Time 
9 
6 
ns 


t15 
KEN#, 
BS16#, 
BS8# 
Hold Time 
4 
4 
ns 


t16 
RDY #, BRDY # Setup Time 
9 
6 
ns 


t17 
RDY #, BRDY # Hold Time 
4 
4 
ns 


t18 
HOLD, AHOLD Setup Time 
11 
7 
ns 


t18a 
BOFF # Setup Time 
11 
9 
ns 
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Table 7·8. 3.3V A.C. Characteristics 
(2X Clock) 
Frequency 
= 25 and 33 MHz (Preliminary 
Information) 
(Continued) 
Functional 
operating range: Vcc 
= 3.3V ±0.3V; 
TCASE = O"C to + 85°C; CL = 50 pF unless otherwise spt;eified 


Symbol 
Parameter 
Mln 
Max 
Min 
Max 
Unit 
Notes 


t19 
HOLO, AHOLO, BOFF # Hold Time 
4 
4 
ns 


t20 
FLUSH#, 
A20M#, 
NMI, INTR, SMI#, 
11 
6 
ns 
STPCLK#, 
SRESET, 
RESET, IGNNE#· 
Setup 
Time 


t20a 
RESET Falling Egde Setup Time 
9 
5 
ns 


t21 
FLUSH#, 
A20M#, 
NMI,INTR, 
SMI#, 
4 
4 
ns 
STPCLK#, 
SRESET, 
RESET, IGNNE#· 
Hold 
Time 


t22 
00-031, 
OPO-OP3, 
A4-A31 
Read Setup Time 
6 
6 
ns 


t23 
00-031, 
OPO-OP3, 
A4-A31 
Read Hold Time 
4 
4 
ns 


NOTES: 
• Present only in the SL Enhanced Intel4B6TMOX Microprocessor 
1. 0 MHz operation 
is tested 
using the 
STPCLK# 
and STOP GRANT bus cycle 
protocol. 
Operation between 
o MHz<CLK2<B 
MHz is guaranteed by design characterization, but is not 100% tested. 


2. Not 100% tested, guaranteed by design characterization. 
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Output Valid Delay 
tx = t6, tB, tBa, t10 
Output Float Delay 
ty = t7, t9, t11 


Figure 
7-9. Valid and Float 
Delay Timings 


241696-40 
tx = t12, t14, t16, t1B, t20, t20a, t22, t1Ba 
ty = t13, t15, tH, t19, t21, t23 


Figure 7·8. Setup 
and Hold Timings 
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8.1 . Test Access Port 


TCK 
This Test Clock signal is an input to the CPU and pro- 
vides the clocking function required by the JTAG 
Boundary scan feature. TCK is used to clock state in- 
formation and data into the component on the rising 
edge of TCK on TMS and TDI, respectively. Data is 
clocked out of the CPU on the falling edge of TCK and 
TOO. 


TDI 
This Test Data Input signal is the serial input used to 
shift JTAG instruction and data into the CPU. TDI is 
sampled on the rising edge of TCK, during the SHIFf- 


IR and SHIFf-DR 
TAP controller states. During all 
other tap controller states, TDI is a "don't care". 


TDO 
This Test Data Output signal is the serial output used 
to shift JTAG instructions and the data out of the 
CPU. TOO is driven on the falling edge ofTCK during' 
the SHIFf-IR 
and SHIFf-DR 
TAP controller states. 


At all other times, TOO is driven to the high imped- 
ance state. 


TMS 
This Test Mode Select signal is decoded by the JTAG 
TAP (Test Access Port) to select the operation of the 
test logic. TMS is sampled on the rising edge of TCK. 
To guarantee deterministic behavior of the TAP con- 
troller, TMS is provided with an internal pull-up resis- 
tor. 


Symbol 
Parameter 
Mln 
Max 
Unit 
Notes 


t24 
TCK Frequency 
8 
MHz 
Note 2 


t25 
TCK Period 
125 
ns 


t26 
TCK High Time 
40 
ns 
@2.0V 


t27 
TCKLowTime 
40 
ns 
@0.8V 


t28 
TCK Rise Time 
8 
ns 
Note 1 


t29 
TCKFaliTime 
8 
ns 
Note 1 


t30 
TDI, TMS Setup Time 
8 
ns 
Note 3 


t31 
TDI, TMS Hold Time 
10 
ns 
Note 3 


t32 
TOO Valid Delay 
3 
30 
ns 
Note 3 


t33 
TOO Float Delay 
36 
ns 
Note 3 


t34 
All outputs 
(Non-Test 
Valid Delay) 
3 
30 
ns 
Note 3 


t35 
All Outputs 
(Non-Test 
Float Delay) 
36 
ns 
Note 3 


t36 
All Inputs (Non-Test) 
Setup Time 
8 
ns 
Note 3 


t37 
All Inputs (Non-Test) 
Hold Time 
10 
ns 
Note 3 


NOTES: 
1. Rise/Fall Times are measured between 0.8V and 0.2V. Rise/Fall times can be relaxed by 1 ns per 10 ns increase in TCK 
period. 
2. TCK period ~ ClK period. 
3. Parameter measured from TCK. 
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8.2 Boundary Scan Component 
Identification Support 


• Manufacturer Identity (bits 1-11). This is a com- 


pressed form of the JEDEC (pub 106-A) manufac- 
turers code. Intel is identified by a hex value of 009. 


• Part Number (bits 12-27). IEEE only requires that 


2 different parts in the same package style with the 
TAP pins in the same location must have different 
values. Intel has sub divided this field further as in- 
dicated in the following tables. These sub fields are 
Vcc, Type, Family, and Model. 


• Version (bits 28-31). 
IEEE recommends that this 


field should be assigned to identify the variant of a 
component type. Intel is using this field to identify 
major steppings of the parts. 


The IEEE standard 1149.1 includes an optional public 
instruction called "IDCODE" 
which is used to verify 


the correct parts are installed in the correct locations 
on a board. The SL Enhanced InteI486 CPUs support 
this instruction with a IDCODE value of 0010. When 
implemented IEEE requires that the register that is se- 
lected for the scan path be 32 bits long with four specif- 
ic sub fields. These fields are: 


• 
LSB (bit 0) must have a value of "I". 


Intel486 
CMD 
Vcc 
Intel 
Intel486 
MFGID 
1st 
Boundary 


CPU 
Register 
Version 
1=3 
Architecture 
CPU 
Model 
Intel = 009H 
bit 
Scan 


Type 
0=5 
Type 
Family 
ID (Hex) 


OX2 
0001 
xxxx* 
0 
000001 
0100 
00101 
00000001001 
1 
x0285013H 


OX 
0001 
xxxx* 
0 
000001 
0100 
00001 
00000001001 
1 
x0281013H 


SX 
0001 
xxxx* 
0 
000001 
0100 
00010 
00000001001 
1 
x0282013H 


OX 
1001 
xxxx* 
0 
000001 
0100 
00001 
00000001001 
1 
x0281013H 


2xCLK 


SX 
1001 
xxxx* 
0 
000001 
0100 
00010 
00000001001 
1 
x0282013H 


2xCLK 


• 
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Intel486 
CMD 
Vcc 
Intel 
Intel486 
MFGID 
1st 
Boundary 


CPU 
Register 
Version 
1=3 
Architecture 
CPU 
Model 
Intel 
= 009H 
bit 
Scan 
Type 
0=5 
Type 
Family 
10 (Hex) 


OX2 
0001 
xxxx· 
1 
000001 
0100 
00101 
00000001001 
1 
x8285013H 


OX 
0001 
xxxx· 
1 
000001 
0100 
00001 
00000001001 
1 
x8281013H 


SX 
0001 
xxxx· 
1 
000001 
0100 
00010 
00000001001 
1 
x8282013H 


OX 
1001 
xxxx· 
1 
000001 
0100 
00001 
00000001001 
1 
x8281013H 
2xCLK 


SX 
1001 
xxxx· 
1 
000001 
0100 
00010 
00000001001 
1 
x8282013H 
2xCLK 


NOTES: 
1. DX indicates the inclusionof a FPU, and SX indicates the exclusionof a FPU. 
2. The distinctionbetween 1x and 2x bus clocks is determined by reading the 4 bits loaded into the shift-registerduringthe 
"Capture-IR"state. (labeled here as "cmd reg") 


9.0 
OverDrive™ 
PROCESSOR 
SOCKET FOR SL ENHANCED 
INTEL486 CPU-BASED SYSTEMS 


This section provides OverDrive Processor socket spec- 
ifications 
for systems based 
on the SL Enhanced 


Inte1486 SX, DX and DX2 CPUs. 


The OverDrive 
Processor socket for Intel486 DX2 


CPU-based 
systems is a superset of the OverDrive 


Processor socket for Intel486 SX and DX CPU-based 
systems. Section 9.1 specifies the OverDrive Processor 
socket for Intel486 SX and DX CPU-based systems. 
Section 9.2 specifies the Pentium™ OverDrive Proces- 
sor socket for 5V Inte1486 DX2 CPU-based systems. 


The OverDrive Processor socket specifications in this 
section are only valid for systems using IX clock mode 
CPUs. There are no OverDrive Processor socket speci- 
fications compatible with 2X clock mode CPUs. 


9.1 
OverDrive Processor Socket for 
5V SL Enhanced Intel486 SX and 
DX CPU-Based Systems 


This section contains OverDrive Processor socket spec- 
ifications for 5V systems based on the SL Enhanced 
Inte1486 SX or DX CPUs. The OverDrive Processor 
socket specifications are also valid for 5V systems based 
on the existing Intel486 SX or DX CPUs. 


9.1.1 
OVERDRIVE 
PROCESSOR 
SOCKET 
CIRCUIT 
DESIGN 


Figures 9-1 and 9-2 show the circuits which interface 
the original CPU with the OverDrive Processor socket. 
These circuits allow SL Enhanced Intel486 SX and DX 
(as well as Intel486 SX and DX) CPU-based systems to 
be upgraded with the OverDrive Processor. 
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9.1.1.1 OverDrive Processor Socket Circuit for 
SL Enhanced Intel486 DX CPU-Based 
Systems 


The OverDrive Processor socket circuit for Intel486 
DX CPU-based systems allows the Intel486 DX CPU 
to directly recognize when the OverDrive Processor 
socket is populated. When the UP if pin is driven active 
to the Intel486 DX CPU. the CPU three-states all of its 
output pins and enters power-down mode. This circuit 
is shown in Figure 9-1. 


9.1.1.2 OverDrive Processor Socket Circuit for 
SL Enhanced Intel486 SX CPU·Based 
Systems 


The OverDrive Processor socket circuit for Intel486 SX 
CPU-based systems allows the microprocessor to di- 
rectly recognize when the OverDrive Processor socket 
is populated. When the UP if pin is driven active to the 
Intel486 SX CPU. the CPU three-states all of its output 
pins and enters power-down mode. This circuit 
is 


shown in Figure 9-2. 
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The maximum and minimum dimensions for the 169 
pin, PGA package with heat sink are shown in Table 9- 
I.The maximum height ofthe OverDrive Processor for 5V 
Intel486 SX and DX CPU-based systems from the pin 
stand-offs to the top of the heat sink, including the ad- 
hesive thickness, is 0.552 inches. 


The OverDrive Processors for Intel486 SX and DX 
CPU-based systems are supplied with an attached heat 
sink. Intel486 SX and DX microprocessor system de- 
signs must provide space for the heat sink on the Over- 
Drive Processor. 


DIMENSION 
(inches) 
Minimum 
Maximum 


A. Heat Sink Width 
1.520 
1.550 


B. PGA Package 
Width 
1.735 
1.765 


C. Heat Sink Edge Gap 
0.065 
0.155 


D. Heat Sink Height 
0.312 
0.360 


E. Adhesive 
Thickness 
0.008 
0.012 


F. Package 
Height from Stand-Ofts 
0.140 
0.180 


G. Total Height from Package 
Stand-Offs 
to Top of Heat Sink 
0.460 
0.552 


INTEL OVERDRI~MPROCESSOR 


OMNI-DIRECTIONAL 
HEAT SINK 


ADHESIVE 


Figure 
9-3. OverDrlve™ 
Processor 
for 5V SL Enhanced 
Intel486TM 
SX and OX CPU·Based 
Systems 
(169 pin, PGA Package 
with 
Heat Sink) 


The heat generated by the OverDrive Processor re- 
quires careful management of heat dissipation. 


Thermal Equations: The method for calculating the 
heat dissipation for an OverDrive Processor (with a 
heat sink) or a standard CPU is the same except that 
the reference point for measuring the device tempera- 
ture is different. The OverDrive Processor specifies 
Tsink (the temperature at the outside center base of the 
heat sink, not on the heat sink marking plate or cooling 
posts) and 6JS (the thermal resistance from the silicon 
junction 
to the heat 
sink base). The relationships 


between temperature, thermal resistance and power are 
shown in the following equations: 


Tambient = Ambient Temperature, 
Pmax 
= Power (Ice • Vee), 
6SA 
= 6JA - 
6Js· 


The maximum Ice 
values in Table 9-2 are the best 
known design estimates and should be used as the max- 
imum specification. 


Table 9-2. OverDrlve™ Processor Maximum and Typlcallcc 
Values 
for SV Intel486 SX and OX CPU-Based Sytems 


System Frequency (MHz) 
Typlcallcc 
( mAl 
Maximum Icc ( mAl 


25 
TBD 
1000 


33 
TBD 
1250 


The thermal resistance values for the OverDrive Proc- 
essor (169 pin, PGA package with heat sink) are shown 
in Table 9-3. The maximum TA values shown in Table 


SVOverDrive 
Processor 
o 


12.0 


9-4 were calculated using Ts = 85·C, Vee = 5.3V, the 
maximum Ice values shown in Table 9-2, and 6JS and 
6JA values shown in Table 9-3. 


800 


4.6 


SV OverDrive 
Airflow (Ft/mln., LFM) 
Processor 
fCLK{MHz) 
0 
200 
400 
600 
800 


TA("C) 
25 
31 
49 
60 
68 
70 


33 
18 
40 
53 
63 
67 


For further details about thermal and mechanical package specifications and methodologies, refer to the 1993 Packaging 
Handbook (order number 240800). 


The OverDrive Processor may internally execute in- 
structions at multiple frequencies of the input clock. 
Thus software (or instruction based) timing loops will 
execute faster on the OverDrive Processor than on the 
Intel486 SX or DX CPU (at the same input clock fre- 
quency). Instructions such as NOP, LOOP, and JMP 
$+ 2, have been used by BIOS to implement timing 
loops, that are required, for example, to enforce recov- 
ery time between consecutive accesses for I/O devices. 


These instruction-based 
timing loop implementations 
may require modification for systems intended to be 
upgraded with the OverDrive Processor. 


In order to avoid any incompatibilities, it is recom- 
mended that timing loops be implemented in hardware 
rather than in software. This provides transparency and 
also requires no change to BIOS or I/O device drivers 
in the future when moving to higher processor clock 
speeds. As an example, a timing loop may be imple- 
mented as follows: The software performs a dummy 
I/O instruction to an unused I/O port. The hardware 
for the bus controller logic recognizes this I/O instruc- 


• 


tion and delays the termination of the I/O cycle to the 
CPU by keeping RDY IF or BRDY IF deasserted for the 
appropriate amount of time. 
The electrical functionality of the OverDrive Processor 
socket can be verified by fully testing the system with a 
populated OverDrive Processor socket. We recommend 
the system be tested with all compatible OverDrive 
Processors to ensure there are no compatibility issues. 
The OverDrive Processor does not support the Bounda- 
ry Scan feature. 


9.1.6 
OVERDRIVE 
PROCESSOR 
SOCKET 
PINOUT 
FOR 5V INTEL486 
SX AND 
OX CPU·BASED 
SYSTEMS 


A 
B 
C 
0 
E 
F 
G 
H 
J 
K 
L 
M 
N 
P 
Q 
R 
S 


17 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
17 
AHOLD (ADS_ 8516" 
Borr. 
Vss 
eD" 
Vss 
Vss 
PCD 
Vss 
Vss 
Vss 
W/Rtl 
Vss 
PCHK" 
NC 
ADS- 


16 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
16 
INTR 
NC 
RESET 
8SS" 
Vee 
ROY. 
Vee 
Vee 
BEl" 
Vee 
Vee 
Vee 
1Il/101r 
Vec 
PLOCK" 
BLAST" 
A' 
15 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
15 


IGNNE_ 
N"' 
FLUSH" 
A201ll_ 
HOLD 
KEN_ 
STPClKtI 
BRDY. 
9[2" 
BEO_ 
PWT 
o/etl 
LOCK" 
HLDA 
BREQ 
A3 
A. 
14 
0 
0 
0 • • • 
• • • 
0 
0 
0 
14 


NC 
UP' 
INC 
PLUG 
PLUG 
PLUG 
PLUG 
PLUG 
PLUG 
A2 
Vcc 
Vss 


13 
0 
0 
0 • 
• 
0 
0 
0 
13 


FERR" 
NC 
NC 
PLUG 
PLUG 
A7 
A8 
AID 


12 
0 
0 
0 • 
•• 
0 
0 
0 
12 
NC 
Ne 
Sl.4IACU 
PLUG 
PLUG 
AS 
All 
Vss 


1 1 
0 
0 
0 
0 
0 
0 
1 1 
Vss 
Vcc 
NC 
A. 
Vee 
Vss 


10 
0 
0 
0 
OverDrive 
Processor 
Socket 
for 
0 
0 
0 
10 
NC 
SIIlI" 
SRESET 
5V 
i486 
SX &: ox CPU-based 
Systems 
A13 
Vcc 
Vss 


9 
0 
0 
0 
0 
0 
0 
9 
Vss 
Vcc 
030 
TOP SIDE VIEW 


AI. 
Vee 
Vss 


8 
0 
0 
0 
169 
Contact 
PGA 
0 
0 
0 
8 
02. 
031 
028 
A20 
Vcc 
Vss 


7 
0 
0 
0 
R.v.6/1/93 
0 
0 
0 
7 
Vss 
Vec 
02. 
A22 
A1S 
A12 


6 
0 
0 
0 • 
• 
0 
0 
0 
6 


02' 
025 
027 
PLUG 
PLUG 
A2' 
Vee 
Vss 


5 
0 
0 
0 • 
• 
0 
0 
0 
5 
DP3 
Vss 
Vee 
PLUG 
PLUG 
A21 
AI. 
AU 


4 
0 
0 
0 
0 • • 
• • • 
0 
0 
0 
4 
023 
Vss 
Vcc 
KEY 
PLUG 
PLUG 
PLUG 
PLUG 
PLUG 
A19 
Vss 
NC 


3 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
3 
NC 
Vss 
ClK 
017 
010 
015 
012 
DP2 
01. 
014 
07 
D' 
DPO 
A30 
A17 
Vee 
A23 


2 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
2 


022 
021 
018 
013 
Vee 
08 
Vee 
03 
05 
Vce 
D. 
Vee 
01 
A2. 
Vss 
A25 
A2. 


1 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
1 


020 
01. 
011 
D. 
Vss 
DPI 
Vss 
Vss 
Vee 
Vss 
Vss 
Vss 
02 
DO 
A31 
A28 
A27 


A 
B 
C 
0 
E 
F 
G 
H 
J 
K 
L 
M 
N 
P 
Q 
R 
S 
241696-65 


NOTE: 
All NC pins must remain 
unconnected. 
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OverDrive 
Processor 
Socket 
for 


5V i486 
SX & DX CPU-based 
Systems 


BOTTOM SIDE 
VIEW 
169 Contact 
PGA 


NOTE: 
All NC pins must remain unconnected. 


o 
""'/ 


• 


Address 
Data 
Control 
Control 
N/C1 
Vcc 
Vss 


A2 
014 
Do 
P1 
A20M# 
015 
PWT 
L15 
A3 
B7 
A7 
A3 
R15 
01 
N2 
A05# 
517 
PLOCK# 
016 
A10 
B9 
A9 
A4 
516 
02 
N1 
AHOLO 
A17 
ROY# 
F16 
A12 
B11 
A11 
A5 
012 
03 
H2 
BEO# 
K15 
RE5ET 
C16 
A14 
C4 
B3 
A6 
515 
04 
M3 
BE1# 
J16 
5MI# 
B10 
B12 
C5 
B4 
A7 
013 
05 
J2 
BE2# 
J15 
5MIACT# 
C12 
B13 
E2 
B5 
A8 
R13 
06 
L2 
BE3# 
F17 
5RE5ET 
C10 
B16 
E16 
E1 
A9 
011 
07 
L3 
BLA5T# 
R16 
5TPCLK# 
G15 
C11 
G2 
E17 
A10 
513 
08 
F2 
BOFF# 
017 
UP# 
B14 
C13 
G16 
G1 
An 
R12 
09 
01 
BROY# 
H15 
W/R# 
N17 
R17 
H16 
G17 
A12 
57 
010 
E3 
BREO 
015 
54 
J1 
H1 
A13 
010 
011 
C1 
B58# 
016 
K2 
H17 
A14 
55 
012 
G3 
B516# 
C17 
K16 
K1 
A15 
R7 
013 
02 
CLK 
C3 
L16 
K17 
A16 
09 
014 
K3 
O/C# 
M15 
Position 
INC2 
M2 
L1 
A17 
03 
015 
F3 
OPO 
N3 
A18 
R5 
016 
J3 
OP1 
F1 
KEY 
04 
C14 
M16 
L17 


A19 
04 
017 
03 
OP2 
H3 
PLUG 
05 
P16 
M1 


A20 
08 
018 
C2 
OP3 
A5 
PLUG 
013 
R3 
M17 


A21 
05 
019 
B1 
EA05# 
B17 
PLUG 
014 
R6 
P17 


A22 
07 
020 
A1 
FERR# 
A13 
PLUG 
E4 
R8 
02 


A23 
53 
021 
B2 
FLU5H# 
C15 
PLUG 
E14 
R9 
R4 


A24 
06 
022 
A2 
HLOA 
P15 
PLUG 
N4 
R10 
56 


A25 
R2 
023 
A4 
HOLD 
E15 
PLUG 
N14 
R11 
58 


A26 
52 
024 
A6 
IGNNE# 
A15 
PLUG 
P4 
R14 
59 


A27 
51 
025 
B6 
INTR 
A16 
PLUG 
P5 
510 


A28 
R1 
026 
C7 
KEN# 
F15 
PLUG 
P13 
511 


A29 
P2 
027 
C6 
LOCK# 
Nt5 
PLUG 
P14 
512 


A30 
P3 
028 
C8 
M/IO# 
N16 
514 


A31 
01 
029 
A8 
NMI 
B15 
030 
C9 
PCO 
J17 
031 
B8 
PCHK# 
017 


NOTES: 
The correct orientation of the OverDrive Processor 
for 5V Intel486TM SX and DX CPU-based 
systems 
IS keyed by the 'KEY' 
pin (D4). Plugs must be provided in three other inner corner pin locations 
(D14, P4, and P14) and the next nearest 
pin 
locations in all corners as listed above. 
1. All NC pins must remain unconnected. 
2. The INC pin is defined to be an internal no-connect. This means that the pin is not internally connected 
and may be used 
for the routing of external signals. 


9.2 
Pentium™ 
OverDrive 
Processor 
Socket for 5V SL Enhanced 
Intel486 DX2 CPU·Based 
Systems 
The OverDrive 
Processor 
is compatible 
with the maxi- 
mum 
rating 
specification 
and the D.C.! A.C. specifica- 
tions 
of the Inte1486 
SX and DX 
CPUs. 
Refer 
to the 
thermal 
management 
section 
(Section 
9.1.3) 
for 
the 
maximum 
power 
supply 
current 
and operating 
temper- 
ature 
specifications. 


This section 
contains 
the specification 
for the Pentium 
OverDrive 
Processor 
socket 
for systems 
based 
on the 
Inte1486 DX2 CPU. 
These OverDrive 
Processor 
socket 
specifications 
are also valid for 5V systems 
based on the 
existing 
Intel486 
DX2 CPU. 
Please note that the Penti- 
um OverDrive 
Processor 
socket 
specifications 
present- 
ed in this 
document 
are 
preliminary 
and 
subject 
to 
change. 
For more updated 
information 
about 
the Over- 
Drive 
Processor 
socket, 
please contact 
your local Intel 
representative. 
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9.2.1 
PENTIUM 
OVERDRIVE 
PROCESSOR 
SOCKET 
OVERVIEW 
FOR 5V SL 
ENHANCED 
Intel486 
DX2 CPU-BASED 
SYSTEMS 


The OverDrive Processor socket specifies 237 contacts, 
which correspond to a standard 240 pin socket with one 
inside "KEY" contact, one outer "KEY" contact and 
four "orientation" contacts plugged on the outside cor- 
ner. The inside "KEY" 
contact provides backward 
compatibility with the OverDrive Processor socket for 
Intel486 SX and DX CPU-based systems. The four 
contacts plugged on the outside corner and the outer 
"KEY" pin ensure proper orientation for the Pentium 
OverDrive Processor. Additionally, all 120 inner con- 
tacts of a standard 240 pin socket (the inner II rows 
minus the "KEY" 
pin) should be plugged to ensure 
proper alignment of the OverDrive Processor for In- 
tel486 SX and DX CPU-based systems. 


To support Intel's SMM of the SL Enhanced Intel486 
processor, new pins have been added to the Pentium 
OverDrive Processor socket. 


The INC Pin is defined to be an internal no-connect. 
Thismeans that the pinisnot internally connected and may 
be used for the routing of external signals. For example, 
pin DI5 is defined to be the FERR# 
pin on the In- 
tel486 DX CPU. By routing DI5 and BI4 together, the 
FERR # signal can be made to appear on both pins. 


Note that the SRESET pin has been defined at location 
D II for compatibility with the SL Enhanced Intel486 
DX2 Processor. On the Pentium OverDrive processor, 
D II is defined as an INC pin. The OverDrive processor 
signal INIT and the SL Enhanced Intel486 CPU signal 
SRESET may be electrically connected on the mother- 
board, provided the system can handle the functionality 
differences depending on which part is inserted into the 
socket. Specifically, INIT and SRESET are similar ex- 
cept that SRESET will flush the on-chip cache, while 
INIT 
will keep the cache contents 
intact. 
Systems 


should not depend on SRESET to flush the cache in 
future versions of the SL Enhanced Intel486 processor, 
as previously described in this document. 


The Pentium OverDrive Processor is designed to fit in a 
standard 24O-lead(19 x 19) PGA socket with four cor- 
ner pins removed. The Pentium OverDrive Processor 
uses an active heat sink, and therefore, requires vertical 
clearance to allow adequate air circulation. 


The maximum and minimum dimensions of the Penti- 
um OverDrive Processor package with a fanlheat sink 
are shown in Table 9-6. The fanlheat sink unit is divid- 
ed into the size of the actual heat sink, and the required 
free space above the heat sink. The total height required 
for the Pentium OverDrive Processor from the mother- 
board will depend on the height of the PGA socket. 
The total external height given in Table 9-6 is only 
measured from the PGA pin stand-offs. Table 9-6 also 
details the minimum clearance needed around all four 
sides of the PGA package. 
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Table 9·6. Pentlum™ OverDrive Processor, 236 Pin PGA 
Package Dimensions with Active Heat Sink Attached 


Length and Width 
Height 
Component 
(Inches) 
(Inches) 


Minimum 
Maximum 
Minimum 
Maximum 


PGA Package 
1.950 
1.975 
0.140 
0.180 


Adhesive 
N/A 
N/A 
0.008 
0.012 


Heat Sink Unit 
1.830 
1.850 
N/A 
N/A 


Heat Sink 
N/A 
N/A 
0.790 
0.810 


Req'd Free Space 
N/A 
N/A 
0.400 
0.400 


External Total 
1.950 
1.975 
1.338 
1.402 


Space from Package 
0.200 
0.200 
N/A 
N/A 


Since the Pentium OverDrive Processor dissipates more 
power than the Intel486 CPU family members, it re- 
quires a larger cooling capacity. To facilitate the task of 
cooling the Pentium OverDrive Processor, Intel will 
ship the product with a fanlheat sink. No external con- 
nections (i.e., power) will be required for the fanlheat 
sink. All the needed connections will be made through 
the pins of the processor. The amount of extra power 
needed for the fan is accounted for in the lee numbers 
of the processor (see Section 9.2.3). To ensure adequate 
air circulation, the additional clearance specified in Ta- 
ble 9-6 must be provided. 
' 


The Pentium OverDrive Processor and system chassis 
have several unique design requirements due to the at- 
tached active heat sink. The following sections provide 
sample maximum system operating temperature calcu- 
lations so systems may be designed to comply with the 
thermal requirements of the Pentium OverDrive Proc- 
essor. 


The following equation can be used to calculate the 
maximum operating temperature of a system: 


TA(in): The temperature of the air going into the fanl 
heat sink. 


TSINK: Temperature of heat sink base, as measured in 
the center. 


8sl: Heat Sink to Internal Temperature [TA(in)] Ther- 
mal Resistance 


Since the Pentium OverDrive Processor uses an active 
heat sink, 8s1 (as shown in Table 9-7) is relatively con- 
stant, regardless of the airflow provided to the proc- 
essor. Table 9-8 details the maximum 
current 
re- 
quirements of the Pentium OverDrive Processor. The 
maximum ambient temperature 
specification for the 
Pentium OverDrive Processor is 55°C for both 25 MHz 
and 33 MHz processors with the heat sink attached. 
Therefore, the internal temperature of the air (TA (in) 
may not exceed 55°C under the worst case operating 
conditions specified for the system. This ensures that 
the value of TSINK does not exceed 85°C. 


Table 9·7. Thermal Resistance ("C/W) 
8S1 


Processor Type 
8S1 - 
°C/W 


Fan/Heat Sink 
2.4 


Table 9·8. Pentlum™ OverDrive Processor 
Typical and Maximum lee Values 


System 
Processor 
Processor 
Frequency 
Typical lee 
Maximum lee 
(MHz) 
(mA) 
(mA) 


25 
TBD 
1900 


33 
TBD 
2500 


Ice is dependent 
upon 
the Vee level of the 
system, 
processor 
bus loading, 
software 
code sequences, 
and sil- 


icon process 
variations. 


TA(in) 
= 85°C - 
(12.5 Watts 
• 2.4 °C/Watt) 


TA(in) 
= 85°C 
- 
30"C 


TA(in) 
= 55°C 


Maximum 
TA(in) 
is specified 
and 
can be verified 
by 
using the equation 
and parameters 
provided. 


TA(in) 
= TSINK 
- 
(Power 
• 8sv 


TA(in) 
= 85°C - 
«2.5A 
• 5V) • 2.4 °C/W) 


Assuming 
the internal 
system 
ambient 
TA(in) 
is within 
5°C-IO"C 
of TA(out), 
this would 
allow the maximum 


Ta(out) 
temperature 
to be approximately 
45°C-50"C. 


It is the responsibility 
of the system 
designer 
to ensure 


TA(in) 
meets 
this specification 
by providing 
sufficient 


airflow 
around 
the Pentium 
OverDrive 
Processor 
to re- 


move the heated 
air expelled 
by the fanlheat 
sink. 


9.2.4 
PENTIUMTM 
OVERDRIVE 
PROCESSOR 
SOCKET 
PINOUT 
FOR 5V SL ENHANCED 
INTEL486 
DX2 CPU-BASED 
SYSTEMS 


A 
8 
C 
D 
E 
F 
G 
H 
J 
K 
L 
M 
N 
P 
Q 
R 
S 
T 
U 


19 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
19 
NC 
RES 
vss 
vcc 
vss 
INIT 
vss 
vss 
vcc 
vcc 
vcc 
vss 
vss 
RES 
vss 
vcc 
vss 
RES 
RES 


18 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
18 
RES AHOLOEAOS. BS16. 
BOFF. vss 
BE3· 
vss 
vss 
PCO 
vss 
vss 
vss 
W/R. 
vss 
PCHK. 
INC 
ADS· 
RES 


17 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
17 
vss 
INTR 
RES 
RESET BS8. 
vcc 
ROY· 
vcc 
vcc 
BEl. 
vcc 
vcc 
vcc •••/10. 
vcc PLOCK. BLAST. A4 
vss 


16 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
16 
vcc IGNNE. 
N•••I FLUSH. 
A20 •••• HOLD KEN.sTPCLK.BROY. 
BE2. 
BEO. 
PWT 
o/c. 
LOCK. 
HLOA 
BRED 
A3 
A6 
vcc 


15 
0 
0 
0 
0 
• • • 
• • • 
0 
0 
0 
0 
15 
vss 
RES 
UP. 
INC 
PLUG PLUG PLUG 
PLUG PLUG PLUG 
A2 
vcc 
vss 
vss 


14 
0 
0 
0 
0 
• 
• 
0 
0 
0 
0 
1.4- 
vss 
FERR. 
INC 
NC 
PLUG 
PLUG 
A7 
A8 
AlO 
vss 


13 
0 
0 
0 
0 • 
• 
0 
0 
0 
0 
13 
vss 
INC 
INC s •••IACT. 
PLUG 
PLUG 
AS 
All 
vss 
vss 


0 
0 
0 
0 
Pentium 
OverDrive Processor 
Socket for 
0 
0 
0 
0 
12 
vss 
vss 
vcc 
INC 
5V i486 
DX2 CPU-based 
systems 
A9 
vcc 
vss 
vss 
12 


11 
0 
0 
0 
0 
0 
0 
0 
0 
11 
vcc 
INC 
S•••I. 
INC 
A13 
vcc 
vss 
vcc 


10 
0 
0 
0 
0 
0 
0 
0 
0 
10 
vcc 
vss 
vcc 
030 
A16 
vcc 
vss 
vcc 
TOP SIDE VIEW 
9 
0 
0 
0 
0 
0 
0 
0 
0 
9 
vcc 
029 
031 
028 
237 
Contact 
PGA 
A20 
vcc 
vss 
vcc 


8 
0 
0 
0 
0 
0 
0 
0 
0 
8 
vss 
vss 
vcc 
026 
A22 
A15 
A12 
vss 


7 
0 
0 
0 
0 • 
Rev, 8/4/93 
• 
0 
0 
0 
0 
7 
RES 
024 
025 
027 
PLUG 
PLUG 
A24 
vcc 
vss 
vss 


6 
0 
0 
0 
0 • 
• 
0 
0 
0 
0 
6 
RES 
OP3 
vss 
vcc 
PLUG 
PLUG 
A2l 
A18 
AU 
vss 


5 
0 
0 
0 
0 
0 • • 
• • • 
0 
0 
0 
0 
5 
vss 
023 
vss 
vcc 
KEY 
PLUG PLUG 
PLUG PLUG PLUG 
A19 
vss 
INC 
vss 


.4- 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
4 
vcc 
RES 
vss 
CLK 
017 
010 
015 
012 
OP2 
016 
014 
07 
04 
OPO 
A30 
A17 
vcc 
A23 
vcc 


3 
• 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
3 
PLUG 
022 
021 
018 
013 
vcc 
08 
vcc 
03 
05 
vcc 
06 
vcc 
01 
A29 
vss 
A25 
A26 
vss 


2 
• 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
2 
PLUG 
020 
019 
011 
09 
vss 
OPI 
vss 
vss 
VCC 
vss 
vss 
vss 
02 
00 
A31 
A28 
'A27 
RES 


1 
0 • • 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
1 
EY 
PLUG PLUG 
vcc 
vss 
RES 
RES 
vss 
vcc 
vcc 
vcc 
VS5 
RES 
RES 
vss 
vcc 
vss 
RES 
RES 


A 
B 
C 
D 
E 
F 
G 
H 
J 
K 
L 
M 
N 
P 
Q 
R 
S 
T 
U 


241696-57 


NOTE: 
All NC and RES pins must remain unconnected. 


Figure 9·7. Pentlum™ 
OverDrive 
Processor 
Socket 
Pinout 
for 5V SL Enhanced 
Intel486 
DX2 CPU-Based 
Systems 
(Top Side View) 


• 


intel~ 


U 
T 
S 
R 
Q 
P 
N 
M 
L 
K 
J 
H 
G 
F 
E 
D 
C 
B 
A 


19 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
19 
RES 
RES 
vSS 
vCC 
vSS 
RES 
vSS 
vSS 
vCC 
vCC 
vCC 
vSS 
vSS 
INIT 
vSS 
vCC 
vSS 
RES 
NC 


18 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
18 
RES 
ADS" 
INC 
PCHK" 
vSS 
W/R" 
vSS 
vSS 
vSS 
PCO 
vSS 
vSS 
BE3" 
vSS 
BOff" 
BS16" 
[A~S'' 
AHOlO 
RES 


17 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
17 
vss 
A4 
BLAST" PLOCK" VCC M/IO" 
VCC 
VCC 
VCC 
BEl" 
VCC 
VCC 
ROY" 
VCC 
BS8" 
RESET 
RES 
INTR 
vss 


16 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
16 
vcc 
A6 
A3 
BREO HLOA LOCK" 
O/C" 
PWT 
BEO" 
BE2" 
BROY" STPCLK" KEN" 
HOLD A20M" 
fLUSH" 
NMI 
IGNNE" vcc 


15 
0 
0 
0 
0 • • • 
• • • 
0 
0 
0 
0 
15 
vss 
vss 
vcc 
A2 
PLUG 
PLUG 
PLUG 
PLUG 
PLUG 
PLUG 
INC 
UP" 
RES 
vss 


14 
0 
0 
0 
0 
• 
• 
0 
0 
0 
0 
14 
vss 
Ala 
A8 
A7 
PLUG 
PLUG 
NC 
tNC 
fERR" 
vss 


13 
0 
0 
0 
0 
• 
• 
0 
0 
0 
0 
13 
vss 
Vss 
All 
AS 
PLUG 
PLUG SMIACT" IHe 
INC 
Vss 
0 
0 
0 
0 
Pentium 
OverDrive 
Processor 
Socket 
for 
0 
0 
0 
0 
12 
vss 
vss 
vcc 
A9 
5V i486 
DX2 CPU-based 
systems 
INC 
vcc 
vss 
vss 
12 


11 
0 
0 
0 
0 
0 
0 
0 
0 
11 
vcc 
vss 
vcc 
A13 
INC 
SMI" 
INC 
vcc 


10 
0 
0 
0 
0 
0 
0 
0 
0 
10 
vcc 
vss 
vcc 
A16 
030 
vcc 
vss 
vcc 
BOTTOM SIDE VIEW 


9 
0 
0 
0 
0 
0 
0 
0 
0 
9 
vcc 
vss 
vcc 
A20 
237 
Contact 
PGA 
028 
031 
029 
vcc 
8 
0 
0 
0 
0 
0 
0 
0 
0 
8 
vss 
A12 
A15 
A22 
026 
vcc 
vss 
vss 
7 
0 
0 
0 
0 • 


Rev. 8/4/93 
• 
0 
0 
0 
0 
7 
vss 
vss 
vcc 
A24 
PLUG 
PLUG 
027 
025 
024 
RES 


6 
0 
0 
0 
0 • 
• 
0 
0 
0 
0 
6 
vss 
A14 
A18 
A2l 
PLUG 
PLUG 
vcc 
vss 
OP3 
RES 


5 
0 
0 
0 
0 • • • 
• • 
0 
0 
0 
0 
0 
5 
vss 
tNC 
vss 
A19 
PLUG 
PLUG 
PLUG 
PLUG 
PLUG 
KEY 
vcc 
vss 
023 
vss 
4 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
4 
vcc 
A23 
vcc 
A17 
A30 
OPO 
04 
07 
014 
016 
OP2 
012 
015 
010 
017 
ClK 
vss 
RES 
vcc 
3 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 • 
3 
vss 
A26 
A25 
vss 
A29 
01 
vcc 
06 
vcc 
05 
03 
vcc 
08 
vcc 
013 
018 
021 
022 
PLUG 


2 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 • 
2 
RES 
A27 
A28 
A31 
DO 
02 
vss 
vss 
vss 
vcc 
vss 
vss 
OPl 
vss 
09 
011 
019 
020 
PLUG 


1 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 • • 
0 
1 
RES 
RES 
vss 
vcc 
vss 
RES 
RES 
vss 
vcc 
vcc 
vcc 
vss 
RES 
RES 
vss 
vcc 
PLUG 
PLUG 


U 
T 
S 
R 
Q 
P 
N 
M 
L 
K 
J 
H 
G 
F 
E 
D 
C 
B 
A 
241696-58 


NOTE: 
All NC and RES pins must remain unconnected. 


Figure 9·8. Pentlum™ 
OverDrive 
Processor 
Socket 
Pinout 


for 5V SL Enhanced 
Intel486 
DX2 CPU·Based 
Systems 
(Bottom 
Side View) 


Address 
Data 
Control 
Control 
RES· 
VCC 
Vss 


A2 
R15 
DO 
02 
A20M# 
E16 
ROY# 
G17 
A6 
A4 
L1 
A5 
M18 
A3 
516 
01 
P3 
A05# 
T18 
RE5ET 
017 
A7 
A9 
L3 
A8 
M19 
A4 
T17 
02 
P2 
AHOLO 
B18 
5MI# 
C11 
A18 
A10 
L17 
A12 
N2 
A5 
R13 
03 
J3 
BEO# 
L16 
5MIACT# 
013 
A19 
A11 
L19 
A13 
N18 
A6 
T16 
04 
N4 
BEH 
K17 
5TPCLK# 
H16 
B4 
A16 
M17 
A14 
N19 
A7 
R14 
05 
K3 
BEU 
K16 
UP# 
C15 
B15 
C8 
N3 
A15 
01 
A8 
514 
06 
M3 
BE3# 
G18 
W/R# 
P18 
B19 
C10 
N17 
A17 
018 
A9 
R12 
07 
M4 
BLA5T# 
517 
C17 
C12 
017 
B8 
019 
A10 
T14 
08 
G3 
BOFF# 
E18 
F1 
01 
R1 
B10 
R3 
A11 
513 
09 
E2 
BROY# 
J16 
G1 
05 
R19 
B12 
51 
A12 
T8 
010 
F4 
BREO 
R16 
N1 
06 
54 
C4 
55 
A13 
R11 
011 
02 
B58# 
E17 
P1 
019 
57 
C5 
519 
A14 
T6 
012 
H4 
B516# 
018 
P19 
F3 
59 
C6 
T7 
A15 
58 
013 
E3 
CLK 
04 
T1 
F17 
510 
C19 
T9 
A16 
R10 
014 
L4 
O/C# 
N16 
T19 
H3 
511 
E1 
no 


A17 
R4 
015 
G4 
OPO 
P4 
Position 
U1 
H17 
512 
E19 
T11 


A18 
56 
016 
K4 
OP1 
G2 
KEY 
E5 
U2 
J1 
515 
F2 
T12 
A19 
R5 
017 
E4 
OP2 
J4 
KEY 
A1 
U18 
J17 
U4 
F18 
T13 
A20 
R9 
018 
03 
OP3 
B6 
PLUG 
A2 
U19 
J19 
U9 
G19 
T15 


A21 
R6 
019 
C2 
EA05# 
C18 
PLUG 
A3 
N/C· 
K1 
U10 
H1 
U3 


A22 
R8 
020 
B2 
FERR# 
B14 
PLUG 
B1 
A19 
K2 
U11 
H2 
U5 
A23 
T4 
021 
C3 
FLU5H# 
016 
PLUG 
C1 
014 
K19 
U16 
H18 
U6 


A24 
R7 
022 
B3 
HLOA 
016 
PLUG 
E6 
INC 
H19 
U7 


A25 
53 
023 
B5 
HOLD 
F16 
PLUG 
E14 
B11 
J2 
U8 
A26 
T3 
024 
B7 
IGNNE# 
B16 
PLUG 
E15 
B13 
J18 
U12 
A27 
T2 
025 
C7 
INIT 
F19 
PLUG 
F5 
C13 
L2 
U13 
A28 
52 
026 
08 
INTR 
B17 
PLUG 
F15 
C14 
L18 
U14 
A29 
03 
027 
07 
KEN# 
G16 
PLUG 
P5 
011 
M1 
U15 
A30 
04 
028 
09 
LOCK# 
P16 
PLUG 
P15 
012 
M2 
U17 
A31 
R2 
029 
B9 
M/IO# 
P17 
PLUG 
05 
015 
030 
010 
NMI 
C16 
PLUG 
06 
518 
031 
C9 
PCO 
K18 
PLUG 
014 
T5 
PCHK# 
R18 
PLUG 
015 
PLOCK# 
R17 
PWT 
M16 


• All RES pins are reserved for later use by Intel.To ensure proper operation of the microprocessor,all RES and N/C pins 
should be left unconnected. Please contact Intelfor design information. 


The 5V Pentium OverDrive Processor is compatible 
with the A.C. specifications for the 5V SL Enhanced 
Intel486 DX2 CPU (maximum lee current values pro- 
vided in Section 9.2.3). 


• 


intel~ 


Revision 
-002 of the 
SL Enhanced 
Inte1486 
Microprocessor 
Data 
Sheet 
Addendum 
contains 
many 
updates 
and 
improvements 
to the original 
version. 
The sections 
significantly 
revised 
since version 
-001 are: 


Table 
1-1 
Added 
3.3V SO MHz 
version 
of the Inte1486 DX2 CPU. 


Table 2-3 
Added 
note to explain 
that pins A3, A14, Bl4 and Bl6 
are Boundary 
Scan pins for the SO MHz 
version 
of the standard 
Intel486 
DX CPU. 


Corrected 
PGA 
control 
pins, Fl7 
and AlO. 


Added 
new table to show pinout 
differences 
between 
the SL Enhanced 
Inte1486 SX CPU and the 


SL Enhanced 
Intel486 
DX and DX2 CPUs 
for pins 66 and 72. 


Added 
clarification 
for the STPCLK 
# signal with respect 
to the Stop Grant 
bus cycle. 


Table 2-4 


Table 2-5 


Table 2-7, 
Sections 
3.2 
and 7.3.2 


Figure 
3-1 


Section 
3.5.6 


Section 
4.3.1 


Section 
4.3.3.1 
and Table 4-1 


Sections 
6.0 
to 6.4 


Table 6-1 


Table 6-2 


Table 6-4 


Figure 
6-1 


Table 6-11 


Tables 
6-15a 
and 6-15b 


Tables 
6-20, 6-21 
and 6-22 


Section 
6.4 


Figures 
9-4 
and 9-5 


Section 
9.2 


Modified 
figure 
to show that 
STPCLK 
# must 
be deasserted 
for at least S clocks 
after 
ROY # 


becomes 
active. 


Added 
statement 
about 
Auto 
Idle Power 
Down. 


Added 
clarification 
about 
the SMI # specification. 


Added 
information 
about 
SMRAM 
state save map. 


Added 
max. D.C. spec. of 6 pF for CCLK. 


Changed 
3.3V Inte1486 SX-33 CPU 
typical 
and maximum 
Icc 
specifications. 


Added 
Icc 
values for 3.3V Inte1486 DX2 CPU. 


Added 
3.3V Intel486 
DX2-S0 
CPU 
to graph. 


Added 
minimum 
CLK 
frequency 
specifications 
for SOand 66 MHz versions 
of SV Inte1486 DX2 


CPU. 


Changed 
8JA and 8JC specifications 
for Intel486 
DX and DX2 CPUs. 


Recalculated 
Tambient values for Intel486 
SX, DX and DX2 CPUs 
based 
on Tease of 8SoC, new 


8JA, 8JC and Icc 
specifications. 


Added 
capacitive 
derating 
information 
section. 


Corrected 
pinouts 
of the OverDrive 
processor 
for SV Intel486 
SX and DX CPUs. 


Modified 
SV OverDrive 
Processor 
Socket section 
to reflect new Pentium™ 
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APPENDIX 
A 
SYSTEM DESIGN NOTES 


When the SL Enhanced Intel486 CPUs are operating in 
Real Mode, the physical address at which instructions 
and data are fetched is determined by the segment reg- _ 
ister and an offset (i.e., CS and IP for instructions). 
When a new value is loaded into a segment register, the 
new value is shifted to the left by four bits and stored in 
a segment base register that corresponds to that partic- 
ular segment (CSBASE, DSBASE, ESBASE, etc.). It is 
the value stored in the segment base register that is 
actually used to generate a physical address. For exam- 
ple, the linear address to be used for fetching instruc- 
tions is determined by adding the value contained in the 
CS segment base register with the value in the IP regis- 
ter. 


When the CPU is in Protected Mode, the segment reg- 
isters are used as selectors to a descriptor table. Each 
descriptor in a descriptor table contains information 
about the segment in use, including the segments BASE 
address (i.e., CSBASE), the limit (or size of the seg- 
ment), as well as protection level, privileges, operand 
sizes, and the segment type. In Protected Mode, the 
linear address is determined by adding the base portion 
of the descriptor to the appropriate offset. 


When in System Management Mode, the CPU operates 
in a pseudo-Real Mode, with address calculation per- 
formed in the Real Mode manner. However, the CPU 
adds the value in the segment base register with the 
value in the EIP register, rather than the IP register, so 
there are no limits as to the segment size. The physical 
address of an instruction is obtained by adding the val- 
ue in CSBASE to the value in EIP. 


When entering SMM, it may be necessary to initialize 
the segment registers to point to SMRAM (see section 
4.4.2. for their value on SMM entry). If 5MBASE has 
not been relocated, then the necessary segment registers 
can be initialized to point to SMRAM by using the 
value in the CS register, 3000H, which points to the 
SMRAM address space. 


When an SMI # occurs after 5MBASE has been modi- 2 
fied, CSBASE 
is loaded 
with 
the 
new 
value - of 
5MBASE. However, the CS selector register stiD con- 
tains the value 3000H, not the value corresponding to 
the new 5MBASE. 


In order to initialize the segment registers to point to 
the new SMRAM area, we need to read the value of 
5MBASE from the SMM state save in memory. Since 
the data segment registers are initialized to 0, we can- 
not use them to access the SMM state save area. How- 
ever, we can perform a read relative to the CS register 
by using a CS override prefix to a normal memory read. 
Although CS still contains 3000H, CSBASE contains 
the value of 5MBASE, and CSBASE is used for the 
address generation. 


Once the value of 5MBASE is obtained, it must be 
shifted to the right by four bits to get the appropriate 
value to be placed in the segment registers. The CS 
register itself can be initialized by executing a far jump 
- instruction to an address within 5MBASE, which caus- 
es CS to be reloaded with a value corresponding to 
5MBASE. 


Figure A-I describes one method of initializing the seg- 
ment registers when 5MBASE has been relocated. This 
method works if 5MBASE is less than I Megabyte. 
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mov 
mov 
si,FEF8H 
eax,cs:[si] 
;SMBASE slot in SMM state save area 
;copy 5MBASE from 5MBASE:FEF8H 
to eax 


mov 
ror 
cl,4 
eax,cl 


;to load cs, we need to execute a far jump to an address 
that has been stored 
;at memory 
location 
PTR_ADDR 


;store the 5MBASE value and an offset to a memory 
location 
that can be used as 
;an indirect 
jump address 


;PTR_ADDR is the location used to 
;store the jump address 
;OFFSET is the address where 
;execution continues 
after the 
;far jump 


mov 
inc 
inc 
mov 


cS:[di],bx 
di 
di 
cs: [di] ,ax 
;store the segment address 
for the 
;far jump, which 
is 5MBASE 
;bx now contains the address 
of the 
;location holding 
the jump address 


;initialize DS and ES with the correct address 
of 5MBASE 


ds,ax 
es,ax 


;jump to address 
stored at memory 
;location pointed 
to by bx 


;CS now contains 
the correct value 
of 5MBASE, and execution 
continues 
from the 
;address 5MBASE:OFFSET 


A.2.1 LOADING SMRAM WITH AN INITIAL SMI 
HANDLER 


Under normal conditions, the SMRAM address space 
should only be accessible by the CPU while it is in 
SMM mode. However, some provision must be made 
for providing the initial SMM interrupt handler rou- 
tine. 


Since System Management Mode must be transparent 
to all operating systems, the initial SMM handler must 
be loaded by the system BIOS. At some time during the 
power on sequence, the system BIOS will need to move 
the SMM handler routine from the BIOS ROM to the 
SMRAM. The system designer must provide a hard- 
ware mechanism that allows access to SMRAM while 
SMIACT # 
from the CPU is inactive. One method 
would be to provide an I/O port in the memory con- 
troller that forces memory cycles at a given address to 
be relocated to the SMRAM. Once the initial SMM 
handler has been loaded to SMRAM, the I/O 
port 
would be disabled to protect against accidental accesses 
toSMRAM. 


The system BIOS must provide an SMM handler at the 
address 38000H. If the system designer has chosen to 


take advantage of the SMRAM relocation feature of 
the CPU, this handler must change the 5MBASE regis- 
ter in the SMM state save. Next, the BIOS must move 
the full featured SMM handler to the new address. An 
SMI # 
must be generated 
in order 
to change the 
5MBASE register before the BIOS passes control to the 
operating system. 


A.2.2 SMRAM HIDDEN FROM DMA AND BUS 
MASTERS 


In a system that allows DMA or other devices to take 
control of the system bus, care must be taken to ensure 
that only the master CPU can access SMRAM. If an 
external bus master requests use of the system bus (by 
asserting HOLD or BOFF#) 
while the CPU is execut- 
ing an SMM handler routine, the CPU would respond 2 
by passing control of the bus to the requesting device. 
The system memory controller must redirect any mem- 
ory accesses that are not generated by the CPU to nor- 
mal system memory as if SMIACT# 
was inactive. 


DMA accesses to the SMRAM area must be redirected 
to the correct address space when the initialization rou- 
tine is loading SMRAM, as well as when the CPU is in 
SMM. 


\~----- 
\~ 
~r- 
'-- 
SMRAMCS# blockedby 
HLDA or BOFF# 


It is not recommended to block bus control requests 
when in SMM, since the increased bus access latency 
could cause compatibility issues with some software or 
expansion hardware. 


A.2.3 
ACCESSING 
SYSTEM 
MEMORY 
FROM 
WITHIN 
SMM 


In order to enter a suspend state where power is re- 
moved from some or all of system memory, it is neces- 
sary for the CPU to have access to the entire system 
address space from within SMM. Access to system 
memory from within SMM requires that the memory 
controller decode both SMIACT# 
and the CPU ad- 
dress to determine accesses to SMRAM. Only those 
memory addresses that are defmed as being SMRAM 
space would be directed to SMRAM. If SMRAM is 
located at an address that 
overlays normal system 
memory address space (see section 4.6.2.), the CPU 
must have a method of accessing both SMRAM (for 
code reads) and system memory simultaneously. 


Ideally, a method of accessing system memory that is 
mapped underneath 
SMRAM would be provided by 
the system memory controller. The memory controller 
would provide a register that allows system memory at 
a given address to be remapped to a different address, 
which is not overlaid by SMRAM. When the SMM 
handler implements a suspend, it would first move all 
of system memory that is not underneath SMRAM to a 
non-volatile medium (such as a hard disk drive). Next, 
the SMRAM image would be transferred to the non- 
volatile 
medium. 
Finally, 
the 
memory 
underneath 
SMRAM would be accessed and copied to the non-vol- 
atile medium with a CPU read to the remap address 
space, which is redirected to the overlaid system memo- 
ry (see Figure A-3). 
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Remap feature 
CPU accesses 
disabled 
to remap 
address space 
Remap feature 
enabled 


If the memory controller does not provide a method of 
accessing overlaid system memory, it is possible to im- 
plement a software procedure to accomplish the same 
goal. However, the software method is quite complex, 
and a hardware method is preferred. A description of 
the software method follows. 


The ability to access the system memory that is located 
in the address space under SMRAM requires a method 
of resuming from SMM to a predetermined 
address 
space. This can be accomplished with the following 
procedure. 


When resuming from SMM, the CPU continues execu- 
tion at the address contained in the CS and EIP slots 
within the SMM state save. However, the resume ad- 
dress cannot be changed by simply modifying the CS 
and EIP slots, since the CPU will use the CS descriptor 
to determine the actual resume address. The descriptor 
registers are stored in reserved slots in the SMM state 
save, and they cannot be directly modified. 


By replacing the suspend state save with a previously 
obtained image of a state save that returns to a known 
location, the SMM suspend handler can force a return 
to a given address: 
I. During initial system power up, execute an SMI # 
from a predetermined address (the address immedi- 
ately preceding the address to which you later wish 
to resume). This can be accomplished by generating 
an SMI # in response to an I/O instruction or exe- 
cuting a halt instruction and using an SMI # to exit 
the halt state. 


2. Save the state save from this SMM to a safe location 
(SMRAM). 
3. When the system needs to resume to a given address 
from some other SMI "', the stored state save can be 
substituted for the state save generated from that 
particular SMM. 


Now that we have the ability to resume from SMM to a 
predetermined address, we can access the entire system 
memory space from within SMM before executing a 
suspend: 
1. During a suspend SMM, save all system memory 
except that which is located underneath SMRAM to 
a specified (and reserved) section of the hard disk. 
The ability to access system memory requires the 
memory controller to decode both SMIACf'" 
and 
the CPU address, and direct a limited section (may- 
be 64 or 
128K) of the CPU 
address 
space to 
SMRAM. All other CPU memory accesses should 
go to normal system memory. 
2. Save the contents of the SMM state save to the hard 
disk. 


3. Modify the SMM state save so that the RSM in- 
struction will return to a predefJ,lledaddress, which 
is not in the application that was interrupted. The 
code at this address must contain the remainder of 
the suspend SMM handler. The predefined address 
can be anywhere in the CPU address space, since 
the contents of system memory have already been 
saved to disk. 
4. Execute a RSM instruction, which exits SMM and 
returns control to a predetermined address (which 
must contain the rest of the SMM suspend handler). 


5. Save the rest of system memory (that which is locat- 
ed underneath SMRAM) to the hard disk. This ad- 
dress space can now be accessed with normal move 
instructions, since we are no longer in SMM. 


6. Save a flag (in CMOS memory) indicating that the 
next reset should cause a resume from suspend. 
7. Powerdown the memory (and possibly the CPU). 
8. When power is restored, the CPU is reset and begins 
execution of the POST in BIOS. Early in the POST, 
the system should check the status of the suspend 
flag. 


9. Load 
a preliminary 
SMM 
handler 
to 
location 
38000H and generate an SMI"'. The SMM handler 
should read the 5MBASE slot from the SMM state 
save that was stored to hard disk. 5MBASE is then 
modified to point to the final SMRAM location and 
the system resumes from SMM back to the system 
BIOS. 
10. Restore the contents of system memory located un- 
derneath SMRAM from the hard disk. 


II. Generate a second SMI'" , which executes an SMM 
handler at the original value of 5MBASE (before 
the suspend SMM). The SMM handler restores the 
contents of the rest of system memory from the hard 
disk, and then restores the original SMM state save 
to the SMM state save area in SMRAM, discarding 
the most recent SMM state save. 
12. Execute an RSM instruction, which returns execu- 


tion to the application that was interrupted by the 
suspend request. 


A.3 
Interrupts 
and Exceptions 
During 
SMM Handler Routines 


To ensure transparency to existing system software, the 
SMM handler should not depend on interrupt or excep- • 
tion handlers provided by the operating system. How- 
ever, in some cases it may be necessary to service inter- 
rupts 
or exceptions 
while in System Management 
Mode. In these cases, SMM compliant interrupt and 
exception handlers, as well as an SMM compliant inter- 
rupt vector table, should be provided. 


An SMI'" 
interrupt 
request can be generated while 
code is running under any of the other three CPUoper- 
ating modes (Real, Virtual-86, or Protected). When en- 
tering the SMM handler, the CPU enters a pseudo-real 
mode, and the beginning of the interrupt vector table 
must be located at the address OOOOOOOOH. 
Before al- 
lowing any interrupts or exceptions to occur, the SMM 
handler routine must provide a valid interrupt vector 
table. Any code that is executed before setting up an 
SMM compliant interrupt vector table must be written 
carefully to ensure that no exceptions are generated. 


The system memory controller could relocate accesses 
to the SMM interrupt vector table to a location within 
SMRAM. In this case, when SMIACf'" 
is active, all 
accesses to the lowest I Kbyte of the CPU address 
space would be redirected to SMRAM, which would 
contain an SMM compliant vector table that has al- 
ready been initialized. 


If the system memory controller does not redirect inter- 
rupt vector table reads to an address within SMRAM, 
there are three steps required to provide an SMM com- 
pliant interrupt vector table: 
I. Save the contents of memory at address OOOOOOOOH 
to SMRAM 
2. Provide vectors for any possible interrupts or execp- 
tions at the apropriate location in the vector table 
3. Restore 
the 
original 
memory 
contents 
from 
SMRAM before exiting the SMM handler routine 


A.3.2 
INTERRUPTS 
AND SUBROUTINES 
WITH 
SMRAM 
RELOCATION 


There is an additional issue that must be considered if 
interrupts or exceptions are to be executed within SMM 
and SMRAM has been relocated. Interrupt or subrou- 
tine calls from within SMM operate in a manner similar 
to Real Mode. When a subroutine is called or an inter- 
rupt is recognized, the 16-bit CS and IP registers are 
pushed onto the stack to provide a return address. 


When SMRAM is relocated to an address space above 
1M and an interrupt or subroutine call occurs, only 16 
bits of the EIP register are pushed onto the stack. 
When returning from the subroutine or interrupt, the 
CPU will vector to a location where the upper 16bits of 
EIP are zero. This can be avoided for subroutines by 
using an address size override before calling the subrou- 
tine. However, the issue remains for interrupts. 


A.4 
Floating Point Operation 
and 
SMM 


A.4.1 
THE NEED TO SAVE THE FPU 
ENVIRONMENT 


When the CPU enters System Management Mode, the 
context information for the interrupted application is 
automatically 
saved to a specific state save address. 


When the SMM handler returns control to the inter- 
rupted application by executing the RSM instruction, 
the context information from the interrupted applica- 
tion is restored to the CPU by reading from the state 
save location. This mechanism allows the SMM han- 
dier routine to modify most of the CPU registers with- 
out the need to explicitly save them to memory. How- 
ever, the registers in the CPU's Floating Point Unit 
(FPU) are not automatically saved when the CPU en- 
ters SMM. If the SMM handler needs to modify any of 
the registers in the FPU, or if the register data will be 
lost due to entering a power down state, the SMM han- 
dier must first explicitly save the FPU state as it existed 
in the interrupted application. 


There are two instances in which an SMM handler rou- 
tine must be aware of the Floating Point Unit (FPU): 
1. When removing power from the CPU / FPU for the 
purpose of executing a suspend sequence. 


2. When the SMM handler uses FPU instructions. 


In both of these cases, the SMM handler must save the 
state of the FPU as it was left by the interrupted appli- 
cation. 


The information stored by the FPU state save instruc- 
tions (FSAVE, FNSA VE, FSTENV, and FNSTENV) 
is dependent on the operating mode of the CPU. The 
FPU state save instructions store the FPU state infor- 
mation in one offour formats: 16-bit Real Mode, 32-bit 
Real Mode, 16-bit Protected Mode, or 32-bit Protected 
Mode, depending on the CPU operating mode. The 
content of the information saved also varies slightly, 
depending on the CPU operating mode in which the 
save instruction was executed. For example, the 32-bit 
Protected Mode FNSA VE instruction saves the address 
of the last executed FPU instruction and its operands in 
the form of a segment selector and a 32-bit offset. In 
contrast, the 16-bit Real Mode FNSA VE instruction 
saves the address information in the form of a 20 bit 
physical address. Since the format with which the FPU 
state restore instructions (FRSTOR and FLDENV) re- 
call the information is also dependent on the operating 
mode of the CPU, the save and restore instructions 
must be executed from the same CPU operating mode. 


A.4.2 
SAVING 
THE STATE 
OF THE FLOATING 
POINT 
UNIT 


When an SMM handler routine needs to save the state 
of the Floating Point Unit, it must save all FPU state 
information necessary for the interrupted application to 
continue processing. This state information includes the 
contents of the Floating Point Unit stack, which re- 
quires use of the FNSA VE or FSAVE instruction 
(FSTENV 
does not save the contents 
of the FPU 


stack). If the last executed non-control Floating Point 
instruction caused an error (such as a divide by 0), the 
saved information must also include the address of the 
failing instruction and the addresses of any operands 
for that instruction. Without these addresses, it would 
be impossible for the FPU exception handler of the in- 
terrupted application to correct the error and restart 
the instruction. 


The FNSA VE and FSAVE instructions differ in that 
FNSA VE does not wait for the FPU to check for an 
existing error condition before storing the FPU envi- 
ronment information. If there is an unmasked FPU ex- 
ception condition pending, execution of the FSAVE in- 
struction will force the CPU to wait until the error 
condition is cleared by the software exception handler. 
Because the CPU is in System Management Mode, the 
appropriate exception handler will not be available, and 
the FPU error would not be corrected in the manner 
expected by the interrupted application program. For 
this reason, the FNSA VE instruction should be used 
when saving the environment of the FPU within SMM. 


Since the SMM handler does not know the CPU mode 
in which the interrupted application was executing (16 
or 32 bit, Real or Protected), the SMM handler must 
execute the FNSA VE instruction in a mode in which 


all FPU state information is stored. The 32-bit Protect- 
ed Mode format of the FNSA VE instruction is a super 
set of all other formats of the FNSA VE instruction. 
Therefore, 
executing 
the 
32-bit 
Protected 
Mode 
FNSA VE instruction ensures that all FPU state infor- 
mation will be saved. 


Executing the FNSA VE instruction in 32-bit Protected 
Mode requires that the CPU be temporarily placed in 
Protected Mode. Rather than perform all of the setup 
details and overhead necessary to place the CPU into 
Protected Mode, including the initialization of all de- 
scriptors and descriptor tables, it is possible to tempo- 
rarily place the CPU into Protected Mode for the pur- 
pose of executing only a few carefully written instruc- 
tions. This can be accomplished by setting the PE bit in 
the eRO register, and then executing a short jump to 
clear the instruction pipelines. 


It is important to note that any instruction that modi- 
fies a segment register will cause the CPU to attempt to 
load a new descriptor from the descriptor table. (The 
occurrence of an interrupt or an exception would cause 
the CPU to load a new descriptor, so interrupts must be 
disabled during this sequence.) Since neither the de- 
scriptors nor the descriptor table have been initialized, 
this would cause the system to crash. Therefore, all 


segment registers that are to be used in the FPU state 
save instructions 
must be initialized before entering 
Protected Mode. 


Figure A-4 gives an example of the code that can be 
used to place the CPU in Protected Mode and save the 
FPU state. 


Note that the no wait form (FNSA VE) of the save in- 
struction must be used. In the event that the previous 
FPU instruction caused a floating point error, we do 
not want to wait for this error to be serviced before 
executing the save instruction. 
Additionally, 
if the 
FSAVE instruction were used, the operand size over- 
ride prefix would be incorrectly applied to the implicit 
WAIT instruction which precedes FSAVE, rather than 
to the save instruction itself. 


Before exiting the SMM handler and returning to the • 
interrupted 
application, 
the register contents of the 
Floating Point Unit must be returned to their previous 
values. This can be accomplished by executing the 32- 
bit Protected Mode format of the FRSTOR instruction. 
Figure A-5 gives an example code segment that can be 
used to restore the FPU to the state in which it was 
interrupted by the SMI request. 


mov 
dX,SEGMENT 
;SEGMENT is the segment to be used by 
;the save instruction, 
mov 
dS,dx 
; normally 
it should point to SMRAM 
mov 
si,OFFSET 
;OFFSET is the offset used in the save 
;instruction 


;set the PE bit in CRO 


mov 
eax,crO 
;read the old value of CRO 
or 
eax,OOOOOOOlH 
;set the PE bit 
mov 
crO, eax 


jmp 
protect: 


;use an operand size override prefix 
;to set 32-bit format 
;FPU state saved to SEGMENT:OFFSET 


;now return to real mode to continue with the SMM handler 
(no jump is 
;required) 


mov 
and 
mov 


eax,crO 
eax,OFFFFFFFEH 
crO,eax 


mov 
dX,SEGMENT 
;SEGMENT is the segment 
to be used by 
;the restore 
instruction, 
mov 
dS,dx 
; normally 
it should point 
to SMRAM 
mov 
si,OFFSET 
;OFFSET is the offset used in the 
;restore instruction 


;set the PE bit in CRO 


mov 
eax,crO 
;read the old value 
of CRe. 
or 
eax,ObOOOOOlH 
;set the PE bit 


'mov 
crO, eax 


jmp 
protect: 


;we can now recall the state of the FPU from the previous 
FNSAVE 
instruction 
;(in the protected 
mode 
format) 


;use an operand 
size override 
prefix 
;to set 32-bi t format 
;FPU state restored 
from 
;SEGMENT :OFFSET 


;now return to real mode to continue 
with the SMM handler 
(no jump is 
;required) 


mov 
and 
mov 


eax,crO 
eax,FFFFFFFEH 
crO, eax 


Note that the no wait form (FNRSTOR) of the restore 
instruction must be used. If the FRSTOR instruction 
were used, the operand size override prefix would be 
incorrectly applied to the implicit WAIT instruction 
which precedes FRSTOR, rather than to the save in- 
struction itself. 


A.S 
Support for Power Managed 
Peripherals 


Before power is removed from any device, the state of 
that device must be saved in a protected memory space 
so that the device can be reinitialized to its previous 
state. If a peripheral contains a write only register, the 
value in that register can be recovered by providing 
shadow registers that are both readable and writeable. 


These shadow registers would be updated every time 
the peripheral registers are written, but they have no 
function other than tracking the data written to a par- 
ticular register. 


In addition to the write only registers in a system, there 
are several other registers that must be shadowed. Any 
device that requires registers to be programmed in a 
particular sequence must also have its registers shad- 
owed. Examples in a typical personal computer include 
the programmable interrupt controller, the DMA con- 
troller, and the programmable timer/counter. 


It is also possible to perform shadowing of some write 
only registers using SMM. Any time a write cycle is 
generated to a write only register, the system can gener- 
ate an SMI#. The SMM handler can use the CPU state 
information saved in the SMM state save to save the 
data from the interrupted I/O cycle to a predetermined 
location in the SMRAM space. 


intel~ 


The information contained in the SMM state save can 
be used (with the knowledge that the SMI# was in 
response to an I/O write instruction) to determine both 
the address and the data of the interrupted write in- 
struction. The SMM handler can examine the OP- 
CODEs of previous instructions by decrementing the 
IP (or EIP) register. Once the correct OPCODE is de- 
termined, it can be used with the values in the EAX 
and DX slots of the SMM state save to update the 
information in the memory used to shadow the I/O 
register. I/O write instructions occur in one of three 
forms: 1) a write to an address that is specified in the 
OPCODE; 2) a write to an address contained in the 
DX register; or 3) a string write to an address con- 
tained in the DX register. 


The I/O 
write instructions have the following OP- 
CODEs: 


Instruction 
OPCODE 
Notes 


OUTx,al 
E6x 
x is the address 
of the I/O port 


OUTx,ax 
E7x 
x is the address 
of the I/O port 


OUTx,eax 
E7x 
x is the address 
of the I/O port 


OUTdx,al 
EE 


OUTdx,ax 
EF 


OUTdx,eax 
EF 


OUTSB 
6E 


OUTSW 
6F 


OUTSD 
6F 


The SMM handler must know whether a particular 
I/O port is 16or 32 bits in order to distinguish between 
16 and 32 bit I/O write cycles. 


The SMM handler can decrement the value of IP con- 
tained in the state save, and then examine the memory 
contents at that address. If the SMM handler knows 
that the last instruction was an I/O write instruction, 
and writes to I/O addresses 6EH, 6FH, OEEH, and 
OEFH will not cause an SMI#, it can use the SMM 
state save data for EAX and EDX to reconstruct the 
last instruction. 


A.5.2 HANDLING INTERRUPTED I/O WRITE 
SEQUENCES 


In a typical personal computer, there are several hard- 
ware devices that require the control registers for that 
device to be progra=ed 
in a particular order. For ex- 
ample, the interrupt controller, the DMA controller, 


the programmable timer/counter, 
the keyboard con- 
troller, and the real time clock all require a series of 
accesses to properly initialize the registers in that par- 
ticular device. Some of these devices may require suc- 
cessiveaccessesto registers located at different address- 
es, while others may require several control registers to 
be programmed through write cycles to the same ad- 
dress. 


If an SMI request interrupts an application that is in 
the process of initializing the registers in one of these 
devices, special care must be taken to ensure that the 
peripheral is returned to its original state when control 
is returned to the interrupted program. For some SMM 
handler events, it may be necessary to power down the 
device or change the state of a register within the de- 
vice. In these cases, the SMM handler must return con- 
trol to the interrupted application in such a way that 
the application can continue with the correct sequential 
access in the interrupted sequence. 


To accomplish this, the SMM handler must restore the 
original values of all registers in the device, and restart 
the interrupted sequence so that the application may 
continue where it left off. This requires system hard- 
ware to shadow all registers that need to be accessed in 
the sequence, keep an index indicating which position 
in the sequence the register occupies, and keep a point- 
er so that SMM software knows to which register the 
last access was directed. This pointer would indicate 
the last register of each sequence that was programmed 
in the particular peripheral. 


For example, progra=ing 
the master interrupt con- 
troller requires a write to I/O port 20H (ICWl) fol- 
lowed by four write cycles to I/O port 21H (ICW2, 
ICW3, ICW4, and OCWl). If this sequence is inter- 
rupted by an SMI request, and the resulting SMM han- 
dler either modifies or powers down the interrupt con- 
troller, the SMM handler must return control to the 
interrupted application such that the following access 
to the interrupt controller would access the correct reg- 
ister in the sequence. System hardware must save the 
contents of each of the registers, as well as a pointer 
indicating which register was last written. 


Before returning control to the interrupted application, 
the SMM handler must initialize ICWI-ICW4 and 
OCWI to their previous values. It would then re-write 
the appropriate registers so that the first access by the 
application program would be to the location in the 
sequence followingthe last location it programmed be-. 
fore it was interrupted by the SMI request. 


A similar procedure must be followed for each of the 
peripherals that require control registers to be initial- 
ized in a particular order. 


• 


3.3V Intel486™ 
SX 
MICROPROCESSOR 


• 
Lower Supply Voltage Required 
- 
Operates from 3.0V to 3.6V 
- 
50% Lower Power than 5V CPU 


- 
Lower Power Consumption Produces 
Less Heat 
- 
Longer Battery Life for Portable 
Applications 


• 
Binary Compatible with Large Software 
Base 
-MS-DOS', 
OS/2", 
Windows' 
-UNIX'" 
System V/386 
-iRMX®, 
iRMK Kernels 


• 
High Integration Enables On-Chip 
- 
8 Kbyte Code and Data Cache 
- 
Paged, Virtual Memory Management 


• 
Easy To Use 
- 
Built in Self Test 
-Intel 
Software Support 
- 
Extensive Third Party Software 
Support 


• 
196-Lead PQFP Package 


• 
High Performance Design 
-lntel486 
One Clock Instruction Core 


- 
25 MHz, 20 MHz, and 16 MHz Clock 
Frequencies 
- 
80 Mbytes/sec Burst Bus at 25 MHz 


- 
CHMOS V Process Technology 


- 
Dynamic Bus Sizing for 8-, 16-, 
32-Blt Busses 


• 
Complete 32-Bit Architecture 
- 
Address and Data Busses 


-Registers 
- 
8-, 16-, 32-Bit Data Types 


• 
IEEE 1149.1 Boundary Scan 
Compatibility 
- 
For Surface Mount Production 
Testing 


The 3.3V 
Intel486 
SX microprocessor 
provides 
a low-cost 
entry 
point 
to powerful 
Intel486 
microprocessor 


based 
portable 
computing. 
The Intel486 
SX microprocessor 
is a binary compatible 
derivative 
of the Intel486 


OX microprocessor, 
giving it access 
to the $40 billion 
installed 
software 
base of over 50,000 
MS-DOS, 
Win- 


dows, OS/2, 
and UNIX system V/386 
applications. 
The Intel486 
SX microprocessor 
has the same integrated 


RISC integer core, 8 Kbyte cache memory, 
and memory 
management 
unit as the Intel486 
OX microprocessor. 


The high-performance 
RISC integer core of the Intel486 
SX microprocessor 
executes 
frequently-used 
instruc- 


tions in one clock cycle. An 8 Kbyte unified code and data cache allow this performance 
level to be sustained. 


An 80 Mbyte/sec 
burst bus at 25 MHz ensures 
high system 
throughput 
even with inexpensive 
DRAMs. 
The 


25 MHz Intel486 SX microprocessor 
has a Norton SI V5.0 rating of 54.1, a Dhrystone 
MIPS rating of 20.1 and a 


13.3 SPEC Integer rating. It provides 
up to 70% greater performance 
than a 33 MHz Intel386™ 
OX microproc- 


essor with an external 
cache 
(depending 
on the application) 
at a lower system 
cost. 


This documentation 
lists the A.C. and D.C. specifications 
of the 3.3V Intel486 
SX microprocessor 
and is a 


supplement 
to the Intel486™ 
SX Microprocessor/lntel487™ 
SX Math Coprocessor 
Data Book (Order 
Num- 


ber 240950-002). 


i386, Inte1386,1387,Intel387, i486, Inte1486,i487, Intel487 are trademarks of Intel Corp. 
'MS-DOS and Windows are trademarks of Microsoft Corp. 


"OS/2 
is a trademark of IBM. 
'''UNIX 
is a registered trademark of UNIX Systems labs 


The ratings listed in this section are stress ratings 
only, and functional operation at the maximumsis 
not guaranteed.Functionaloperatingconditions are 
given in the D.C. Specificationsand A.C. Specifica- 
tions sections. 


Extendedexposureto the MaximumRatingsmayaf- 
fect device reliability. Furthermore,although the In- 
tel486 SX microprocessor contains protective cir- 
cuitryto resistdamagefrom static electric discharge, 
alwaystake precautionsto avoid highstatic voltages 
or electric fields. 


CaseTemperatureunder Bias 
StorageTemperature 
Voltage on Any Pin with 
Respectto Ground 
SupplyVoltage with 
Respectto Vss 


D.C. SPECIFICATIONS 
Functionaloperating range:Vcc = 3.3V ± 0.3V; TCASE= O·Cto + 85·C 


3.3V Intel486™ 
SX Microprocessor 
D.C. Parametric 
Values 


Symbol 
Parameter 
Mln 
Max 
Units 


Vil 
Input lOW Voltage 
- 0.3 
+ 0.8 
V 


VIH 
Input HIGHVoltage 
2.0 
VCC+ 0.3 
V 


VOL 
Output lOW Voltage 
IOl = 4/5 mA 
0.45 
V 
IOl = 0.1 mA 
0. 
V 


VOH 
OutputHIGHVoltage 
IOH= -1/0.9 
mA 
V 
IOH= -0.1 mA 
V 


Icc 
SupplyCurrent 
ClK = 8 MHz 
150 
mA 
ClK = 16 MHz 
260 
mA 
ClK = 20 MHz 
290 
mA 
ClK = 25 MHz 
340 
mA 


III 
Input leakage Current 
± 15 
p.A 


IIH 
Input leakage Curren 
200 
p.A 


III 
Input leakage Cur nt 
- 400 
p.A 


ILO 
Output leakag 
± 15 
p.A 


CIN 
InputCapaci 
10 
pF 


COUT 
Outputor I/O Capacitance 
10 
pF 


CClK 
ClK Capacitance 
6 
pF 


NOTES: 
1. This parameter is measured at: 
Address, Data, BEn 
4.0 mA 
Definition, Control 
5.0 mA 
2. This parameter is measured at: 


Address, Data, BEn 
-1.0 mA 
Definition, Control 
-0.9 mA 


-65·C to + 110·C 
-65·Cto 
+ 150·C 


(Note4) 


(Note 5) 


(Note 6) 


FC = 1 MHz(7) 


FC = 1 MHz(7) 


FC= 1 MHz(7) 


3. Typical supply current (3.3V): 
ICC@ 
8 MHz= 90mA 


@ 16 MHz= 180 mA 


@ 20 MHz= 210 mA 


@ 25 MHz= 250 mA 
4. This parameter is for inputs without pUll ups or pUll 
downs and OV ,;; VIN ,;; Vcc. 
5. This parameter is for inputs with pull downs and 
VIH = 2.4V. 
6. This parameter is for inputs with pull ups and 
VIL = 0.45V. 
7. Not 100% tested. 


• 


A.C. specifications measurement is defined by the 
timing diagrams in the base Intel486 SX documenta- 
tion (refer to the cover page of this document for the 
order number). The Boundary Scan (JTAG) timing 
waveforms are included in this document following 
the A.C. Specifications section. 


The A.C. Specifications given in the following tables 
consist of output delays, input setup requirements 
and input hold requirementts. All A.C. specifications 
are relative to the rising edge of the ClK signal. 


16 MHz 3.3V Intel486™ 
SX Microprocessor 
A.C. Characteristics 
vcc = 3.3V ± 0.3V; TCASE= O°Cto + 85°C; CL = 50 pF unless otherwise specified. 


Symbol 
Parameter 
Mln 
Max 
Figure 
Units 
Notes 


Frequency 
8 
16 
MHz 
1X Clock 


t1 
ClK Period 
62.5 
125 
12.1 
ns 


t1a 
ClK Period Stability 
0.1% 
12.1 
ns 
Adjacent Clocks 


t2 
ClK High Time 
20 
1~\ 
ns 
at2V 


t3 
ClK low Time 
~O 
l f~~1 
ns 
at 0.8V 


t4 
ClK Fall Time 
A',,"'" ~ 
,(1 12.1 
ns 
2V to 0.8V 


t5 
ClK Rise Time 
",\Y-' 
..~~f8 
12.1 
ns 
0.8Vt02V 


t6 
A2-A31, PWT, PCD, BEO-3#, M/IO#~~, 


~ ~~' 


12.5 
ns 
W/R#, ADS#, lOCK#, 
BREQ, HlpA~\!afid D~~y 


t7 
A2-A31, PWT, PCD, BEO-3~£i!O\;, 
D~9:!!J~~.:.."\' 
42 
12.6 
ns 
After Clock Edges(1) 
W/R#, ADS#, lOCK#, 
BR~.At.DA 
F\I!"" ~V'IN, 


ta 
PCHK# Valid Delay 
~ 
vAtr' 
3 
35 
12.4 
ns 
.t ...•. 


taa 
BLASH, 
PLOCK# VaHdDelay/~"""AO'+ 
3 
35 
12.5 
ns 


tg 
BLAST#, PLOCK# Float Delll~ '..,~ 
42 
12.6 
ns 
After Clock Edges(1) 


t10 
00-031, 
DPO-DP3 Write D~VamN5elay 
3 
30 
12.5 
ns 


t11 
00-031, 
DPO-DP3 '«iit~Data 
Float Delay 
42 
12.6 
ns 
After Clock Edges(1) 


t12 
EADS# Setup Tim'~f" 
12 
12.2 
ns 


t13 
EADS# Hold Time 
3 
12.2 
ns 


t14 
KEN#, BS16#, BS8# Setup Time 
12 
12.2 
ns 


t15 
KEN#, BS16#, BS8# Hold Time 
3 
12.2 
ns 


t16 
RDY#, BRDY# Setup Time 
12 
12.2 
ns 


16 MHz 3.3V Intel486™ 
SX Microprocessor 
A.C. Characteristics 
vcc = 3.3V ±0.3V; TCASE = O°C to + 85°C; CL = 50 pF unless otherwise 
specified. 
(Continued) 


Symbol 
Parameter 
Min 
Max 
Figure 
Unit 
Notes 


t17 
ROY #, BROY # Hold Time 
3 
12.2 
ns 


t18 
HOLD, AHOLO, BOFF # Setup Time 
12 
12.2 
ns 


t19 
HOLD, AHOLO, 
BOFF # Setup Time 
3 
12.2 
ns 


t20 
RESET, FLUSH #, A20M #, NMI, INTR Setup Time 
14 
12.2 
ns 


t21 
RESET, FLUSH #, A20M # , NMI, INTR Hold Time 
3 
12.2 
ns 


t22 
00-031 
, OPO-OP3, 
A4-A31 
Read Setup Time 
10 
~ 
12.2 
ns 
. 


t23 
00-031 
, OPO-OP3, 
A4-A31 
Read Hold Time 
3 
12.2 
ns 


t24 
TCK Frequency 
16 
12.7 
MHz 
1X Clock 


t25 
TCK Period 
.~')~ 
£'> 
62.5 
12.7 
ns 
(Note 3) 


t26 
TCK High Time 
;,~~ 
JR:,.~ 
10 
12.7 
ns 
@2.0V 


t27 
TCK Low Time 
16!:...~ 
~~""-~ 
10 
12.7 
ns 
@0.8V 


t28 
TCK Rise Time 
~v 
~'\~~~' 
4 
12.7 
ns 
(Note 2) 


t29 
TCK Fall Time 
.~0-~~VdJ 
4 
12.7 
ns 
(Note 2) 


t30 
TOI, TMS Setup Time 
S:~~<:1::>"V 
8 
12.7 
ns 
(Note 4) 


t31 
TOI, TMS Hold Time 


v 
.~. 


10 
12.7 
ns 
(Note 4) 


t32 
TOO Valid Delay 
3 
25 
12.7 
ns 
(Note 4) 


t33 
TOO Float Delay 
TBO 
12.7 
ns 
(Note 4) 


t34 
All Outputs 
(Non-Test 
Valid Delay) 
3 
25 
12.7 
ns 
(Note 4) 


t35 
All Outputs 
(Non-Test 
Float Delay) 
36 
12.7 
ns 
(Note 4) 


t36 
All Inputs (Non-Test) 
Setup Time 
8 
12.7 
ns 
(Note 4) 


t37 
All Inputs (Non-Test) 
Hold Time 
10 
12.7 
ns 
(Note 4) 


NOTES: 
1. Not 100% tested, 
guaranteed 
by design 
characterization. 
2. Rise/Fall 
times 
are measured 
between 
0.8V and 2.0V. Rise/Fall 
times 
can be relaxed 
by 1 ns per 10 ns increase 
in TCK 
period. 
3. TCK period 
~ eLK period. 
4. Parameter 
measured 
from TCK. Boundary 
scan tested 
at Vcc greater 
than or equal to 3.3V. 


20 MHz 3.3V Intel486™ 
SX Microprocessor 
A.C. Characteristics 
vcc 
= 3.3V ±0.3V; TCASE= O·Cto + 85·C; CL = 50 pF unless otherwise specified. 


Symbol 
Parameter 
Min 
Max 
Figure 
Unit 
Notes 


Frequency 
8 
20 
MHz 1X Clock 


t1 
ClK Period 
50 
125 
12.1 
ns 


t1a 
ClK Period Stability 
0.1% 
12.1 
ns 
Adjacent Clocks 


t2 
ClK High Time 
16 
12.1 
ns 
at2V 


t3 
ClK low Time 
16 
12.1 
ns 
at 0.8V 


t4 
ClK Fall Time 
6 
12.1 
ns 
2VtoO.8V 


t5 
ClK Rise Time 
6 
12.1 
ns 
0.8Vto 2V 


t6 
A2-A31, PWT, PCD, BEO-3#, MIIO#, 
D/C#, 
3 
23 
12.5 
ns 
W/R#, 
ADS#, lOCK#, 
BREQ, HlDA Valid Delay 


t7 
A2-A31, PWT, PCD,BEO-3#, 
M/IO#, 
D/C#, 
37 


~~~ 


ns 
After Clock Edges(1) 


WIR #, ADS#, lOCK #, BREQ, HlDA Float Delay 


t8 
PCHK# Valid Delay 
t 
3 
28#; 
32.4 
ns 


t8a 
BLAST#, PLOCK# Valid Delay 
~, 
3 -ii', 
12.5 
ns 


t9 
BLAST#, PLOCK# Float Delay 
~~' 
1.•( 737 
12.6 
ns 
After Clock Edges(1) 


t10 
DO-D31 , DPO-DP3 Write Data Valid p~" 
(~~ ~,12.5 
ns 


t11 
DO-D31 , DPO-DP3 Write Data FI~~ay 
A 
~ 
"~7 
12.6 
ns 
After Clock Edges(1) 


t12 
EADS# Setup Time 
..6:"' .... 
0..~.to 
12.2 
ns 


t13 
EADS# Hqld Time 
...tjl.~. 
.....•-.~ 
.•.. 
3 
12.2 
ns 


t14 
KEN#, BS16#, BS8#~etr.pTim.~;"",.,~ 
10 
12.2 
ns 


t15 
KEN#, BS16#, BS8# HoldTJ~ 
~.:fl- 
3 
12.2 
ns 


t16 
RDY#, BRDY# SetupTime\..~· 
~~. 
10 
12.2 
ns 


t17 
RDY#'BRDY#~ 
3 
12.2 
ns 


t18 
HOLD, AHOLD, B 
Setup Time 
12 
12.2 
ns 


t19 
HOLD, AHOLD, BOFF# Setup Time 
3 
12.2 
ns 


t20 
RESET, FlUSH#, 
A20M#, NMI, INTR Setup Time 
12 
12.2 
ns 


t21 
RESET, FlUSH#, 
A20M#, NMI, INTR Hold Time 
3 
12.2 
ns 


t22 
DO-D31 , DPO-DP3, A4-A31 Read Setup Time 
8 
12.2 
ns 


t23 
DO-D31 , DPO-DP3, A4-A31 Read Hold Time 
3 
12.2 
ns 


t24 
TCK Frequency 
20 
12.7 
MHz 1X Clock 


t25 
TCK Period 
50 
12.7 
ns 
(Note 3) 


t26 
TCK High Time 
10 
12.7 
ns 
@2.0V 


t27 
TCK low Time 
10 
12.7 
ns 
@0.8V 


t28 
TCK Rise Time 
4 
12.7 
ns 
(Note 2) 


t29 
TCKFaliTime 
4 
12.7 
ns 
(Note 2) 


intel~ 


20 MHz 3.3V Intel486™ SX Microprocessor 
A.C. Characteristics 
vcc = 3.3V 
±0.3V; 
TCASE = O·C to + 85·C; Cl = 50 pF unless otherwise 
specified. 
(Continued) 


Symbol 
Parameter 
Mln 
Max 
e 
Unit 
Notes 


t30 
TDI, TMS Setup Time 
ns 
(Note 4) 


t31 
TDI, TMS Hold Time 
ns 
(Note 4) 


t32 
TOO Valid Delay 
ns 
(Note 4) 


t33 
TOO Float Delay 
ns 
(Note 4) 


t34 
All Outputs 
(Non-Test 
Valid ~ 
y) 
ns 
(Note 4) 


t35 
All Outputs 
(Non-Test 
F~t 
Delay) 
ns 
(Note 4) 


t36 
All Inputs (Non-Test 
ns 
(Note 4) 


t37 
All Inputs (Non-Test) 
Hold T 
ns 
(Note 4) 


NOTES: 
• 
1. Not 100% tested. guaranteed by desig~a~11 
Ion. 
2. Rise/Fall times are measured between O]V and2.0V. Rise/Fall times can be relaxed by 1 ns per 10 ns increase in TCK 
period. 
3. TCK period ~ ClK period. 
4. Parameter measured from TC{(. 
undary scan tested at Vcc greater than or equal to 3.3V. 


25 MHz 3.3V Intel486™ SX Microprocessor 
A.C. Characteristics 
vcc = 3.3V 
±0.3V; 
TCASE = O·C to + 85·C; Cl = 50 pF unless otherwise 
specified. 


Symbol 
Parameter 
Min 
Max 
Figure 
Unit 
Notes 


Frequency 
8 
25 
MHz 
1X Clock 


t1 
ClK 
Period 
40 
125 
12,," 
ns 


t1a 
ClK 
Period Stability 
0.1% 
~1 
ns 
Adjacent 
Clocks 


t2 
ClK 
High Time 
" 
14 
. 
12.1 
ns 
at2V 


t3 
ClK 
low 
Time 
(>:-.N' 
14 " 


12.1 
ns 
at 0.8V 


t4 
ClK 
Fall Time 
&~v 
" " 


12.1 
ns 
2VtoO.8V 


t5 
ClK 
Rise Time 
I 
~ 
4 
12.1 
ns 
0.8Vto2V 


t6 
A2-A31. 
PWT. PCD. BEO-~;cJ::~OW, 
o/\?i"-' 
~. 
20 
12.5 
ns 
W/R#, 
AOS#, 
lOCK#, 
BR 
LOA Vafld'delay 


t7 
A2-A31, 
PWT, PCD, BEO- 
,M/IO#. 
:~~\~ 
28 
12.6 
ns 
After Clock Edges(1) 
W/R#, 
ADS#. 
lOCK#, 
aREQ. 
f::U.~AFI 
'Delay 


te 
PCHK# 
Valid Delay 
.~ 
""",'-~'" 
3 
24 
12.4 
ns 


tea 
BLASH, 
PlOCK# 
Valid o~y ~~. 
3 
24 
12.5 
ns 


tg 
BLASH, 
PLOCK # FJI'ab:>elay 
28 
12.6 
ns 
After Clock Edges(1) 


t10 
00-031 
, DPO-O~~te 
Data Valid Delay 
3 
20 
12.5 
ns 


t11 
00-031, 
OPO-OP3'Write 
Data Float Delay 
28 
12.6 
ns 
After Clock Edges(1) 


t12 
EADS# 
Setup Time 
8 
12.2 
ns 


t13 
EADS# 
Hold Time 
3 
12.2 
ns 


3.3V Intel486TM SX MICROPROCESSOR 
intel~ 


25 MHz 3.3V Intel486TM SX Microprocessor 
A.C. Characteristics 


vcc 
= 3.3V 
±0.3V; 
TCASE = O·C to + 85·C; Cl = 50 pF unless otherwise 
specified. 
(Continued) 


Symbol 
Parameter 
Mln 
Max 
Figure 
Unit 
Notes 


t14 
KEN#, 
BS16#, 
BS8# 
Setup Time 
8 
12.2 
ns 


t15 
KEN#, 
BS16#, 
BS8# 
Hold Time 
3 
12.2 
ns 


t16 
RDY #, BRDY # Setup Time 
. 
8 
12.2 
ns 


t17 
RDY #, BRDY # Hold Time 
3 
12.2 
ns 


t18 
HOLD, AHOLD, 
BOFF # Setup Time 
10 
12.2 
ns 


t19 
HOLD, AHOLD, 
BOFF # Setup Time 
3 
.•. 


12.2 
ns 


t20 
RESET, FLUSH#, 
A20M#, 
NMI,INTR 
Setup rime 
10 
12.2 
ns 


t21 
RESET, FLUSH#, 
A20M#, 
NMI, INTR HoldTime 
~ 
12.2 
ns 


t22 
DO-D31, 
DPO-DP3, 
M-A31 
Read Setdp\l'ime 
••.•.~ El ••.. 
12.2 
ns 


t23 
DO-D31, 
DPO-DP3, 
M-A31 
Rea~ 
Time <)' f ~ ~ 
12.2 
ns 


t24 
TCK Frequency 
..... ~ '.. 
'"' 
25 
12.7 
MHz 
1X Clock 


t25 
TCK Period 
_~F 
..If.V ••.~.,. 
40 
12.7 
ns 
(Note 3) 


t26 
TCK High Time 
A.~- 
.•• A~ 
10 
12.7 
ns 
@2.0V 


t27 
TCKLowTime 
*'" 
.""7 Jf'Itt.~- 
10 
12.7 
ns 
@0.8V 


t28 
TCK Rise Time 
--<'-«~ 
4 
12.7 
ns 
(Note 2) 


t29 
TCKFallTime 
,~-~ 
4 
12.7 
ns 
(Note 2) 


t30 
TOI, TMS Setup Ti~ 
8 
12.7 
ns 
(Note 4) 


t31 
TOI, TMS Hold ~ 
10 
12.7 
ns 
(Note 4) 


t32 
TDO Valid Delay - 
3 
25 
12.7 
ns 
(Note 4) 


t33 
TDO Float Delay 
TBD 
12.7 
ns 
(Note 4) 


t34 
All Outputs 
(Non-Test 
Valid Delay) 
3 
25 
12.7 
ns 
(Note 4) 


t35 
All Outputs 
(Non-Test 
Float Delay) 
36 
12.7 
ns 
(Note 4) 


t36 
All Inputs (Non-Test) 
Setup Time 
8 
12.7 
ns 
(Note 4) 


t37 
All Inputs (Non-Test) 
Hold Time 
10 
12.7 
ns 
(Note 4) 


NOTES: 
1. Not 100% tested, 
guaranteed 
by design 
characterization. 


2. Rise/Fall 
times 
are measured 
between 
0.8V and 2.0V. Rise/Fall 
times 
can be relaxed 
by 1 ns per 10 ns increase 
in TCK 
period. 
3. TCK period 
~ ClK 
period. 
4. Parameter 
measured 
from TCK. Boundary 
scan tested 
at Vcc greater 
than or equal to 3.3V. 
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Intel486™ 
FAMILY OF MICROPROCESSORS 
LOW POWER VERSION 
DATA SHEET 


Low Power Intel486™ 
SX CPU/lnteI487™ 
SX MCP 
Low Power Intel486 DX CPU 


• 
168-Lead Pin Grid Array for Intel486™ 
SX 
Microprocessor 


• 
196-Lead Plastic Quad Flat Package for 
Intel486™ 
SX Microprocessor 


• 
169-Pln Grid Array Package for Intel487™ 
SX Math CoProcessor 


• 
High Performance 
Design 
- 
Intel486TM One Clock Instruction 
Core 
- 
16/20/25 
MHz Operation 
for 
Intel486™ 
SX 
- 
25 MHz Operation 
for Intel486™ 
DX 
- 
64 MByte/Sec 
Burst Bus 
- 
CHMOS IV Process Technology 
- 
Dynamic Bus Sizing for 8-,16- and 
32-Bit Buses 


• 
Complete 
32-Blt Architecture 
- 
Address 
and Data Buses 
-Registers 
- 
8-, 16- and 32-Blt Data Types 


• 
Multiprocessor 
Support 
- 
Multiprocessor 
Instructions 
- 
Cache Consistency 
Protocols 
- 
Support 
for Second Level Cache 


The data sheet describes both the Low Power Intel486 SX and the Low Power Intel486 DX microprocessors. 
The Intel487 SX Math CoProcessor will support the low power Intel486 SX microprocessor as an optional 
upgrade available through the retail channel. 
The Low Power Intel486 family microprocessors meet today's need for high performance portables. Their 
combination of special features like dynamic frequency scaling, lower minimum frequency, improved Vee 
operation and high integration contribute significantly to lower power dissipation and meet the needs of 
portable computing. 
The Low Power capability is achieved by operating the Intel486 microprocessor in the 2X mode. The frequency 
can be varied dynamically between maximum to minimum as needed. The frequency change does not affect 
contents of the registers and data integrity is maintained. Power dissipation is reduced significantly at 2 MHz 
where Ice is only 150 mA compared to 600 mA at 20 MHz. Low power versions are offered for both the 
Intel486 SX and the Intel486 DX microprocessors. 
The Low Power Intel486 microprocessors are 100-percent compatible with all versions of the Intel386™ 
microprocessor family, assuring compatibility with the more than $40 billion software base of MS-DOS, Win- 
dows, OS/2 and UNIX/System operating system applications. The Low Power Intel486 microprocessor inte- 
grates the same RISC-technology, one clock per instruction integer core, on-chip cache, and memory man- 
agement unit as the standard Intel486 microprocessor. 
The Intel487 SX Math CoProcessor provides optional math upgrade capability for the Intel486 SX micro- 
processor and supports low power operation; providing end-users increased floating point performance for 
more than 2100 software packages that were designed to use Intel Math CoProcessors. Note that the Intel 
OverDrive™ Processor does not work in systems based on the Low Power Intel486 CPU. 
·MS·DOS and Windows are trademarks of Microsoft Corp. 


··OS/2 is a trademark of International Business Mllchines. 
···UNIX is a trademark of UNIX Systems Laboratories. 


• 
Lower Power Dissipation 
- 
Dynamic Frequency 
Scalability 
-Icc(max) 
Reduced to 150 mA at 2 MHz 
-Improved 
Vcc Rating (± 10%) 


• 
Binary Compatible 
with Large Software 
Base 
- 
MS-DOS·, OS/2··, 
Windows· 
-UNIX··· 
System V/386 
-IRMX@, 
IRMK Kernels 


• 
High Integration 
Enables On-Chip 
- 
8 KByte Code and Data Cache 
- 
Floating 
Point Unit on the Intel486 DX 
CPU and Intel487™ 
SX Math 
CoProcessor 
- 
Paged, Virtual Memory Management 


• 
Easy to Use 
- 
Built-In self Test 
- 
Hardware 
Debugging 
Support 
-Intel 
Software 
Support 
- 
Extensive 
Third Party Software 
Support 


• 
168-Lead Pin Grid Array Package for 
Intel486 DX Microprocessor 
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This document 
should 
be used In conjunction 
with 
the 
Intel486™ 
OX Microprocessor 
data 
sheet (order number 240440-004, June 1991) and 
the 
Intel486™ 
SX Microprocessor 
data sheet 
(order number 240950-002, October 1991). 


The Low Power Intel486 microprocessor brings In- 
tel486 technology and performance to the portable 
computer 
market. The 
low 
power 
capability 
is 
achieved by a frequency scalability feature during 
normal operation. The operating frequency can be 
brought down dynamically resulting in lower power 
supply current (led. This results in minimal power 
dissipation which ensures a longer battery life. 


The Low Power Intel486 microprocessor integrates 
the same RISC-technology, one clock per instruc- 
tion integer core, on-chip cache, and memory man- 
agement unit as the standard Intel486 microproces- 
sor. 


The Low Power Intel486 microprocessor has the fol- 
lowing special features: 
• Frequency Scalability-This 
is achieved by op- 
erating the Intel486 microprocessor in the 2X 
clock mode. The frequency can be varied dynam- 
ically from maximum back to minimum or vice 
versa. The frequency change does not affect' the 
register content of the CPU, thus data integrity is 
maintained. 


• Lower 
Minimum Frequency-The 
Low Power 
Intel486 microprocessor can be operated at a 
minimum frequency of 2 MHz, at which lec(max) 
is only 150 mA, compared to an lec(max) of 600 
mA at 20 MHz operation. The power dissipation is 
thus drastically reduced ensuring a longer battery 
life. 


• Improved 
Vcc Operation-The 
Low Power In- 
tel486 microprocessor has an improved Vcc rat- 
ing of ± 10%. Again this feature makes it ex- 
tremely attractive to portable battery powered ap- 
plications. 


The above three features ensure power savings for 
portable computer systems resulting in prolonged 
battery life. 


Besides the above special features, the Low Power 
Intel486 microprocessor has an identical feature set 
to the standard Intel486 CPU. This includes: 
• Binary Compatibility-The 
Low Power Intel486 
CPU is binary compatible with the 8086, 8088, 
80186,80286, i386™ SX, i386™ OX, Intel486TM 
SX and Intel486™ OX CPUs. 


• Full 32-Bit Integer Processor-The 
Low Power 
Intel486 CPU performs a complete set of arith- 
metic and logical operations on 8-, 16-, and 32-bit 
data types using a full-width ALU and eight gener- 
al-purpose registers. 
• Separate 
32-Blt Address 
and Data Paths- 


Four gigabytes of physical memory can be ad- 
dressed directly. 


• Single-Cycle 
Execution-Many 
instructions ex- 
ecute in a single clock cycle. 


• On-Chip Floating Point Unit-This 
is available 
on the Intel486 OX CPU. The 32-, 64-, and 80-bit 
formats specified in IEEE standard 754 are sup- 
ported. The unit is binary compatible with the 
8087,80287, i387™, i387TMSX, and Intel487™ 
math coprocessors and the Intel486TMCPU. 


• On-Chip Memory Management Unlt-Address- 


management 
and 
memory-space 
protection 
mechanisms maintain the integrity of memory. 
This is necessary in multitasking and virtual-mem- 
ory environments, like those implemented by the 
UNIX and OS/2 operating systems. Both memory 
segmentation and paging are supported. 
• On-Chip Cache with Cache Consistency 
Sup- 
port-The 
internal write-through cache can hold 
8 KBytes of data or instructions. Cache hits are 
as fast as read accesses to a processor register. 
Bus activity is tracked to detect alterations in the 
memory which internal cache represents. The in- 
ternal cache can be invalidated or flushed, so 
that an external cache controller can maintain 
cache consistency in multi-processor environ- 
ments. 
• External Cache Control-Write-back 
and flush 
controls over an external cache are provided so 
that the processor can maintain cache consisten- 
cy in multi-processor environments. 


• Instruction 
Plpellnlng- 
The fetching, decoding, 
execution and address translation of instructions 
are overlaped within the Low Power Intel486 mi- 
croprocessor. This results in a continuous execu- 
tion rate of one clock cycle per instruction, for 
most instructions. 


• Burst Cycles-Burst 
transfers allow a new dou- 
bleword to be read from memory each clock cy- 
cle. With this capability the internal cache and in- 
struction prefetch buffer can be filled very rapidly. 


• Write 
Buffers-The 
processor contains write 
buffers to enhance the performance of consecu- 
tive writes to memory. The Low Power Intel486 
CPU can continue operations internally after a 
write, without waiting for the write to be executed 
on the external bus. 


• Bus Backoff-If 
another bus master needs con- 
trol of the bus during a Low Power Intel486 micro- 
processor initiated bus cycle, the Low Power 


Intel486 microprocessor will float its bus signals. 
then restart its cycle when the bus becomes avail- 
able again. 


• lnatruc:tlon R••••• 
-Programs 
can continue ex- 


ecution following an exception generated by an 


unsuccessful attempt to access memory. This fea- 
ture is important for supporting demand-paged vir- 
tual memory applications. 


• Dynamic Bus Sizing-External 
controllers can 


dynamically alter the effective width of the data 
bus. Bus widths of 8, 16 or 32 bits can be used. 
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BS16. 
vss 
vss 
PeO 
vss 
PCHK. 
ADS. 


EAOS. 
BOff. 
BE3. 
Vss 
Vss 
Vss 
VSS 
NC 
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Figure 1-5. Low Power 
Intel486™ 
SX CPU 196·Lead 
PQFP Pinout 
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Address 
Data 
Control 
N/C 
Vcc 
Vss 


A2 
014 
Do 
P1 
A20M# 
015 
A10 
B7 
A7 
A3 
R15 
01 
N2 
A05# 
517 
A12 
B9 
A9 
A4 
516 
02 
N1 
AHOLO 
A17 
A13 
B11 
A11 
A5 
012 
03 
H2 
BEO# 
K15 
A14 
C4 
B3 
A6 
515 
04 
M3 
BE1# 
J16 
B10 
C5 
B4 
A7 
013 
05 
J2 
BEU 
J15 
B12 
E2 
B5 
A8 
R13 
06 
L2 
BE3# 
F17 
B13 
E16 
E1 
A9 
011 
07 
L3 
BLA5T# 
R16 
B14 
G2 
E17 
AlO 
513 
08 
F2 
BOFF# 
017 
B16 
G16 
G1 
A11 
R12 
09 
01 
BROY# 
H15 
C10 
H16 
G17 
A12 
57 
010 
E3 
BREO 
015 
C11 
J1 
H1 
A13 
010 
011 
C1 
B58# 
016 
C12 
K2 
H17 
A14 
55 
012 
G3 
B516# 
C17 
C13 
K16 
K1 
A15 
R7 
013 
02 
CLK2 
C3 
G15 
L16 
K17 
A16 
09 
014 
K3 
CLK5EL 
A3 
R17 
M2 
L1 
A1] 
03 
015 
F3 
O/C# 
M15 
54 
M16 
L17 
A18 
R5 
016 
J3 
OPO 
N3 
P16 
M1 
A19 
04 
01] 
03 
OP1 
F1 
R3 
M17 
A20 
08 
018 
C2 
OP2 
H3 
R6 
P17 
A21 
05 
019 
B1 
OP3 
A5 
R8 
02 
A22 
07 
020 
A1 
EA05# 
B17 
R9 
R4 
A23 
53 
021 
B2 
FERR# 
C14 
R10 
56 
A24 
06 
022 
A2 
FLU5H# 
C15 
R11 
58 
A25 
R2 
023 
A4 
HLOA 
P15 
R14 
59 
A26 
52 
024 
A6 
HOLD 
E15 
510 
A27 
51 
025 
B6 
IGNNE# 
A15 
511 
A28 
R1 
.026 
C7 
INTR 
A16 
512 
A29 
P2 
027 
C6 
KEN# 
F15 
514 
A30 
P3 
028 
C8 
LOCK# 
N15 
A31 
01 
029 
A8 
M/IO# 
N16 
030 
C9 
NMI 
B15 
031 
B8 
PCO 
J17 
PCHK# 
017 
PWT 
L15 
PLOCK # 
016 
ROY# 
F16 
RE5ET 
C16 
W/R# 
N17 


intel~ 


Address 
Data 
Control 
HIC 
Vcc 
Vss 


A2 
014 
Do 
P1 
A20M# 
015 
A10 
87 
A7 
A3 
R15 
01 
N2 
A05# 
517 
A12 
89 
A9 
~ 
516 
02 
N1 
AHOLO 
A17 
A13 
811 
A11 
A5 
012 
03 
H2 
8EO# 
K15 
A14 
C4 
83 


A{; 
515 
04 
M3 
8E1# 
J16 
810 
C5 
84 


A7 
013 
05 
J2 
8EU 
J15 
812 
E2 
85 
A8 
R13 
06 
L2 
8E3# 
F17 
813 
E16 
E1 
A9 
011 
07 
L3 
8LA5T# 
R16 
814 
G2 
E17 
A10 
513 
08 
F2 
80FF# 
017 
815 
G16 
G1 


A11 
R12 
09 
01 
8ROY# 
H15 
816 
H16 
G17 
A12 
57 
010 
E3 
8REO 
015 
C10 
J1 
H1 


A13 
010 
011 
C1 
858# 
016 
C11 
K2 
H17 
A14 
55 
012 
G3 
8516# 
C17 
C12 
K16 
K1 


A15 
R7 
013 
02 
CLK2 
C3 
C13 
L16 
K17 
A16 
09 
014 
K3 
CLK5EL 
A3 
C14 
M2 
L1 
A17 
03 
015 
F3 
O/C# 
M15 
G15 
M16 
L17 
A18 
R5 
016 
J3 
OPO 
N3 
R17 
P16 
M1 


A19 
04 
017 
03 
OP1 
F1 
54 
R3 
M17 
A20 
08 
018 
C2 
OP2 
H3 
R6 
P17 
A21 
05 
019 
81 
OP3 
AS 
R8 
02 


A22 
07 
020 
A1 
EA05# 
817 
R9 
R4 
A23 
53 
021 
82 
FLU5H# 
C15 
R10 
56 
A24 
06 
022 
A2 
HLOA 
P15 
R11 
58 
A25 
R2 
023 
A4 
HOLD 
E15 
R14 
59 
A26 
52 
024 
A6 
INTR 
A16 
510 
A27 
51 
025 
86 
KEN# 
F15 
511 
A28 
R1 
026 
C7 
LOCK # 
N15 
512 
A29 
P2 
027 
C6 
M/IO# 
N16 
514 
A30 
P3 
028 
C8 
NMI 
815 
A31 
01 
029 
A8 
PCO 
J17 
030 
C9 
PCHK# 
017 
031 
88 
PWT 
L15 
PLOCK # 
016 
ROY# 
F16 
RE5ET 
C16 
W/R# 
N17 
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intel· 


Address 
Data 
Control 
N/C 
Vcc 
Vss 


A2 
146 
Do 
17 
A20M# 
104 
15 
6 
1 
A3 
150 
01 
18 
AOS# 
145 
34 
19 
11 
A4 
152 
02 
20 
AHOLD 
129 
52 
24 
21 
A5 
154 
03 
23 
BEO# 
117 
56 
28 
22 
A6 
158 
04 
25 
BE1# 
116 
60 
36 
33 
A7 
159 
05 
26 
BE2# 
115 
64 
49 
40 
A8 
161 
06 
27 
BE3# 
113 
68 
54 
50 
A9 
163 
07 
29 
BLAST# 
144 
72 
62 
58 
A10 
165 
08 
31 
BOFF# 
137 
73 
70 
66 
A11 
172 
09 
32 
BROY# 
138 
75 
84 
86 
A12 
174 
010 
35 
BREQ 
118 
76 
93 
95 
A13 
176 
011 
37 
BS8# 
135 
77 
98 
96 
A14 
178 
012 
38 
BS16# 
136 
78 
107 
99 
A15 
180 
013 
39 
CLK2 
123 
79 
112 
109 
A16 
181 
014 
41 
CLKSEL 
127 
81 
119 
114 
A17 
183 
015 
42 
O/C# 
110 
82 
125 
121 
A18 
189 
016 
44 
OPO 
16 
83 
131 
126 
A19 
191 
017 
45 
DP1 
30 
85 
147 
141 
A20 
193 
018 
46 
OP2 
43 
87 
164 
148 
A21 
2 
019 
47 
OP3 
57 
88 
170 
167 
A22 
3 
020 
48 
EAOS# 
105 
89 
175 
168 
A23 
4 
021 
51 
FLUSH# 
102 
90 
179 
177 
A24 
5 
022 
53 
HLOA 
122 
91 
184 
182 
A25 
7 
023 
55 
HOLD 
130 
92 
196 
194 
A26 
8 
024 
59 
INTR 
101 
94 
A27 
9 
025 
61 
KEN# 
132 
97 
A28 
10 
026 
63 
LOCK# 
142 
124 
A29 
12 
027 
65 
M/IO# 
111 
134 
A30 
13 
028 
67 
NMI 
100 
140 
A31 
14 
029 
69 
PCO 
106 
149 
030 
71 
PCHK# 
139 
151 
031 
74 
PWT 
108 
153 
PLOCK# 
143 
155 
ROY# 
133 
157 
RESET 
103 
160 
TCK 
128 
162 
TOI 
185 
166 
TOO 
80 
166 
TMS 
187 
169 
UP# 
156 
171 
W/R# 
120 
173 
186 
188 
190 
192 
195 


Symbol 
Type 
Name and Function 


CLK2 
I 
CLK2 provides 
the fundamental 
timing for the Low Power Intel486 
microprocessor. 


This is twice the internal frequency 
of the CPU. 


CLKSEL 
I 
Clock Select pin selects the 2X mode required for the Low Power Intel486 CPU. A well 
defined 
pulse on this pin establishes 
the phase relationship 
of the 2X clock. With the 


exception 
of a pulse during cold reset, this pin should be driven low all the time and 


must be free of spikes or glitches. 


ADDRESS 
BUS 


A31-M 
1/0- 
A31-A2 
are the address 
lines of the microprocessor. 
A31-A2, 
together 
with the byte 
A3-A2 
0 
enables 
BEO#-BE3#, 
define the physical area of memory or input/output 
space 


accessed. 
Address 
lines A31-A4 
are used to drive addresses 
into the microprocessor 
to perform 
cache line invalidations. 
Input signals must meet setup and hold times t22 


and t23. A31-A2 
are not driven during bus or address 
hold. 


BEO-3# 
0 
The byte enable signals indicate 
active bytes during read and write cycles. During the 
first cycle of a cache fill, the external 
system should assume that all byte enables 
are 


active. BE3# 
applies to 024-031, 
BE2# 
applies to 016-023, 
BE1 # applies to 08- 


015 and BEO# applies to 00-07. 
BEO#-BE3# 
are active LOW and are not driven 


during bus hold. 


DATA BUS 


031-00 
I/O 
These are the data lines for the Low Power Intel486 
micrprocessor. 
Lines 00-07 


define the least significant 
byte of the data bus while lines 024-031 
define the most 


significant 
byte of the data bus. These signals must meet setup and hold times t22 and 


t23 for proper operation 
on reads. These pins are driven during the second and 


subsequent 
clocks of write cycles. 


DATA 
PARITY 


OPO-OP3 
I/O 
There is one data parity pin for each byte of the data bus. Data parity is generated 
on 


all write data cycles with the same timing as the data driven by the Low Power Intel486 
microprocessor. 
Even parity information 
must be driven back into the microprocessor 


on the data parity pins with the same timing as read information 
to insure that the 


correct 
parity check status is indicated 
by the Low Power Intel486 
microprocessor. 
The 


signals read on these pins do not affect program 
execution. 
Input signals must meet 


setup and hold times t22 and t23. OPO-OP3 
should be connected 
to VCC through 
a 


pullup resistor in systems which do not use parity. OPO-OP3 
are active HIGH and are 


driven during the second and subsequent 
clocks of write cycles. 


PCHK# 
0 
Parity Status is driven on the PCHK # pin the clock after ready for read operations. 
The parity status is for data sampled 
at the end of the previous clock. A parity error is 


indicated 
by PCHK # being LOW. Parity status is only checked 
for enabled 
bytes as 
indicated 
by the byte enable and bus size signals. PCHK# 
is valid only in the clock 


immediately 
after read data is returned 
to the microprocessor. 
At all other times 


PCHK # is inactive (HIGH). PCHK # is never floated. 


,. 
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Symbol 
Type 
Name and Function 


BUS CYCLE DEFINITION 


M/lO# 
0 
The memory/input-output, 
data/control 
and write/read 
lines are the primary 
D/C# 
0 
bus definitions 
signals. These signals are d~ven valid as the ADS # signal is asserted. 
W/R# 
0 


MIIO# 
D/C# 
W/R# 
Bus Cycle Initiated 


0 
0 
0 
Interrupt 
Acknowledge 
0 
0 
1 
Halt/Special 
Cycle 
0 
1 
0 
I/O Read 
0 
1 
1 
I/O Write 
1 
0 
0 
Code Read 
1 
0 
1 
Reserved 
1 
1 
0 
Memory 
Read 
1 
1 
1 
Memory Write 


The bus definition 
signals are not driven during bus hold and follow the timing of the 
address bus. 


LOCK # 
0 
The bus lock pin indicates 
that the current bus cycle is locked. The Low Power Intel486 
microprocessor 
will not allow a biJs hold when LOCK # is asserted 
(but address 
holds 
are allowed). 
LOCK# 
goes active in the first clock of the first locked bus cycle and goes 
inactive after the last clock of the last locked bus cycle. The last locked cycle ends 
when ready is returned. 
LOCK # is active LOW and is not driven during bus hold. Locked 
read cycles will not be transformed 
into cache fill cycles if KEN # is returned 
active. 


PLOCK# 
0 
The pseudo-lock 
# pin indicates 
that the current bus transaction 
requires more than 
one bus cycle to complete. 
Examples 
of such operations 
are segment 
table reads (64 
bits), cache line fills (128 bits). The Low Power Intel486 
microprocessor 
will drive 
PLOCK # active until the addresses 
for the last bus cycle of the transaction 
have been 
driven regardless 
of whether 
ROY # or BRDY # have been returned. 
Normally 
PLOCK # and BLAST # are inverse of each other. PLOCK # is a function 
of 
the BS8 #. BS 16 # and KEN # inputs. PLOCK # should be sampled 
only if the clock 
ready is returned. 
PLOCK # is active LOW and is not driven during bus hold. 


BUS CONTROL 


ADS# 
0 
The address 
status output indicates 
that a valid bus cycle definition 
and address are 
available 
on the cycle definition 
lines and address 
bus. ADS# 
is driven active in the 
same clock as the addresses 
are driven. ADS # is active LOW and is not driven during 
bus hold. 


RDY# 
I 
The non-burst 
Ready input indicates 
that the current bus cycle is complete. 
ROY # 
indicates 
that the external 
system has presented 
valid data in response 
to a read or that 
the external 
system has accepted 
data in response 
to a write. ROY # is ignored when 
the bus is idle and at the end of the first clock in a bus cycle. 
ROY # is active during address 
hold. Data can be returned 
to the processor 
while 
AHOLD 
is active. ROY # is active LOW and is provided 
with a small pullup resistor. 
ROY # must satisfy the setup and hold times t16 and t17 for proper chip operation. 


Symbol 
Type 
Name and Function 


, 


BURST CONTROL 


BROY# 
I 
The burst 
ready 
input performs 
the same function 
during a burst cycle that ROY # 
performs 
during a non-burst 
cycle. BROY # indicates 
that the external 
system has 
presented 
valid data in response 
to a read for that the external 
system has accepted 
data in response 
to a write. BROY # is ignored when the bus is idle and at the end of the 
first clock in a bus cycle. BROY # is sampled 
in the second 
and subsequent 
clocks of a 
burst cycle. The data presented 
on the data bus will be strobed 
into the microprocessor 
when BRDY # is sampled 
active. If ROY # is returned 
simultaneously 
with BRDY #, 
BRDY # is ignored and the burst cycle is prematurely 
aborted. 
BRDY # is active LOW 
and is provided with a small pullup resistor. 
BRDY # must satisfy the setup and hold 
times t16 and tH· 


BLAST # 
0 
The burst 
last signal indicates 
that the next time BRDY # is returned 
the burst bus cycle 
is complete. 
BLAST # is active for both burst and non-burst 
bus cycles. BLAST # is 
active LOW and is not driven during bus hold. 


INTERRUPTS 


RESET 
I 
The reset 
input forces the Low Power Intel486 
microprocessor 
to begin execution 
at a 
known state. The microprocessor 
cannot 
begin execution 
of instructions 
until at least 
1 ms after Vcc and CLK2 have reached 
their proper D.C. and A.C. specifications. 
The RESET pin should remain active during this time to insure proper microprocessor 
operation. 
However, 
for warm boot-ups 
RESET should remain active for at least 30 
CLK2 periods. RESET is active HIGH. RESET is asynchronous 
but must meet setup and 
hold times t20 and t21 for recognition 
in any specific 
clock. 


INTR 
I 
The maskable 
interrupt 
indicates 
that an external 
interrupt 
has been generated. 
If the 
internal interrupt 
flag is set in EFLAGS, active interrupt 
processing 
will be initiated. 
The 
Low Power Intel486 
microprocessor 
will generate 
two locked interrupt 
acknowledge 
bus 
cycles in response 
to the INTR pin going active. INTR must remain active until the 
interrupt 
acknowledges 
have been performed 
to assure that the interrupt 
is recognized. 
INTR is active HIGH and is not provided with an internal pulldown 
resistor. 
INTR is 
asynchronous, 
but must meet setup and hold times t20 and t21 for recognition 
in any 
specific 
clock. 


NMI 
I 
The non-maskable 
interrupt 
request signal indicates 
that an external 
non-maskable 
interrupt 
has been generated. 
NMI is rising edge sensitive. 
NMI must be held LOW for at 
least eight CLK2 periods before this rising edge. NMI is not provided with an internal 
pulldown 
resistor. 
NMI is asynchronous, 
but must meet setup and hold times t20 and t21 
for recognition 
in any specific 
clock. 


BUS ARBITRATION 


BREa 
0 
The internal 
cycle 
pending 
signal indicates 
that the Low Power Intel486 
microprocessor 
has internally 
generated 
a bus request. 
BREa 
is generated 
whether 
or 
not the Low Power Intel486 microprocessor 
is driving the bus. BREa 
is active high and 
is never floated. 


HOLD 
I 
The bus hold request 
allows another 
bus master complete 
control 
of the Low Power 
Intel486 
microprocessor 
bus. In response 
to HOLD going active the Low Power Intel486 
microprocessor 
will float most of its output and input! output pins HLDA will be asserted 
after completing 
the current bus cycle, burst cycle or sequence 
of locked cycles. The 
Low Power Intel486 
microprocessor 
will remain in this state until HOLD is deasserted. 
HOLD is active high and is not provided with an internal pulldown 
resistor. 
HOLD must 
satisfy setup and hold times t18 and t19 for proper operation. 


• 


Symbol 
Type 
Name and Function 


BUS ARBITRATION 
(Continued) 


HLDA 
0 
Hold acknowledge 
goes active in response 
to a hold request presented 
on the 
HOLD pin. HDLA indicates 
that the Low Power Intel486 
microprocessor 
has given 
the bus to another 
local bus master. HLDA is driven active in the same clock that the 
Low Power Intel486 microprocessor 
floats its bus. HLDA is driven inactive when 
leaving bus hold. HLDA is active HIGH and remains driven during bus hold. 


BOFF# 
I 
The backoff 
input forces the Low Power Intel486 microprocessor 
to float its bus in 
the next clock. The microprocessor 
will float all pins normally floated 
during hold but 
HLDA will not be asserted 
in response 
to BOFF #. BOFF # has higher priority than 
RDY # or BRDY #; if both are returned 
in the same clock, BOFF # takes effect. The 
microprocessor 
remains in bus hold until BOFF # is negated. 
If a bus cycle was in 


progress 
when BOFF # was asserted 
the cycle will be restarted. 
BOFF # is active 
LOW and must meet setup and hold times t18 and t19 for proper operation. 


CACH INVALIDATION 


AHOLD 
I 
The address 
hold request allows another 
bus master access to the Low Power 
Intel486 
microprocessor's 
address 
bus for a cache invalidation 
cycle. The Low 
Power Intel486 microprocessor 
will stop driving its address 
bus in the clock following 


AHOLD going active. Only the address bus will be floated 
during address 
hold, the 
remainder 
of the bus will remain active. AHOLD 
is active HIGH and is provided with a 
small internal 
pulldown 
resistor. 
For proper operation 
AHOLD 
must meet setup and 
hold times t18 and t19. 


EADS# 
I 
This signal indicates 
that a valid external 
address 
has been driven onto the Low 
Power Intel486 
microprocessor 
address 
pins. This address will be used to perform 
an internal cache invalidation 
cycle. EADS# 
is active LOW and is provided 
with an 
internal pullup resistor. 
EADS# 
must satisfy setup and hold times t12 and t13 for 
proper operation. 


CACHE CONTROL 


KEN# 
I 
The cache enable pin is used to determine 
whether 
the current cycle is cacheable. 


When the Low Power Intel486 
microprocessor 
generates 
a cycle that can be cached 
and KEN # is active, the cycle will become 
a cache line fill cycle. Returning 
KEN # 


active one clock before ready during the last read in the cache line fill will cause the 
line to be placed in the on-chip cache. KEN # is active LOW and is provided 
with a 
small internal pull-up resistor. 
KEN# 
must satisfy setup and hold times t14 and t15 for 


proper operation. 


FLUSH # 
I 
The cache flush input forces the Low Power Intel486 microprocessor 
to flush its 
entire internal cache. FLUSH # is active low and need only be asserted 
for one 
clock. FLUSH# 
is asynchronous 
but setup and hold times t20 and t21 must be met 


for recognition 
in any specific 
clock. FLUSH # being sampled 
low in the clock before 
the falling edge of RESET causes the Low Power Intel486 
microprocessor 
to enter 


the tri-state test mode. 


PAGE CACHEABlllTY 


PWT 
0 
The page write-through 
and page cache disable 
pins reflect the state of the page 
PCD 
0 
attribute 
bits, PWT and PCD, in the page table entry, page directory 
entry or control 


register 3 (CR3), when paging is enabled. 
If paging is disabled, 
the CPU ignores the 
PCD and PWT bits and assumes they are zero for the purpose of caching 
and driving 
PCD and PWT pins. PWT and PCD have the same timing as the cycle definition 
pins 


(M/IO#, 
D/C#, 
and W/R#). 
PWT and PCD are active HIGH and are not driven 
during bus hold. PCD is masked by the cache disable bit (CD) in Control 
Register O. 


Symbol 
Type 
Name and Function 


NUMERIC 
ER~OR 
REPORTING 
(Present on Intel486 
OX CPU Only) 


FERR# 
0 
The floating 
point 
error 
pin is driven active when a floating 
point error occurs. 
FERR # 
is similar to the ERROR # pin on the i387 Math CoProcessor. 
FERR # is included 
for 
compatibility 
with systems 
using DOS type floating 
point error reporting. 
FERR # is 
active LOW, and is not floated 
during bus hold. 


IGNNE# 
I 
When the Ignore 
numeric 
error 
pin is asserted, 
the Low Power Intel486 
microprocessor 
will ignore a numeric error and continue 
executing 
non-control 
floating 
point instructions. 
When IGNNE# 
is deasserted 
the Low Power Intel486 
microprocessor 
will freeze on a non-control 
floating 
point instruction, 
if a previous floating 
point 


instruction 
caused an error. IGNNE # has no effect when the NE bit in control 
register 0 
is set. IGNNE # is active LOW and is provided with a small internal pullup resistor. 
IGNNE # is asynchronous 
but setup and hold times t20 and t21 must be met to insure 
recognition 
on any specific 
clock. 


BUS SIZE CONTROL 


BS16# 
I 
The bus size 16 and bus size 8 pins (bus sizing pins) cause the Low Power Intel486 
BS8# 
I 
microprocessor 
to run multiple 
bus cycles to complete 
a request from devices that 
cannot 
provide or accept 32 bits of data in a single cycle. The bus sizing pins are 
sampled 
every clock. The state of these pins in the clock before ready is used by the 
Low Power Intel486 
microprocessor 
to determine 
the bus size. These signals are active 
LOW and are provided 
with internal pullup resistors. 
These inputs must satisfy setup and 
hold times t14 and t15 for proper operation. 


ADDRESS 
MASK 


A20M# 
I 
When the address 
bit 20 mask pin is asserted, 
the Low Power Intel486 
microprocessor 
masks physical address 
bit 20 (A20) before performing 
a lookup to the internal cache or 


,driving a memory cycle on the bus. A20M# 
emulates 
the address wraparound 
at one 
Mbyte which occurs on the 8086. A20M # is active LOW and should be asserted 
only 
when the processor 
is in real mode. This pin is asynchronous 
but should meet setup and 
hold times t20 and t21 for recognition 
in any specific 
clock. For proper operation, 
A20M# 
should be sampled 
high at the falling edge of RESET. 


PERFORMANCE 
UPGRADE 
SUPPORT 
(Inte1486 SX CPU PQFP Version 
Only) 


UP# 
I 
Upgrade Present 
forces the Intel486 SX CPU to tri-state 
all its outputs 
and enter the 
power down mode. UP# 
is active low and is sampled 
at all times, including 
after power- 
up and during reset. 


TEST ACCESS 
PORT (Inte1486 SX CPU PQFP Version 
Only) 


TCK 
I 
Test Clock 
is an input to the Intel486TM CPU and provides 
the clocking 
function 
required 
by the JT AG Boundary 
scan feature. TCK is used to clock state information 
and data into 
component 
on the rising edge of TCK on TMS and TOI, respectively. 
Data is clocked 
out 
of the part on the falling edge of TCK and TOO. 


TOI 
I 
Test Data Input 
is the serial input used to shift JTAG instructions 
and data into 
component. 
TOI is sampled 
on the rising edge of TCK, during the SHIFT-IR 
and SHIFT- 
DR TAP controller 
states. During all other tap controller 
states, TOI is a "don't 
care". 


TOO 
0 
Test Data Output 
is the serial output used to shift JTAG instructions 
and data out of the 
component. 
TOO is driven on the falling edge of TCK during the SHIFT-IR 
and SHIFT- 
DR TAP controller 
states. At all other times TOO is driven to the high impedance 
state. 


TMS 
I 
Test Mode Select 
is decoded 
by the JT AG TAP (Tap Access 
Port) to select the 
operation 
of the test logic. TMS is sampled 
on the rising edge of TCK. To guarantee 
deterministic 
behavior 
of the TAP controller 
TMS is provided 
with an internal 
pull-up 
resistor. 
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Table 1-1. lists all the output pins, indicating their 
active level, and when they are floated. 


Name 
Active 
When Floated 
Level 


BREQ 
HIGH 
HLDA 
HIGH 
BEO#-BE3# 
LOW 
Bus Hold 
PWT, PCD 
HIGH 
Bus Hold 
W/R#, 
D/C#, 
HIGH/LOW 
Bus Hold 
M/IO# 
LOCK# 
LOW 
Bus Hold 
PLOCK# 
LOW 
Bus Hold 
ADS# 
LOW 
Bus Hold 
BLAST# 
LOW 
Bus Hold 
PCHK# 
LOW 
FERR#* 
LOW 
A2-A3 
HIGH 
Bus, 
Address Hold 


Table 1-2 lists all input pins, indicating their active 
level, and whether they are synchronous or asyn- 
chronous inputs. 


Name 
Active 
Synchronous! 
Level 
Asynchronous 


CLK2 
CLKSEL 
RESET 
HIGH 
Asynchronous 
HOLD 
HIGH 
Synchronous 
AHOLD 
HIGH 
Synchronous 
EADS# 
LOW 
Synchronous 
BOFF# 
LOW 
Synchronous 
FLUSH# 
LOW 
Asynchronous 
A20M# 
LOW 
Asynchronous 
BS16#, BS8# 
LOW 
Synchronous 
KEN# 
LOW 
Synchronous 
RDY# 
LOW 
Synchronous 
BRDY# 
LOW 
Synchronous 
INTR 
HIGH 
Asynchronous 
NMI 
HIGH 
Asynchronous 
IGNNE#*(1) 
LOW 
Asynchronous 
UP#(2) 
LOW 
Asynchronous 


NOTES: 
1. The IGNNE# 
pin is present 
on the Inte1486 OX CPU and 
Intel487 
SX MCP only. 


2. The 
UP# 
pin is present 
on the 
Intel486 
SX CPU PQFP 
package 
only. 
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Table 1-3 lists all the input! output pins, indicating 
their active level and when they are floated. 


Name 
Active 
When Floated 
Level 


DO-D31 
HIGH 
Bus Hold 
DPO-DP3 
HIGH 
Bus Hold 
A4-A31 
HIGH 
Bus, Address Hold 


Name 
Input or 
Sampled! 
Output 
Driven On 


TCK 
Input 
N/A 


TOI 
Input 
Rising Edge of TCK 


TOO 
Output 
Falling Edge of TCK 


TMS 
Input 
Rising Edge of TCK 


1486SX Microprocessor! 
Component 
Revision 
1487SX Math CoProcessor 
Stepping Name 
10 
10 


AO 
04 
20 


NOTE: 
Table 
1-5 shows 
the Component 
10 number 
and 
Revision 
10 number 
for the A-O stepping 
of the 
Intel486 
SX Micro- 


procesor 
and Intel487 
SX Math CoProcessor. 
When 
an In- 
tel487 
SX Math CoProcessor 
is installed 
in the system, 
the 
Component 
10 and Revision 
10 is provided 
by the Intel487 
SX Math 
CoProcessor 
and not the 
Intel486 
SX Microproc- 
essor. 
The 
Component 
10 and 
Revision 
10 read 
by the 
BIOS/software 
may change 
when 
a Performance 
Upgrade 
Component, 
such as the Intel487 
SX Math CoProcessor, 
is 
installed 
in an Intel486 
SX Microprocessor 
based 
system. 


1.6 Signal Description 


With the exception of CLK2 and CLKSEL, all signals 
follow the same definition as the Intel486 microproc- 
essor. The A.C. timing parameters for all of these 
signals are given in Table 2-7. 


CLK2 provides the fundamental timing for the Low 
Power Intel486 microprocessor. It is divided by two 
internally to generate the internal processor clock 
used for instruction execution. The internal clock is 
comprised of two phases, "phase one" and "phase 
two". Each CLK2 period is a phase of the internal 
clock. Figure 1-5 illustrates the relationship. If de- 


INTERNAL 
InteI486Tlo·CPU 
ClK 
(half the freq. 
of ClK2) 


sired, the phase of the internal processor clock can 
be synchronized to a known phase by ensuring the 
pulse on the CLKSEL pin meets the applicable tim- 
ings during cold boot (power-up reset). 


All set-up, hold, float-delay and valid delay timings 
are referenced to the phase one of the clock. 


The internal processor clock (CLK) is similar to the 
clock signal of the standard Intel486 microproces- 
sor. All I/O signals get sampled on the rising edge of 
this signal, Le. the rising edge of phase one. Thus it 
is important to synchronize the external circuitry with 
the phase one of CLK2. 


Clock Select pin selects the 2X mode required for 
the Low Power Intel486 CPU. This pin should be 
driven low after power-up and during the entire oper- 
ation of the CPU. However, a well defined pulse is 
required on CLKSEL pin during cold boot (power-up 
reset) to establish the phase relationship of the 2X 
clock. The reset pulse width during cold reset should 
be at least 1 ms. As shown in Figure 1-6, the pulse 
on CLKSEL should be asserted by the end of reset 
(approximately 0.9 ms after driving reset active) and 
at least 30 CLK2 periods before the falling edge of 
reset. 


Figure 1-7 shows the detailed timing definition of this 
pulse. The pulse on CLKSEL pin is only required dur- 
ing power-up reset. During all other times including 
warm resets the CLKSEL pin should be driven low 
and must be free of spikes or glitches. After the 
power-up reset, the system must track the phase of 
CLK2 at all times including during warm resets so 
that the input/output signals can be sampled at the 
appropriate clock edge. The phase relationship is 
described in the next section. 
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The Low Power Intel486 microprocessor is architec- 
turally similar to the Intel486 CPU. Thus all bus cy- 
cles follow the same definition (for details see the 
Intel486 data sheet). The difference lies in the fact 
that the Low Power Intel486 CPU works with an ex- 
ternal 2X clock input (CLK2). As shown in Figure 1-5, 
each of the internal processor clock (CLK) cycle is 
two CLK2 cycles wide. Thus a 25 MHz Low Power 
Intel486 microprocessor needs a 50 MHz clock in- 
put. 


CLK2 provides the fundamental timing for the Low 
Power Intel486 CPU. It is divided by two internally to 
generate the internal processor clock (CLK) used for 
instruction execution. The internal clock is com- 
prised of two phases, "phase one" 
and "phase 
two". Each CLK2 period is a phase of the internal 
clock. All Low Power Intel486 microprocessor inputs 
are sampled at the rising edge of phase 1. Each bus 
cycle is comprised of at least two bus states, T1 and 
T2. Each bus state in turn consists of two CLK2 cy- 
cles phase 1 and phase 2 of the bus state. The bus 
state diagram in Section 7.2.13 of the Intel486 CPU 
data sheet is valid for the Low Power Intel486 micro- 
processor. 


NOTE: 
The timing diagrams given in the Intel486 data 
sheet can be used for the Low Power Intel486 mi- 
croprocessor. Read "CLK" 
signal as the internal 
clock of the CPU, with "CLK2" (the input clock of 
the Low Power Intel486 CPU) being twice the fre- 
quency of the internal processor clock as shown in 
Figure 1-5. 


The following describes how the input signals are 
sampled and output signals are referenced with re- 
spect to the input clock (CLK2): 


INPUT SIGNALS: 


The Low Power Intel486 CPU samples all its syn- 
chronous 
input 
signals 
(Le. 
ROY#, 
8RDY #, 


eoo 


500 
~ 
400 
.5 


"- 
300 
::> 
Jl 
200 


100 


0 
0 


8S8#, 
8S16#, 
KEN#, 
EADS#, 
80FF#, 
HOLD 
and AHOLD) at the rising edge of phase 1, as long 
as proper setup and hold times relative to that clock 
edge are met. 


The Low Power Intel486 CPU samples all its asyn- 
chronous 
input signals (Le. RESET, INTR, NMI, 
A20M# 
FLUSH#, 
IGNNE#) at every other rising 
edge of the system clock (Phase 1), as long as prop- 
er setup and hold times relative to that clock edge 
are met. 


The A.C. timing specifications for output signals (Le. 
valid and float delay timings) are specified with re- 
spect to the rising edge of the Phase 1 of the system 
clock. This holds true for all output signals including 
ADS# and PCHK#. 


1.8 Variable CPU Frequency 


The Low Power Intel486 microprocessor allows the 
CPU frequency to change dynamically. As shown in 
Figures 1-8 and 1-9, the relationship between fre- 
quency and power consumption is approximately lin- 
ear. Thus lowering the CPU frequency, reduces the 
power supply current (Ice>consumed by the CPU. 


The following must be satisified to change the CPU 
frequency: 
1. Frequency can be changed at least 8 clocks after 
satisfying t4 (see Figure 1-7). The system can be 
started at a lower frequency and after satisfying 
the CLKSEL pulse specifications, it can be oper- 
ated at the required speed. 
2. The change in frequency should satisfy the mini- 


mum specification of 
"CLK2 
high time" 
and 
"CLK2 low time". That is, at no time should the 
clock period go below the specified clock high 
and clock low times (see A.C. specifications for 
exact values). 


700 


600 


<- 
500 


,5 
400 
Q.::- 
300 
u 
_u 


200 


100 


0 
0 
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Table 
2-1 provides 
the absolute 
maximum 
ratings. 
It is a stress 
rating 
only and functional 
operation 
at the 
maximums 
is not guaranteed. 
Functional 
operating 
conditions 
are given in Section 
2.1 D.C. Specifications 
and 
2.3 A.C. Specifications. 


Case Temperature 
under Bias 
-65·C 
to + 110·C 


Storage Temperature 
- 65·C to + 150·C 


Voltage 
on Any Pin with Respect 
to Ground 
-0.5V 
to (VCC + 0.5V) 


Supply Voltage with Respect 
to Vss 
-0.5Vto 
+6.5V 


Symbol 
Parameter 
Mln 
Max 
Unit 
Notes 


VIL 
Input low 
Voltage 
-0.3 
+0.8 
V 


VIH 
Input High Voltage 
2.0 
VCC + 0.3 
V 


VOL 
Output low 
Voltage 
0.45 
V 
(Note 1) 


VOH 
Output High Voltage 
2.4 
V 
(Note 2) 


Icc 
Power Supply Current 
700 
mA 
(Note 3) 
ClK2 
= 50 MHz 


III 
Input leakage 
Current 
±15 
/LA 
(Note 4) 


IIH 
Input leakage 
Current 
200 
/LA 
(Note 5) 


IlL 
Input leakage 
Current 
-400 
/LA 
(Note 6) 


ILO 
Output 
leakage 
Current 
±15 
/LA 


CIN 
Input Capacitance 
20 
pF 
Fe = 1 MHz(7) 


Co 
I/O or Output Capacitance 
20 
pF 
Fe = 1 MHz(7) 


CCLK 
ClK 
Capacitance 
20 
pF 
Fe = 1 MHz(7) 


NOTES: 
1. This parameter is measured at: 


Address, Data BEn 4.0 mA 
Definition, Control 5.0 mA 
2. This parameter is measured at: 
Address, Data BEn -1.0 
mA 
Definition, Control -0.9 
mA 
3. Typical supply current 
Ice = 550 mA @CLK2 = 50 MHz 
4. This parameter is for inputs without pullups or pulldowns and 0 :S: VIN :S: Vee. 
5. This parameter is for inputs with pulldowns and VIH = 2.4V. 
6. This parameter is for inputs with pullups and VIL = 0.45V. 
7. Not 100% tested. 
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Table 2-3 provides 
the D.C. operating 
conditions 
for the low 
Power Intel486 
SX microprocessor 
(PGA Version) 
and the Intel487 
SX Math CoProcessor 
installed 
in a low power system. 


Symbol 
Parameter 
Mln 
Max 
Unit 
Notes 


VIL 
Input low 
Voltage 
-0.3 
+0.8 
V 


VIH 
Input High Voltage 
2.0 
VCC + 0.3 
V 


VOL 
Output low 
Voltage 
0.45 
V 
(Note 1) 


VOH 
Output High Voltage 
2.4 
V 
(Note 2) 


Icc 
Power Supply Current 
ClK2 
= 32 MHz 
525 
mA 
(Note 3) 
= 40 MHz 
600 
= 50 MHz 
700 


ICCF 
Power Supply Current with Intel486 SX 
CPU Tri-stated 
(floating) 
ClK2 
= 32 MHz 
400 
mA 
= 40 MHz 
500 
= 50 MHz 
600 


III 
Input leakage 
Current 
±15 
/LA 
(Note 4) 


IIH 
Input leakage 
Current 
200 
/LA 
(Note 5) 


IlL 
Input leakage 
Current 
-400 
/LA 
(Note 6) 


ILO 
Output leakage 
Current 
±15 
/LA 


CIN 
Input Capacitance 
20 
pF 
Fe = 1 MHz(7} 


Co 
I/O or Output Capacitance 
20 
pF 
Fe = 1 MHz(7} 


CCLK 
ClK 
Capacitance 
20 
pF 
Fe = 1 MHz(7} 


NOTES: 
1. This parameter is measured at: 


Address, Data BEn 4.0 mA 
Definition, Control 5.0 mA 
2. This parameter is measured at: 


Address, Data BEn -1.0 
mA 
Definition, Control -0.9 
mA 
3. Typical supply current 
Icc 
= 400 @CLK2 = 32 MHz 
(Normal Operation) 
= 475 mA @CLK2 = 40 MHz 
= 500 mA @CLK2 = 50 MHz 
ICCF = 325 mA @CLK2 = 32 MHz 
Intel486 CPU Tri-stated (Floating) 
= 400 mA @CLK2 = 40 MHz 
= 470 mA @CLK2 = 50 MHz 
4. This parameter is for inputs without pullups or pulldowns and 0 ,;; VIN ,;; VCC. 
5. This parameter is for inputs with pulldowns and VIH = 2.4V. 
6. This parameter is for inputs with pullups and Vil = 0.45V. 
7. Not 100% tested. 


Table 2-4 provides 
the D.C. Operating 
Conditions 
for 
the Low Power 
Intel486 
SX microprocessor 
(PQFP 
version) 
and the Intel487 
SX Math CoProcessor 
in- 
stalled 
in a low power system. 
Vee = 5V - 
10%, +5%; 
TeAsE = O'C to + 85'C 


Table 2·4. Low Power Intel486™ SX Microprocessor D.C.Parametric Values (PQFPversion) 


Symbol 
Parameter 
Mln 
Max 
nit 
Notes 


VIL 
Input Low Voltage 


VIH 
Input High Voltage 


VOL 
Output Low Voltage 


VOH 
Output High Voltage 


Icc 
Power Supply Current 
CLK2 = 32 MHz 
CLK2 = 40 MHz 
CLK2 = 50 MHz 


ICCF 
Power Supply Current 
CPU in Power Dowl),!-'1 
e 


III 
Input Leakage Current 


IIH 
Input Leakage Current 


IlL 
Input Leakage Cur 


ILO 
Output Leakag 


CIN 
Input Capacitance 


Co 
I/O or Output Capacitance 


CCLK 
CLK Capacitance 


(Note 1) 


(Note 2) 


(Note 3) 
mA 


mA 
(Note 7) • 
±15 
IJoA 
(Note 4) 


200 
IJoA 
(Note 5) 


-400 
IJoA 
(Note 6) 


±15 
IJoA 


10 
pF 
FC = 1 MHz (Note 7) 


10 
pF 
FC = 1 MHz (Note 7) 


6 
pF 
Fc = 1 MHz (Note 7) 


NOTES: 
1. This parameter is measured at: 
Address, Data, BEn 
4.0 mA 
Definition, Control 
5.0 mA 
2. This parameter is measured at: 


Address, Data BEn 
-1.0 
mA 
Definition, Control 
-0.9 
mA 
3. Typical supply current: 


Ice 380 mA @ CLK2 = 32 MHz (Normal Operation) 
440 mA @ CLK2 = 40 MHz 
480 mA @ CLK2 = 50 MHz 
4. This parameter is for inputs without pullups or pulldowns and 0 ~ VIN ~ Vee. 
5. This parameter is for inputs with pulldowns and VIH = 2.4V. 
6. This parameter is for inputs with pullups and VIL = 0.45V. 
7. Not 100% tested. 


2.2 Power Supply Current vs Frequency 


Following 
is the power consumption 
of the Low Power Intel486 
microprocessor 
or Intel487 
Math CoProcessor 
installed 
in a low power system 
for different 
frequencies. 


CLK2 
Frequency 
Operating 
Frequency 
ICC(max) 
ICC(typ) 
(MHz) 
(MHz) 
(mA) 
(mA) 


4 
2 
150 
100 


16 
8 
325 
235 


32 
16 
525 
400 


40 
20 
600 
475 


50 
25 
700 
550 


CLK2 
Frequency 
Operating 
Frequency 
Icc (max) 
Icc (typ) 
(MHz) 
(MHz) 
(mA) 
(mA) 


4 
2 
150 
100 


16 
8 
250 
210 


32 
16 
450 
380 


40 
20 
500 
440 


50 
25 
560 
480 


The following 
tables provide the A.C. specifications 
for the Low Power Intel486 
microprocessors. 
They consist 
of output 
delays, 
input setup requirements 
and input hold requirements. 
All A.C. specifications 
are rel~tive 
to 
the rising edge of the phase 
1 of the input system 
clock 
(CLK2), 
unless otherwise 
specified. 


Table 
2-7. Low 
Power 
Intel486 
DX-25 
MHz Microprocessor 
A.C. Characteristics 
Vcc = 5V ±10%; 
Tease = O°C to + 85°C; CL = 50 pF(2) unless otherwise 
specified 


Symbol 
Parameter 
Min 
Max 
Unit 
Figure 
Notes 


Frequency 
2 
25 
MHz 
Half of CLK2 Frequency 


t1 
CLK2 Period 
) 
20 
250 
ns 
2.1 


t2 
CLK2 High Time 
7 
ns 
2.1 
At2V 


t3 
CLK2 Low Time 
7 
ns 
2.1 
At 0.8V 


t4 
CLK2 Fall Time 
, 
2 
ns 
2.1 
2Vto 
0.8V 


t5 
CLK2 Rise Time 
2 
ns 
2.1 
0.8Vto 
2V 


ts 
A2-A31, 
PWT, PCD, BEO-3#, 
3 
22 
ns 
2.2 
M/IO#, 
D/C#, 
W/R#, 
ADS#, 
LOCK#, 
FERR#, 
BREQ, HLDA Valid Delay 


t7 
A2-A31, 
PWT, PCD, BEO-3#, 
30 
ns 
2.2 
After Clock Edge(1) 
M/IO#, 
D/C#, 
W/R#, 
ADS#, 
LOCK # Float Delay 


t8 
PCHK # Valid Delay 
3 
27 
ns 
2.2 


t8a 
BLAST #, PLOCK # Valid Delay 
3 
27 
ns 
2.3 
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Table 2·7. Low 
Power 
Intel486 
DX-25 MHz Microprocessor 
A.C. Characteristics 
(Continued) 
Vcc = 5V ± 10%; Tease = O°C to + 85°C; CL = 50 pF(2) unless otherwise 
specified 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


t9 
8 LAST #, PLOCK # 
30 
ns 
2.2 
After Clock Edge(1) 
Float Delay 


t10 
00-031, 
OPO-3 Write Data 
3 
22 
ns 
2.2 
Valid Delay 


tl1 
00-031, 
OPO-3 Write Data 
30 
ns 
2.2 
After Clock Edge(1) 
Float Delay 


t12 
EAOS# 
Setup Time 
9 
ns 
2.3 


t13 
EAOS # Hold Time 
4 
ns 
2.3 


t14 
KEN#, 
8S16#, 
8S8# 
9 
ns 
2.3 
Setup Time 


t15 
KEN#, 
8S16#, 
8S8# 
4 
ns 
2.3 
Hold Time 
I' 


t16 
ROY #, 8ROY # Setup Time 
9 
ns 
2.3 


t17 
ROY #, 8ROY # Hold Time 
4 
ns 
2.3 


t18 
HOLD, AHOLO, 80FF 
# 
11 
ns 
2.3 
Setup Time 


t19 
HOLD, AHOLO, 80FF 
# 
4 
ns 
2.3 
Hold Time 


t20 
RESET, FLUSH #, A20M #, NMI, 
11 
ns 
2.3 
INTR, IGNNE # Setup Time 


t21 
RESET, FLUSH#, 
A20M#, 
NMI, 
4 
ns 
2.3 
INTR,IGNNE# 
Hold Time 


t22 
00-031, 
OPO-3, A4-A31 
6 
ns 
2.3 
Read Setup Time 


t23 
00-031, 
OPO-3, A4-A31 
4 
ns 
2.3 
Read Hold Time 


CLKSEL 
See Figures 1-6 and 1-7 for details on this signal. Figure 
1-7 shows minimum timings required for the proper 
operation 
of the CPU. The pulse on CLKSEL can be of 
any length as long as the minimums 
are satisfied 
and 
the transitions 
from low to high occurs at the clock edge 
shown. 


NOTES: 
1. Not 100% tested, guaranteed by design characterization. 
2. All timing specifications assume CL = 50 pF. 
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Table 2·8. Low Power 
Intel486™ 
SX-16 
MHz 
Microprocessorllntel487™ 
SX Math CoProcessor 
A.C. Characteristics 
Vcc = 5V ± 10%; Tease = O°C to + 85°C; CL = 50 pF(2) unless otherwise 
specified 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


Frequency 
2 
16 
MHz 
Half of CLK2 Frequency 


t1 
CLK2 Period 
31 
250 
ns 
2.1 


t2 
CLK2 High Time 
10 
ns 
2.1 
At2V 


t3 
CLK2 Low Time 
10 
ns 
2.1 
AtO.8V 


t4 
CLK2 Fall Time 
4 
ns 
2.1 
2V toO.8V 


t5 
CLK2 Rise Time 
4 
ns 
2.1 
0.8Vt02V 


te 
A2-A31, 
PWT, PCD, BEO-3#, 
3 
26 
ns 
2.2 
M/IO#, 
D/C#, 
W/R#, 
ADS#, 
LOCK#, 
FERR#*, 
BREQ, HLDA Valid Delay 


t7 
A2-A31, 
PWT, PCD, BEO-3#, 
42 
ns 
2.2 
After Clock Edge(1) 
M/IO#, 
D/C#, 
W/R#, 
ADS#, 
LOCK # Float Delay 
, 


te 
PCHK# 
Valid Delay 
3 
35 
ns 
2.2 


tea 
BLAST #, PLOCK # Valid Delay 
3 
35 
ns 
2.2 


t9 
BLAST #, PLOCK # Float Delay 
42 
ns 
2.2 
After Clock Edge(1) 


t10 
DO-D31, 
DPO-3 Write Data Valid Delay 
3 
30 
ns 
2.2 


t11 
DO-D31, 
DPO-3 Write Data Float Delay 
42 
ns 
2.2 
After Clock Edge(1) 


t12 
EADS# 
Setup Time 
12 
ns 
2.3 


t13 
EADS# 
Hold Time 
4 
ns 
2.3 


t14 
KEN#, 
BS16#, 
BS8# 
Setup Time 
12 
ns 
2.3 


t15 
KEN#, 
BS16#, 
BS8# 
Hold Time 
4 
ns 
2.3 


t16 
RDY #, BRDY # Setup Time 
12 
ns 
2.3 


t17 
RDY # , BRDY # Hold Time 
4 
ns 
2.3 


t1e 
HOLD, AHOLD, 
BOFF # Setup Time 
12 
ns 
2.3 


t19 
HOLD, AHOLD, 
BOFF # Hold Time 
4 
ns 
2.3 


t20 
RESET, FLUSH #, A20M #, NMI, 
14 
ns 
2.3 
INTR, IGNNE # * Setup Time 


t21 
RESET, FLUSH #, A20M #, NMI, 
4 
ns 
2.3 
INTR, IGNNE # * Hold Time 


t22 
DO-D31,DPO-3,A4-A31 
10 
ns 
2.3 
Read Setup Time 


t23 
DO-D31, 
DPO-3, 
A4-A31 
4 
ns 
2.3 
Read Hold Time 


CLKSEL 
See Figures 1·6 and 1-7 for details on this signal. 
Figure 1-7 shows minimum timings required for the 
proper operation 
of the CPU. The pulse on CLKSEL 
can be of any length as long as the minimums 
are 
satisfied 
and the transitions 
from low to high occurs at 
the clock edge shown. 


Table 2-9. Low Power 
Intel486™ 
SX-20 
MHz 
Microprocessorlintel487™ 
SX Math CoProcessor 
A.C. Characteristics 
Vcc = 5V ± 10%; Tcase = O°C to + 85°C; CL = 50 pF(2) unless otherwise 
specified 


Symbol 
Parameter 
Min 
Max 
Unit 
Figure 
Notes 


Frequency 
2 
20 
MHz 
Half of CLK2 Frequency 


t1 
CLK2 Period 
25 
250 
ns 
2.1 


t2 
CLK2 High Time 
8.5 
ns 
2.1 
At2V 


t3 
CLK2 Low Time 
8.5 
ns 
2.1 
AtO.8V 


t4 
CLK2 Fall Time 
3 
ns 
2.1 
2V to 0.8V 


ts 
CLK2 Rise Time 
3 
ns 
2.1 
0.8Vto 
2V 


~ 
A2-A31, 
PWT, PCD, BEO-3#, 
3 
23 
ns 
2.2 
M/IO#, 
D/C#, 
W/R#, 
ADS#, 
LOCK#, 


FERR#·, 
BREQ, HLDA Valid Delay 


t7 
A2-A31, 
PWT, PCD, BEO-3#, 
37 
ns 
2.2 
After Clock Edge(1) 
M/IO#, 
D/C#, 
W/R#, 
ADS#, 


LOCK# 
Float Delay 


t8 
PCHK # Valid Delay 
3 
28 
ns 
2.2 


taa 
BLAST #, PLOCK # Valid Delay 
3 
28 
ns 
2.2 


t9 
BLAST #, PLOCK # Float Delay 
37 
ns 
2.2 
After Clock Edge(1) 


tlO 
DO-D31, 
DPO-3 Write Data Valid Delay 
3 
26 
ns 
2.2 


tll 
DO-D31, 
DPO-3 Write Data Float Delay 
37 
ns 
2.2 
After Clock Edge(1) 


t12 
EADS# 
Setup Time 
10 
ns 
2.3 


t13 
EADS # Hold Time 
4 
ns 
2.3 


t14 
KEN#, 
BS16#, 
BS8# 
Setup Time 
10 
ns 
2.3 


t1S 
KEN#, 
BS16#, 
BS8# 
Hold Time 
4 
ns 
2.3 


t16 
RDY #, BRDY # Setup Time 
10 
ns 
2.3 


t17 
RDY #, BRDY # Hold Time 
4 
ns 
2.3 


t18 
HOLD, AHOLD, 
BOFF # Setup Time 
12 
ns 
2.3 


t19 
HOLD, AHOLD, 
BOFF # Hold Time 
4 
ns 
2.3 


t20 
RESET, FLUSH#, 
A20M#, 
NMI, 
12 
ns 
2.3 
INTR, IGNNE #. 
Setup Time 


t21 
RESET, FLUSH #, A20M #, NMI, 
4 
ns 
2.3 
INTR, IGNNE#· 
Hold Time 


t22 
DO-D31, 
DPO-3, 
A4-A31 
6 
ns 
2.3 
Read Setup Time 


t23 
DO-D31, 
DPO-3, 
A4-A31 
4 
ns 
2.3 
Read Hold Time 


NOTES: 
1. Not 100% tested, guaranteed by design characterization. 
2. All timing specifications assume CL = SOpF. 


• 


Table 2·10. Low Power 
Inte1486-25 
MHz 
Microprocessorlintel487™ 
SX Math CoProcessor 
A.C. Characteristics 
VCC = 5V ± 10%; Tcase = O°C to + 85°C; CL = 50 pF(2) unless otherwise 
specified 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


Frequency 
2 
25 
MHz 
Half of CLK2 Frequency 


tl 
CLK2 Period 
20 
250 
ns 
2.1 


t2 
CLK2 High Time 
7 
ns 
2.1 
At2V 


t3 
CLK2 Low Time 
7 
ns 
2.1 
At 0.8V 


t4 
CLK2 Fall Time 
2 
ns 
2.1 
2Vto 
0.8V 
I 


t5 
CLK2 Rise Time 
2 
ns 
2.1 
0.8Vto 
2V 


ts 
A2-A31, 
PWT, PCD, BEO-3#, 
3 
19 
ns 
2.2 
MIIO#, 
D/C#, 
W/R#, 
ADS#, 
LOCK#, 
FERR#', 
BREQ, HLDA Valid Delay 


t7 
A2-A31, 
PWT, PCD, BEO-3#, 
28 
ns 
2.2 
After Clock Edgell) 
MIIO#, 
D/C#, 
W/R#, 
ADS#, 
LOCK # Float Delay 


t8 
PCHK# 
Valid Delay 
3 
24 
ns 
2.2 


t8a 
BLAST #, PLOCK # Valid Delay 
3 
24 
ns 
2.2 


t9 
BLAST #, PLOCK # Float Delay 
28 
ns 
2.2 
After Clock Edgell) 


tl0 
00-031, 
DPO-3 Write 
3 
20 
ns 
2.2 
Data Valid Delay 


tll 
00-031, 
DPO-3 Write 
28 
ns 
2.2 
After Clock Edgell) 
Data Float Delay 


t12 
EADS # Setup Time 
9 
ns 
2.3 


t13 
EADS # Hold Time 
4 
ns 
2.3 


t14 
KEN#, 
BS16#, 
BS8# 
Setup Time 
9 
2.3 
. 


ns 


t15 
KEN#, 
BS16#, 
BS8# 
Hold Time 
4 
ns 
2.3 


t16 
ROY #, BRDY # Setup Time 
9 
ns 
2.3 


t17 
RDY#, 
BRDY# 
Hold Time 
4 
ns 
2.3 


t18 
HOLD, AHOLD, 
BOFF # Setup Time 
11 
ns 
2.3 


t19 
HOLD, AHOLD, 
BOFF # Hold Time 
4 
ns 
2.3 


t20 
RESET, FLUSH#, 
A20M#, 
NMI, 
11 
ns 
2.3 
INTR, IGNNE #' 
Setup Time 


t21 
RESET, FLUSH#, 
A20M#, 
NMI, 
4 
ns 
2.3 
INTR, IGNNE#' 
Hold Time 


t22 
00-031, 
DPO-3, 
A4-A31 
6 
ns 
2.3 
Read Setup Time 


'Present only in the Intel487 SX Math CoProcessor 


NOTES: 
1. Not 100% tested, guaranteed by design characterization. 
2. All timing specifications assume CL = 50 pF. 


Table 
2·10. Low Power 
Inte1486-25 
MHz 
Microprocessorlintel487™ 
SX Math CoProcessor 
A.C. Characteristics 
Vcc = 5V ± 10%; Tcase = O°C to + 85°C; CL = 50 pF(2) unless otherwise 
specified 
(Continued) 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


t23 
DO-D31,DPO-3,A4-A31 
4 
ns 
2.3 
Read Hold Time 


CLKSEL 
See Figures 1-6 and 1-7 for details on this signal. 
Figure 1-7 shows minimum timings required for the 
proper operation 
of the CPU. The pulse on CLKSEL 
can be of any length as long as the minimums 
are 
satisfied 
and the transitions 
from low to high occurs at 
the clock edge shown. 


NOTES: 
1. Not 100% 
tested. 
guaranteed 
by design 
characterization. 


2. All timing 
specifications 
assume 
CL = 50 pF. 


_/ 
\- 
\- 


tX=t12, 
t14, t16, t18, 
120, t22 


ty=t13, 
t15, t17, t19, t21, t23 


• 


CLK2 
__ 
\ 
/ 


OUTPUTS 


tx = t6. t8, t8a, t1 0 


ty = t7, t9, t11 


\- 


__ 
VA_LID 
_~--- 


3.0 
MATH UPGRADE 
FOR LOW 
POWER 
Intel486™ 
SX 
MICROPROCESSOR 


The Intel487 SX Math CoProcessor (MCP) is the 
math upgrade for the Low Power Intel486 SX micro- 
processor. The Intel487 SX MCP is designed and 
tested to operate with an external 1X clock input 
(standard mode) when it is installed in an Intel486 
SX CPU system or with an external 2X clock input 
(low power mode) when it is installed in a Low Power 
Intel486 SX CPU based system. The Intel487 SX 
MCP is a super-set of the Intel486 SX CPU, contain- 
ing a floating point unit, in addition to all other on- 
chip units present in the Intel486 SX microproces- 
sor. 


The Floating Point Unit (FPU) performs floating point 
operations on the 32-, 64-, and 80-bit arithmetic for- 
mats specified in IE~E Standard 754. Like the inte- 
ger processing unit, the floating point unit architec- 
ture is binary-compatible with the 8087 and 80287 
Math CoProcessors. The architecture is 100% com- 
patible with the Intel287 and Intel387 Math CoProc- 
essors. 


The on-chip floating point unit of the Intel487 SX 
Math CoProcessor is similar to the FPU of the In- 
tel486 OX CPU containing eight data registers, a 


tag word, a control register, a status register, an in- 
struction pointer and a data pointer. (For details on 
FP registers and instructions, see the Intel486 SX 
CPU/lntel487 SX MCP data sheet Section 2.1.3). 


Note that the Intel487 SX Math Coprocessor is the 
only upgrade available for the Low Power Intel486 
SX microprocessor based designs. It is fully compat- 
ible with the Low Power mode of the Intel486 SX 
CPU. However, the Intel OverDrive Processor does 
not work in systems based on the Low Power In- 
tel486 CPU. All address, data and control signals, 
including the external CPU clock input (CLK2) and 
the CLKSEL signal of the Low Power Intel486 SX 
CPU, must be tied to the corresponding signals of 
the Intel487 SX MCP (Le. the Intel486 SX CPU's 
CLK2 signal must be connected to the Intel487 SX 
MCP's CLK2 signal, and the Intel486 SX CPU's 
CLKSEL signal must be connected to the CLKSEL 
signal on the Intel487 SX MCP). Additionally, the 
performance upgrade circuit given in the Intel486 SX 
CPU/lntel487 SX MCP data sheet should be fol- 
lowed to provide the math upgrade capability in Low 
Power Intel486 SX CPU systems. 


The D.C.!A.C. specifications given in Section 2.0 ap- 
ply to the Low Power Intel486 SX CPU and the In- 
tel487 SX MCP when it is installed in a Low Power 
Intel486 SX CPU system. 
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A 
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C 
0 
E 
f 
G 
H 
J 
K 
L 
M 
N 
P 
Q 
R 
S 


1 /0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
1 


020 
011 
VSS 
VSS 
VCC 
VSS 
02 
A31 
A27 


019 
09 
OP1 
VSS 
VSS 
VSS 
DO 
A28 


2 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
2 


022 
018 
VCC 
VCC 
OS 
06 
01 
VSS 
A26 
021 
013 
08 
03 
VCC 
VCC 
A29 
A2S 


3 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
3 


CLKSEL 
CLK2 
010 
012 
016 
07 
OPO 
A17 
A23 


VSS 
017 
O1S 
OP2 
014 
04 
A30 
vcc 
4 
0 
0 
0 
0 
0 
0 
0 
4 


023 
VCC 
KEY 
A19 
NC 


VSS 
vss 
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A14 
vss 
A18 
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6 


024 
027 
A24 
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o2S 
vcc 
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3.2. Pin Reference of Intel487™ SX Math CoProcessor 
Table3-1.PinCross Referenceby PinName 


Address 
Data 
Control 
Nle 
Vcc 
Vss 


A2 
014 
Do 
P1 
A20M# 
015 
A10 
B7 
A7 
A3 
R15 
01 
N2 
AOS# 
S17 
A12 
B9 
A9 
~ 
S16 
02 
N1 
AHOLO 
A17 
A14 
B11 
A11 
A5 
012 
03 
H2 
BEO# 
K15 
B10 
C4 
B3 
A6 
S15 
04 
M3 
BE1# 
J16 
B12 
C5 
B4 
A7 
013 
05 
J2 
BEU 
J15 
B13 
E2 
B5 
A8 
R13 
06 
L2 
BE3# 
F17 
B16 
E16 
E1 
A9 
011 
07 
L3 
BLAST # 
R16 
C10 
G2 
E17 
A10 
S13 
08 
F2 
BOFF# 
017 
C11 
G16 
G1 


A11 
R12 
09 
01 
BROY# 
H15 
C12 
H16 
G17 
A12 
S7 
010 
E3 
BREO 
015 
C13 
J1 
H1 
A13 
010 
011 
C1 
BS8# 
016 
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K2 
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A14 
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012 
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015 
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M15 
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J3 
OPO 
N3 
P16 
M1 
A19 
04 
017 
03 
OP1 
F1 
R3 
M17 
A20 
08 
018 
C2 
OP2 
H3 
R6 
P17 
A21 
05 
019 
B1 
OP3 
A5 
R8 
02 
A22 
07 
020 
A1 
EAOS# 
B17 
R9 
R4 
A23 
S3 
021 
B2 
FERRI' 
A13 
R10 
S6 
A24 
Q6 
022 
A2 
FLUSH # 
C15 
R11 
S8 
A25 
R2 
023 
A4 
HLOA 
P15 
R14 
S9 
A26 
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024 
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E15 
S10 
A27 
S1 
025 
B6 
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A15 
S11 
A28 
R1 
026 
C7 
INTR 
A16 
S12 
A29 
P2 
027 
C6 
KEN# 
F15 
S14 
A30 
P3 
028 
C8 
LOCK # 
N15 
A31 
01 
029 
A8 
M/IO# 
N16 
030 
C9 
MP# 
B14 
031 
B8 
NMI 
B15 
PCO 
J17 
PCHK# 
017 
PWT 
L15 
PLOCKI' 
016 
ROY# 
F16 
RESET 
C16 
W/R# 
N17 


• 


The Intel487 
SX Math CoProcessor 
consists 
of all the Intel486 
SX microprocessor 
signals with the following 


additional 
pins. 


NUMERIC 
ERROR REPORTING 


FERR# 
0 
The Floating 
point 
error 
pin is driven active when a floating 
point error occurs. 
FERR # is 


similar to the ERROR # pin on the i387 Math CoProcessor. 
FERR # is included 
for 


compatibility 
with systems 
using DOS type floating 
point error reporting. 
FERR # is active 


LOW, and is not floated 
CONT1 g bus hold. 


IGNNE# 
I 
When the ignore 
numeric 
error 
pin is asserted, 
the Low Power Intel487 SX Math 


CoProcessor 
will ignore a numeric error and continue 
executing 
non-control 
floating 
point 
instructions. 
When IGNNE# 
is deasserted 
the Low Power Intel487 SX Math CoProcessor 


will freeze on a non-control 
floating 
point instruction, 
if a previous floating 
point instruction 


caused an error. IGNNE # has no effect when the NE bit in control 
register 0 is set. IGNNE # 
is active LOW and is provided 
with a small internal pullup resistor. 
IGNNE # is asynchronous 


but setup and hold times t20 and t21 must be met to insure recognition 
on any specific 
clock. 


Intel487™ 
SX MATH COPROCESSOR 
INTERFACE 


MP# 
0 
The math present 
pin is used to signal the Intel486 SX microprocessor 
to float its outputs 


and get-off the bus. This pin can be used to check the presence 
of the Math CoProcessor 
in 


the two socket math upgrade circuit. It is active low and is never floated. 
MP# 
is driven low 


at power-up 
and remains active for the entire duration of the Intel487™ 
SX Math 


CoProcessor 
operation. 


KEY PIN 


KEY 
The KEY pin is an electrically 
non-functional 
pin which is used to insure the correct 
Intel487 


SX Math CoProcessor 
orientation 
in a 169-pin socket. 
KEY pin is located at "04" 
and is the 
169th pin of the Intel487 SX MCP. 


Aevision-002 
of the Low Power 
Intel486 
SX CPU/intel487 
SX MCP data book 
contains 
many updates 
and 


improvements 
to the original 
version. 
A revision 
summary 
of major changes 
is listed below: 


Cover Page 


5ection 
1.1 


5ection 
1.4 


5ection 
1.5 


Section 
1.7 


Section 
2.1 


5ection2.2 


Added 
Low Power 
Intel486 
SX CPU PQFP package 
information. 


Added 
Figure 
1-5. Low Power Intel486 
SX CPU PQFP Pinout. 


Added 
Low Power Intel486 
SX CPU PQFP package 
to Pin Cross Reference 
Table. 


Added 
Performance 
Upgrade 
Sup,port section 
of Low Power Intel486 
SX CPU PQFP package. 


Added 
Table 
1-5. Test Pins. 


Added 
Table 
1-6 Component 
10 and Revision 
10. 


Added 
Test Access 
Port section 
of Low Power 
Intel486 
SX CPU PQFP package. 


Noted 
IGNNE -# pin is present 
on the Intel486 
OX CPU and Intel487 
SX MCP only . 


Added 
UP -# pin and descriptions. 


Added 
Figure 
1-9. Frequency 
vs Icc (typ) (PQFP Version). 


Added 
Table 2-4. Intel486 
SX Microprocessor 
D.C. Parametric 
Values 
(for PQFP package). 


Added 
Table 
2-6 
Power 
Supply 
Current (cd Values 
over 
Frequencies 
of Operation 
(PQFP 


Version). 
• 
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Intel's 82420 PClset enables workstation level of performance for Intel486™ CPU desktop systems. The 
Peripheral Component Interconnect Bus (PCI) is driving a new architecture for PC's-eliminating 
the I/O 
bottleneck of standard expansion busses. PCI provides a glueless interface for high performance peripherals 
such as graphics, SCSI, LAN and video to be placed onto a fast local bus. By utilizing this technology and 
incorporating read/write bursts along with write buffers into the 82420 PClset, a new level of performance is 
now possible for today's Intel486 CPU desktop systems. 


The Intel 82420 PClset is comprised of three components: the 82424 Cache DRAM Controller (CDC), the 
82423 Data Path Unit (DPU), and the 82378 System I/O (SID). The CDC and DPU provide the core system 
architecture while the SID is a PCI master/slave agent which bridges the core architecture to the ISA standard 
expansion bus. Intel also offers two components, the 82374EB (ESG) and 82375EB (PCEB), that work in 
conjunction to bridge the PCI bus to the EISA expansion bus. Refer to the ESC and PCEB data sheets for 
information regarding the EISA bridge components. 


The chip set supports the Intel486 CPU family as well as the write-back caching capability of Intel's future 
OverDrive™ processor for the Intel486 DX2 CPU.The high performance memory subsystem supports concur- 
rent operation between PCI bus masters while the CPU accesses memory. An integrated second level cache 
can be programmed for write-through or write-back operation. 


r-----I 
I Cache 
SRAM 
I 
I64KB• 
512KB 
I 


Product Highlights 


82378-System 
110Component (510) 


• 
Supports 
Fast DMA Type A, B, or F 
Cycles 


• 
Supports 
DMA Scatter/Gather 


• 
Arbitration 
Logic for Four PCI Masters 


• 
Reusable 
across 
Multiple 
Platforms 


• 
Directly 
Drives Six External 
ISA Slots 


• 
Integrates 
Many of Today's 
Common 
110 Functions 


• 
208-Pln QFP Package 


82424-Cache 
DRAM Controller (CDC) 


• 
Concurrent 
Llneflll during Copyback 
Cycles 


• 
Supports 
Intel486 
CPU Family and 


OverDrive 
Processors 


• 
Supports 
Future OverDrive 
Upgrade 
Processor 
In Write-Back 
Cache Mode 


• 
64K-512K 
Level 2 Cache Support 


• 
Level 2 Cache Conflgurable 
as Write- 
Back or Write-Through 


• 
208-Pln QFP Package 


82423-Data 
Path Unit (DPU) 


• 
Highly Integrated 


• 
Four Dword 
Write Buffers 


• 
Zero Walt States 
for CPU Write Cycles 


• 
PCI Burst Write Capability 


• 
160-Pln QFP Package 


Product Description 


The 82424 Cache DRAM Controller 
(COG) is a single-chip 
bridge from the CPU to the PCI bus. It provides 
the 
integrated 
functionality 
of a second 
level cache controller, 
a DRAM controller, 
and a PCI bus controller. 
It also 


features 
an optimized 
memory subsystem. 
The CDC is a dual ported device with one port as the host port and 


the other as the PCI port. 


The 82423 
Data Path Unit (DPU) integrates 
the host data, memory 
data, and PCI data interface, 
DPU control/ 


parity 
and four deep 
posted 
write 
buffers. 
With glue and buffers 
integrated 
directly 
into the DPU, the 
Intel 
82420 PClset reduces board space requirements. 
The DPU's posted write buffers allow CPU write cycles to be 


executed 
as 0 wait states. 


The 82378 System 
I/O (510) is a dual ported device which acts as a bridge between 
the PCI and standard 
ISA 


I/O 
bus. The 510 integrates 
the functionality 
of an ISA controller, 
PCI controller, 
fast 32-bit 
DMA controller, 
and standard 
system 
I/O functions. 


• 
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82423 
DATA PATH UNIT (DPU) 


• 
A 32-bit High Performance 
Host/PClf 
Memory 
Data Path 


• 
Operates 
Synchronous 
to the CPU and 
PCI Clocks 


• 
Dual-Port 
Architecture 
Allows 
Concurrent 
Operations 
on the Host and 
PCI Buses 


• 
Burst Read of Memory 
from the Host 
and PCI Buses 


• 
Host-to-Memory 
and Host-to-PCI 
Post 
Buffers 
Permit Zero Walt State Write 
Performance 


• 
Byte Parity Support 
for the Host and 
Memory 
Buses 
- 
Optional 
Parity Generation 
for Host- 
to-Memory 
Transfers 
- 
Optional 
Parity Checking 
for the 
Secondary 
Cache 
Residing 
on the 
Host Data Bus 
- 
Parity Checking 
for Host and PCI 
Memory 
Reads 
- 
Parity Generation 
for PCI-to-Memory 


Writes 


• 
Force Bad Parity to Memory 
Capability 


for Diagnostic 
Purposes 


The 82423 
Data Path Unit (DPU) provides 
the 32·bit data path connections 
between 
the Host (CPU/cache), 


main memory, 
and the Peripheral 
Component 
Interconnect 
(PCI) Bus. The dual-port 
architecture 
allows 
con- 


current 
operations 
on the Host and PCI Buses. Two 4-Dword 
deep Post buffers permit Host posting 
of data to 


main memory 
and the PCI Bus. The DPU supports 
byte parity for the Host and main memory 
buses. The DPU 


is intended 
to be used with the 82424 
Cache 
DRAM 
Controller 
(CDC). During bus operations 
between 
the 


Host, main memory, 
and PCI, the CDC provides 
the address 
paths and bus controls. 
The CDC also controls 


the data flow through 
the DPU. Together, 
these two chips provide 
a full function 
dual-port 
data path connec- 


tion to main memory 
and forms a Host/PCI 
bridge. 


{ 


HO[31:00l 


Host 
HOPO 
Interface 
HOP 1 


HOP2 


HOP3 


{ 


MO[31:00l 


Main 
MOPO 
Memory 
MOP 1 
Interface 
MOP2 


MOP3 
mJC 


Reset 
{ 
RClI( 
and 
R5T# 


Clock 


AO[31:00l} 


IROY# 
PCI 
TROY# 
Interface 


OEV5L# 


PM/H# 
P5T8# 


PW/R# 


MH/P# 


M5T8# 


MR/W# 


HM/P# 
H5T8# 


HW/R# 


OPUPE# 


PCIOP 


This data sheet describes 
the 82423TX 
and 82423ZX 
components. 
All normal text describes 
the function- 


ality for both components. 
All features 
that exist on the 82423ZX 
are shaded 
as shown 
below. 


82424 
CACHE AND DRAM CONTROLLER (CDC) 


• 
Supports 25/33lsiJ, MHz Intel486TM SX, 
Intel487™ SX, Intel486 DX, Intel486 
DX2, OverDrive™ for Intel486 and 
OverDrive for DX2 Processors 


• 
Fully Synchronous, 
25/33 MHz PCI Bus 
Capable of Supporting 
Bus Masters 


• 
Supports OverDrive Upgrade Socket, 
Including OverDrive for DX2 in Wrlte- 
Back Mode 


• 
Programmable 
Attribute 
Map for First 1 
MByte of Main Memory 


• 
Posted Write Buffers for Improved 
Performance 


• 
Integrated 
DRAM Controller 
- 
2 to 160 MByte Main Memory using 
70 ns Fast Page Mode SIMM Memory 
- 
Decoupled Refresh Cycles to 
Reduce DRAM Access Latency 
- 
Burst Mode PCI Accesses to DRAM 
Supported at the Rate of x-3-3-3-3-3 


• 
Integrated Cache Controller 
- 
Write-Through 
and Write-Back Cache 
Options 
-64 
KB, 128 KB, 256 KB and 512 KB 
Cache Sizes using Standards SRAMs 
- 
Burst Line Fill of 2-1-1-1 from 
Secondary Cache at 25 and 33 MHz 
and 3-1-1-' at 50 '(HZ' 
- 
Zero Walt State Write to L2 Cache 
for a Cacche Write Hit 
- 
Main Memory Posting at Zero Wait 
States, Enabling Optimum Write- 
2 


Through Cache Performance 
- 
Concurrent 
Cache Line Replacement 
from Secondary Cache in Write-Back 
Mode 


• 
PCI Bridge 
- 
Translates CPU Cycles into PCI Bus 
Cycles 
- 
Translates Back-to-Back 
Sequential 
Memory Write Cycles Into PCI Burst 
Cycles 
- 
Separate PCI-to-Maln Memory Port 
Allows Concurrent/Independent 
CPU 
and PCI Bus Operations 
- 
Integrated Snoop Filter 


The 82424 
Cache 
DRAM 
Controller 
(COG) integrates 
the cache 
and main memory 
DRAM control 
functions 
and provides 
the address 
paths and bus control 
for transfers 
between 
the Host (CPU/cache), 
main memory, 
and the Peripheral 
Component 
Interconnect 
(PCI) Bus. The Dual-ported 
architecture 
permits 
concurrent 
oper- 
ations 
on the Host and PCI Buses. The cache 
controller 
supports 
both write-through 
and write-back 
cache 
policies 
and cache 
sizes from 
64 to 512 
KBytes. 
The cache 
memory 
can be implemented 
using standard 
asynchronous 
SRAMs. The dual-ported 
main memory 
DRAM controller 
interfaces 
DRAM to the Host Bus and 
the PCI Bus. The CDC supports 
a two-way 
interleaved 
DRAM organization 
for optimum 
performance. 
Up to 
eight single sides SIMMs or four dual sided SIMMs 
provide 
a maximum 
of 160 MBytes 
of main memory. 
The 
CDC is intended 
to be used with 
the 
82423 
Data 
Path 
Unit (DPU). The 
DPU provides 
32-bit 
data 
paths 
between 
the Host, main memory, 
and the PCI. Together, 
these two components 
provide 
a full function 
dual- 
port data path connection 
to main memory 
and form a Host/PCI 
Bridge. 


This data sheet describes 
the 82424TX, 
82424ZX 
and 82424ZX-50 
components. 
All normal text describes 
the 
functionality 
for all three components. 
All features 
that exist on the 82424ZX 
and 82424ZX-50 
are shaded 
as 
shown 
below. 


This is an example 
of what the shaded 
sections 
that apply only to the 82424ZX 
and 82424ZX-50 
components 
look like. 


Thill Is sn eXllmpltJ of whet the 8hlJded SlJctIDns thtIt sppIy only to thtI 82424ZX-60 component 
look 
Ilke. 


82420 PClset 
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ItlCPURST 
PCIRSTt 


A[31 :2] 
AD[31:0] 


BE[3:0]' 
C/BE[3:0]' 


HLOCKI 
FRAMEt 


MIlO. 
TRDYI 


D/C. 
Host 
IRDYI 
CPU 
PCI 
W/R. 
Interface 
Interface 
LOCKt 


ADS' 
STOP. 


RDvt 
REQ. 


BRDY' 
GNTI 


BOFF' 
PAR 


AHOLD 
SERR' 


EADS' 
DEVSEL' 


KEN' 
FLSHREQ' 


PCD/CACHE. 
MEMREQ' 


SMIACTt 
MEMACKI 


••.....•..•-................... _. 


INIT 
OverDrive 


HITM' 
for 
DX2 
PWROK 


tNV 
SpecifIc 
SRESETI 


Signals 
PC&.fl 


PCLlff_ 
.CAIILCA3 
CLK2 
"t:! LCA2 
Reset 
DCA#! HCA3 
end 
CLK1 


OCO.II 
HCA2 
Clock 
CLK1A 
'cell' LCCS' 
CLK1B 
0" 
HCCSt 
CLK1C 
CWE[3:0]' 
Secondary 
CLK1D 
lefI'" CO Et 
(L2) 


CALE 
Cache 
CLK1 E 


TA[8:0] 
Control 
CLK1F 


VALID 
Interface 


TWE' 
DPUPE' 
TOE' 
PCIDP 
DIRTYD 
HW/R' 
DWEt 
HM/P' 
DIRTYQ 
DPU 
HSTB' 


MA[10:1] 
Control 
PW/R' 
OMAO 
Interface 
PM/H' 
EMAO 
DRAM 
~STBI 
RAS[3:0]' 
Control 
•••••• 
Interface 
MR/W • 


CAS[7:0]I 
MH/P' 


WEt 
MSTB' 


290467-3 


Simplified CDC Block Diagram 


82378 
SYSTEM I/O (SIO) 
• Provides the Bridge between the PCI 
• Integrated 
16-Blt BIOS Timer 
Bus and ISA Bus 
• Utility Bus (X-Bus) Peripheral Support 
• 100% PCI and ISA Compatible 
- 
Provides Chip Select Decode 
- 
PCI and ISA Master/Slave 
Interface 
- 
Controls Lower X-Bus Data Byte 
- 
Directly Drives 10 PCI Loads and 6 
Transceiver 
ISA Slots 
-Integrates 
Port 92, Mouse Interrupt, 
- 
Supports PCI at 25 MHz to 33.33 MHz 
Coprocessor 
Error Reporting 
- 
Supports ISA from 6 MHz to 
• Integrates the Functionality 
of One 
8.33 MHz 
82C54 Timer· 
• Enhanced DMA Functions 
- 
System Timer 
- 
Scatter/Gather 
- 
Refresh Request 
• 


- 
Fast DMA Type A, B, and F 
-Speaker 
Tone Output 
- 
Compatible DMA Transfers 
• Integrates the Functionality 
of Two 
- 
32-Blt Addressablllty 
82C59 Interrupt Controllers 
- 
Seven Independently 
Programmable 
-14 
Interrupts 
Supported 
Channels 
- 
Functionality 
of Two 82C37A DMA 
• Non-Maskable Interrupts 
(NMI) 
Controllers 
- 
PCI System Errors 
• Integrated 
Data Buffers to Improve 
-ISA 
Parity Errors 


Performance 
• 208-Pin QFP Package 
- 
8-Byte DMA/ISA Master Line Buffer 
• 5V CMOS Technology 
- 
32-Blt Posted Memory Write Buffer 
Four Dedicated PCllnterrupts 
to ISA 
- 
Level sensitive 
• Arbitration 
for PCI Devices 
- 
can be Mapped to any Unused 
- 
Two or Four External PCI Masters 
Interrupt 
are Supported 
Complete Support for SL Enhanced 
- 
Fixed, Rotating, or a Combination 
of 
the Two 
Intel486 CPU's 
- 
SMI# Generation Bued 
on System 
• Arbitration 
for ISA Devices 
Hardware Events 
-ISA 
Masters 
- 
STPCLK# Generation to Power 
- 
DMA and Refresh 
Down the CPU 


The 82378 System I/O (SIO) component 
provides 
the bridge between 
the PCllocal 
bus and the ISA expansion 
bus. The SIO also integrates 
many of the common 
I/O functions 
found in today's 
ISA based PC systems. 
The 
SIO incorporates 
the logic for a PCI interface 
(master and slave), ISA interface 
(master and slave), enhanced 
seven channel 
DMA controller 
that supports 
fast DMA transfers 
and Scatter/Gather, 
data buffers to isolate the 
PCI bus from the ISA bus and to enhance 
performance, 
PCI and ISA arbitration, 
14 level interrupt 
controller, 
a 
16-bit BIOS timer, three 
programmable 
timer/counters, 
and non-maskable-interrupt 
(NMI) control 
logic. The 
SIO also provides 
decode 
for peripheral 
devices 
such as the Flash BIOS, Real Time Clock, Keyboard/Mouse 
Controller, 
Floppy 
Controller, 
two Serial Ports, one Parallel 
Port, and IDE Hard Disk Drive. 


This data sheet describes 
the 823781B and 82338ZB 
components. 
All normal text describes 
the functionality 
for both components. 
All features 
that exist on the 82378ZB 
are shaded 
as shown 
below. 


This is an ~ 
of what the shaded 
sections 
that llPj)ly only to the 82378ZB 
comj)Qnent 
look like. 
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AO(31:0) 
CfBEt(3:0) 
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STOP' 
LOCK' 
OEVSEL' 
SERR' 
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MEMCS. 
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GNT1tfRESUMEt 
RIO(a:I)' 
GNTIa:2)' 
MEMREQ' 
MEMACKt 
FlSHREQ, 


IRQ8I 
IRQ12fM 
INT 
NMI 
IRQ(15,14,11:9,7:3,1) 


PlR 


PCI 
Bus 
Interfsce 


ISA 
Bus 
InterflCe 


DMA 
(Scatterf 
Gather) 


Tlmersf 
Counters 


Utility 
Bus 
Support 
Logic 
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SO(15:0) 
SA(1t:0) 


LA(23:17) 


IOCS11t 
MEMCS11t 


SBHEt 
MASTER. 
MEMR' 
MEMW' 
UN 
IOCHRDY 
IOCHKI 
SYSCLK 
BALE 
lOR' 
lOW' 
SMEMR' 
SMEMW' 
ZEROWS' 
RSTDRV 
OSC 


DREQ(7:5,3:0) 
DACK(7:5.3:0)' 
EO' 
REFRESH' 


UBUSTR 
UBUSOEt 
ECSADDR(2:0] 


ECSEN' 


DSKCHG 
ALT_RSft 
ALT_A20 
FERRtfIRQ13 
IGNNEt 
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82374EB 
EISA SYSTEM CONTROLLER 
(ESC) 
• Integrates 
EISA Compatible 
Bus 
• Integrates 
Support 
Logic for X-Bus 
Controller 
Peripherals 
and More 
- 
Translates 
Cycles between 
EISA and 
- 
Generates 
Chip Selects/Encoded 
ISA Bus 
Chip Selects 
for Floppy and 
- 
Supports 
EISA Burst and Standard 
Keyboard 
Controller, 
IDE, 
Cycles 
Parallel/Serial 
Ports, and General 
- 
Supports 
ISA No Walt State Cycles 
Purpose 
Peripherals 
- 
Supports 
Byte Assembly/ 
- 
Provides 
Interface 
for Real Time 
Disassembly 
for 8-Blt, 16-Blt and 
Clock 
32-Blt Transfers 
- 
Generates 
Control 
Signals for X-Bus 
- 
Supports 
Bus Frequency 
up to 
Data Transceiver 
8.33 MHz 
-Integrates 
Port 92, Mouse Interrupt, • 
• Supports 
Eight EISA Slots 
and Coprocessor 
Error Reporting 


- 
Directly 
Drives Address, 
Data and 
• Integrates 
the Functionality 
of Two 
Control 
Signals for Eight Slots 
82C59 Interrupt 
Controllers 
and Two 
- 
Decodes 
Address 
for Eight Slot 
82C54 Timers 
Specific 
AENs 
- 
Provides 
14 Programmable 
Channels 
• Provides 
Enhanced 
DMA Controller 
for Edge or Level Interrupts 


- 
Provides 
Scatter-Gather 
Function 
- 
Provides 
4 PCI Interrupts 
Routable 


- 
Supports 
Type A, Type B, Type C 
to Any of 11 Interrupt 
Channels 


(Burst), and Compatible 
DMA 
- 
Supports 
Timer Function 
for Refresh 


Transfers 
Request, 
System 
Timer, Speaker 


- 
Provides 
Seven 
Independently 
Tone, 
Fall Safe Timer, and Periodic 


Programmable 
Channels 
CPU Speed Control 


-Integrates 
Two 82C37 A Compatible 
• Generates 
Non-Maskable 
Interrupts 
DMA Controllers 
(NMI) 
• Provides 
High Performance 
Arbitration 
- 
PCI System 
Errors 


- 
Supports 
Eight EISA Masters 
and 
- 
PCI Parity Errors 


PCEB 
- 
EISA Bus Parity Errors 


- 
Supports 
ISA Masters, 
DMA 
- 
Fall Safe Timer 


Channels, 
and Refresh 
- 
Bus Timeout 


- 
Provides 
Programmable 
Arbitration 
- 
Via Software 
Control 


Scheme 
for Fixed, Rotating, 
or 
• Provides 
BIOS Interface 
Combination 
Priority 
- 
Supports 
512 KBytes 
of Flash or 
EPROM BIOS on the X-Bus 
- 
Allows BIOS on PCI 
- 
Supports 
Integrated 
VGA BIOS 
• 208-Pln QFP Package 
• 5V CMOS Technology 


The 82374EB 
EISA System 
Component 
(ESC) provides 
all the EISA system 
compatible 
functions. 
The ESC, 
with the PCEB, provides 
all the functions 
to implement 
an EISA to PCI bridge and EISA I/O subsystem. 
The 
ESC integrates 
the common 
I/O functions 
found in today's 
EISA based PC systems. The ESC incorporates 
the 
logic for an EISA (master and slave) interface, 
EISA Bus Controller, 
enhanced 
seven channel 
DMA controller 
with Scatter-Gather 
support, 
EISA arbitration, 
14 channel 
interrupt 
controller, 
five programmable 
timer/coun- 
ters, and non-maskable 
interrupt 
(NMI) control 
logic. The ESC also integrates 
support 
logic to decode 
periph- 
eral devices 
such as the Flash 
BIOS, Real Time Clock, 
Keyboard/Mouse 
Controller, 
Floppy 
Controller, 
two 
Serial Ports, one Parallel 
Port, and IDE Hard Disk Drive. 
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82375EB 
PCI/EISA 
BRIDGE (PCEB) 
• Provides 
the Bridge between 
the PCI 
• 32-Blt Data Paths 
Bus and EISA Bus 
• Integrated 
EISA Data Swap Buffers 
• 100% PCI and EISA Compatible 
• Arbitration 
for PCI Devices 
- 
PCI and EISA Master/Slave 
Interface 
- 
Supports 
Six PCI Masters 
- 
Directly 
Drives 
10 PCI Loads and 8 
- 
Fixed, Rotating, 
or a Combination 
of 
EISA Slots 
the Two 
- 
Supports 
PCI at 25 MHz to 33 MHz 


Data Buffers 
Improve 
Performance 
• PCI and EISA Address 
Decoding 
and 
• 
Mapping 
- 
Four 32-Blt PCI-to-EISA 
Posted 
Write 
- 
Positive 
Decode 
of Main Memory 
Buffers 
Areas (MEMCS# 
Generation) 
- 
Four 16-Byte 
EISA-to-PCI 
Read/Write 
- 
Four Programmable 
PCI Memory • 


Line Buffers 
Space Regions 
- 
EISA-to-PCI 
Read Prefetch 
- 
Four Programmable 
PCI I/O Space 
-:- EISA-to-PCI 
and PCI-to-EISA 
Write 
Regions 
Posting 


Data Buffer 
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on both the PCI and 
• Integrated 
16-Blt BIOS Timer 
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The 82375EB 
PCI-EISA 
Bridge 
(PCEB) provides 
the master/slave 
functions 
on both the Peripheral 
Compo- 
nent 
Interconnect 
(PCI) Local 
Bus and the EISA Bus. Functioning 
as a bridge 
between 
the PCI and EISA 
buses, the PCEB provides 
the address 
and data paths, bus controls, 
and bus protocol 
translation 
for PCI-to- 
EISA and EISA-to-PCI 
transfers. 
Extensive 
data buffering 
in both directions 
increases 
system 
performance 
by 
maximizing 
PCI and 
EISA 
Bus efficiency 
and allowing 
concurrency 
on the two 
buses. 
The 
PCEB's 
buffer 
management 
mechanism 
ensures 
data coherency. 
The PCEB integrates 
central 
bus control 
functions 
includ- 
ing a programmable 
bus arbiter for the PCI Bus and EISA data swap logic for the EISA Bus. Integrated 
system 
functions 
include 
PCI parity generation, 
system error reporting, 
and programmable 
PCI and EISA memory 
and 
I/O 
address 
space 
mapping 
and 
decoding. 
The 
PCEB also 
contains 
a BIOS Timer 
that 
can 
be used 
to 
implement 
timing loops. The PCEB is intended 
to be used with the EISA System Component 
(ESC) to provide 
an EISA I/O subsystem 
interface. 
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This Application Note covers both the Intel486TM DX 
and Intel486™ 
SX Microprocessors. Refer to the re- 
spective datasheets for detailed specifications. 


The Intel486TM CPU contains several improvements 
over its predecessor, the highly successful Inte1386™ 
CPU. One of the most important of these is the proces- 
sor's data access rate. The Intel486 CPU can access 
instructions 
and data from its on-chip cache in the 
same clock cycle. To support the processor's redesigned 
internal data path, the external bus has also been opti- 
mized and can access external memory at twice the rate 
of the Intel386 CPU. The internal cache requires rapid 
access to entire cache lines. Invalidation cycles must be 
supported to maintain consistency with external memo- 
ry. All of these functions must be supported by the 
external memory system. Without them, the full per- 
formance potential of the CPU cannot be attained. 


The requirements of todays multitasking and multipro- 
cessor operating systems also put increased demand on 
the external memory system. OS support 
functions 
such as paging and context switching can degrade refer- 
ence locality. Without efficient access to external mem- 
ory, the performance of these functions is reduced. 


Second level caching is a technique used to improve the 
memory interface. Some applications, such as multiuser 
office computers, require this feature to meet perform- 
ance goals. Single-user systems, on the other hand, may 
not warrant the extra cost. Given the variety of applica- 
tions incorporating the Intel486 CPU, memory system 
architecture will be very diverse. 


In this application note, we will work with an example 
to discuss the details of memory system design. In the 
example, we have supported as many functions of the 
CPU as possible. An optional second-level cache is in- 
cluded. A write buffer is also implemented to reduce 
write latency. The cache supports zero wait state read 
cycles. The DRAM controller supports the following 
devices with the wait states shown in Table 2. The 
DRAM speed given in Table I is the RAS access time 
(tRAe). 
Table 2 summarizes the bus clocks required 
for each function. 


CPU 
Clock 
DRAM 
Frequency 
Speed 


Intel486 OX/ 
25 MHz 
100 ns 
Intel486 SX 


Intel486 OX 
33 MHz 
70 ns 


Many of the functions and optimizations included here 
will not be required in every application. The example 
provides guidelines for the hardware designer but will 
not necessarily provide the optimal cost/performance 


solution for many applications. For example, II PLDs 
are required to implement the memory control logic 
partially due to the implementation of a back-off capa- 
bility. An address register must also be used to imple- 
ment this function. If this function is not used, the con- 
trollogic can be substantially reduced. These and other 
optimizations will be discussed in the summary. 


Table 2 


DRAM 
First 
Subsequent 
Write 
Function 
Access 
Burst 
Cycles 
Burst 
Accesses 


Page Hit 
3 
1 
2 


Page Miss 
7 
1 
5" 


NOTE: 
"Write 
miss latencies 
occur 
only during 
cycles 
subsequent 
to a write miss cycle. 


The discussion assumes a working knowledge of com- 
puter system design. Items discussed but not explained 
include DRAM operation, PLD programming and op- 
eration, worst-case timing analysis and Intel486 CPU 
bus operation. 
The complete schematics 
and PLD 
equations are in Appendix A. 


2.0 THE 485TURBOCACHE 
SECOND 
LEVEL CACHE 
MODULE 


Several different types of second level cache architec- 
tures are possible candidates for use with the Intel486 
CPU. For single CPU systems, the different architec- 
tures offer similar performance benefits in most cases. 
The reason they are so similar is the mechanism which 
improves performance. The primary benefit of the sec- 
ond level cache is bus cycle latency reduction. 


In most systems which incorporate a single Intel486 
CPU, bus traffic from other bus masters is minimal. 
With any reasonable memory system the CPU uses at 
most 50% to 70% of the bus. Therefore reduction of 
bus cycle latency is the only performance benefit exter- 
nal logic can offer. 


The second level cache used in this example is an eco- 
nomical method of reducing read cycle latency. The 
485Turbocache module contains the control circuits, 
data and tag ram required to implement a 64k or 128k 
byte second level cache for Intel486 CPU (25 MHz/ 
33 MHz). It is organized as a two way set associative 
cache. In this example, the l28k byte size cache module 
is used. 


One of the most interesting aspects of this device is it 
can be a system option. To provide this capability the 
device is configured as a look-aside cache. It monitors 
the CPU address and control signals. When a cycle 
occurs in which the cache can supply data, it inter- 
venes. The 
cache 
module 
then 
supplies 
an entire 
16-byte line with no wait states. 


• 


The performance improvement offered by this cache is 
substantial 
in some environments. This performance 
improvement is particularly 
obvious when executing 
multitasking, 
multiuser 
operating 
systems such 
as 
UNIX and OS/2. Some users, however, may not re- 
quire the performance 
improvement 
offered by the 
cache. In these cases, the cache as an option is attract- 
ive. 


By designing the cache subsystem .as an option b<;>th 
user's requirements can be met. A smgle system design 
can be manufactured for both customers. The UNIX or 
OS/2 user can add the cache module. Other users may 
or may not require the module. They can choose the 
system configuration which meets their price-perform- 
ance needs. 


When a 485Turbocache Module is connected to an In- 
tel486 processor system, the processor's internal cache 
should map the entire address space including that of 
the 485Turbocache Module devices to provide the high- 
est performance. This is the most efficient configura- 
tion. The Intel486 CPU can access a line from its inter- 
nal cache in one clock and the 485Turbocache Module 
provides the next fastest access in two clocks for the 
first doubleword and the remaining three doublewords 
in three clocks. 


The processor's address bits A2-A3l 
are connected to 
A2-A3l 
on the 485Turbocache Module. Internally, 
address bits A4-A15 are sent to both sets, to select one 
of 4096 locations. Because the cache is two-way set 
ass~iative, each address points to information stored in 
two banks. On each read or write cycle, the value of 
A16-A3l 
is compared to the tags stored at the location 
addressed by A4-AI5. 
If they are equal, and if the 
valid bit is set, then a hit occurs. If a read cycle is in 
progress, then the 485Turbocache Module returns data 
to the Intel486 CPU. If the hit cycle is a write cycle, 
then the new data is updated in the 485Turbocache 
Module. 


The BRDYO# output and the CBRDY# 
input must 
be used in forming of the Intel486 CPU's BRDY # in- 
put. Similarly, the CRDY# 
input must be used in 
forming of the Intel486 CPU's RDY # input. 


The memory system generates KEN # to the Intel486 
CPU when read data needs to be cached. The 485Tur- 
bocache Module receives this signal as the SKEN # in- 
put. The 485Turbocache 
Module's CKEN # 
output 
can be used in the formation of the KEN # input to the 
Intel486 CPU. CKEN# 
can be used in conjunction 
with other logic that can deassert KEN # to the CPU 
when the system wants the current line fill to be cached 
by the 485Turbocache Module and not cached in the 
Intel486 CPU. 
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The 485Turbocache Module connects directly to the 
Intel486 CPU's address lines A2-A31. 
The designer 
may have to add external buffers to the address out- 
puts, depending upon the loading. Other signals con- 
nected to the Intel486 CPU include the burst control 
signals, the bus cycle definition signals, the b~te ~n- 
ables, the ADS # signal, and the data and panty sig- 
nals. The 485Turbocache Module and CPU connec- 
tions are shown in Figure I. The 485Turbocache Mod- 
ule main memory controller and bus controller inter- 
face are shown in Figure 2. 


A read hit cycle occurs when requested data is present 
in the 485Turbocache Module. The Inte1486 CPU at- 
tempts to retrieve the entire line from the 485Turbo- 
cache Module without incurring wait states. This may 
be accomplished by activating the KEN # input at the 
end of T1 (the clock in which ADS# becomes active). 
There is very little time to decode the address, generate 
the KEN# 
signal to the Intel486 CPU, and complete a 
zero wait state read operation. Because KEN # is sam- 
pled twice, it is possible to always assert KEN # in T1 
and to wait until the end of a line fill to d¢ide whether 
the data is cacheable. (See Section 3.2.) 


CKEN # is used in the formation of the KEN # signal 
to the Intel486 CPU. CKEN# 
is activated in T1 (see 
Figure 3 and Figure 4). If a read hit occurs, data can be 
sent to the Intel486 CPU in zero wait states and can 
still be cached in the processor's on-chip cache. The 
485Turbocache 
Module asserts CKEN # 
which re- 
mains asserted for the duration of the read hit cycle 
(unless WPSTRP# 
is low and the line is write protect- 
ed). This means that the Intel486 CPU will cache the 
entire line unless external logic is added to cause the 
KEN # signal to be sampled high in the clock before 
the last BRDYO# from the 485Turbocache Module. 


If the CKEN # input from the 485Turbocache Module 
is connected directly to the KEN # input of the In- 
tel486 CPU, then the CPU will sample KEN# 
active 
at the end ofT!. 
To deassert KEN# 
to the processor, 
the system must create another signal that is used in the 
formation 
of the Intel486 CPU's 
KEN #, 
and the 
485Turbocache 
Module's SKEN #. 
Using this tech- 
nique, a non-cacheable memory read cycle can be per- 
formed. In addition, if the 485Turbocache module's 
write protection 
feature is to be used, KEN # 
and 
SKEN # must be generated separately since the CPU 
does not support write protection. 


The BRDY # signal to the Inte1486 CPU can be gener- 
ated from many sources. Therefore, the various signals 
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should be logically "ANDed" to generate the actual 
Intel486 BRDY # input. 


On a cache read hit, the 485Turbocache Module gener- 
ates a BRDYO# signal for each of the doublewords it 
transfers. 
The 
485Turbocache 
Module 
asserts 
BRDYO# in the first T2 cycle, and BRDYO# remains 
asserted for the duration of the burst. If the Intel486 
CPU either terminates a burst early or fails to generate 
a burst cycle as defmed by BLASTIf, the 485Turbo- 
cache Module will deassert BRDYOIf after the In- 
tel486 CPU has sampled the required data. 


The 485Turbocache Module is a write-through cache, 
so main memory is updated with every write hit or 
miss. The 485Turbocache Module does not generate a 
ready signal to the Inte1486CPU for write cycles.How- 
ever, it does perform a comparison and updates the 
cache memory when a write hit occurs (provided the 
location isn't write protected). The 485Turbocache 
Module is not updated on write misses.The timings for 
write operations are shown in Figure 3 and Figure 4. 
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Because the 485Turbocache Module is a write-through 
cache, writes are immediately forwarded to the system. 
If a processor write occurs on a valid entry that is not 
write protected, the new data will be stored into the 
cache in zero wait states. The 485Turbocache Module 
will not generate a ready signal. It is the systems's re- 
sponsibility to update the system memory on all writes 
and to terminate all cycles with a ready signal. Even 
after the 485Turbocache Module has completed its in- 
ternal write update, it remains idle until the system re- 
turns a ready to the processor. 


A cache location can be write protected by asserting the 
WP input to the 485Turbocache Module. The WP sig- 
nal must be valid during the third BRDYO# or RDY # 
of a cache line fill cycle until the end of the cycle. It sets 
a state bit for a particular cache location and remains in 
effect until the line is invalidated. Tieing WPSTRP # 
low will prohibit the write protected entry from being 
cached by the Intel486 CPU in subsequent accesses. 
The entry can be invalidated by any of the following: a 
flush operation, a reset operation, an invalidation cycle, 
or an LRU replacement. 


When an Intel486 CPU cycle produces a write hit to a 
write-protected 485Turbocache Module location, data 
in the cache is not modified. The 485Turbocache Mod- 
ule responds in the same way whether or not a write hit 
location is write protected by asserting the START # 
signal. It is the designer's responsibility to prevent in- 
consistencies between the 485Turbocache Module and 
main memory when using the WP signal. 


The 485Turbocache 
Module ignores all I/O 
cycles. 


When an I/O cycle is executed by the Intel486 proces- 
sor, the system responds and terminates the cycle. The 
485Turbocache Module does not assert the START# 
signal for I/O accesses, and the system should monitor 
the M/IO# 
signal rather than wait for the assertion of 
the START# 
signal. 


System 
Cacheability 
Indication 


The 485Turbocache 
Module uses the cache enable 
scheme of the Intel486 CPU. A cache update to the 
485Turbocache 
Module 
requires 
activating 
the 
SKEN # signal. The signal is sampled twice, first to 
determine if a line fill cycle should be run and second to 
confirm that the line fill should actually be cached. 
SKEN # is ignored during write cycles. 


Typically, the system will use the same logic to gener- 
ate the Intel486 CPU's KEN # signal and the 485Tur- 
bocache Module requires activating the SKEN # sig- 
nal. However, it is not necessary for both to be asserted 
during 
an 
access. 
It 
is 
possible 
to 
use 
differ- 


ent cacheing maps for the CPU cache arid the 485Tur- 
bocache Module cache because the Intel486 CPU and 
the 485Turbocache Module maintain their own cache 
contents via snooping. 


The improvements made to the CPU bus interface obvi- 
ously impact the memory subsystem design. It is im- 
portant to understand the impact of these features be- 
fore attempting to define the system. This section is a 
review of the bus features which affect the memory in- 
terface. The features and their impact on memory sys- 
tem design is discussed. 


The Intel486 CPU's burst bus cycle feature has more 
impact on the memory logic than any other feature. It 
is the most significant departure from previous bus ar- 
chitectures. A large portion of the control logic is dedi- 
cated to supporting this feature. The second level cache 
is also primarily dedicated to supporting burst cycles. 


To understand why the logic is designed this way. we 
must first understand the function of the burst cycle. 
Burst cycles are generated by the CPU if, and only if, 
two events occur. First, the CPU must request a cycle 
which is longer in bytes than the data bus can accomo- 
date. Second. the BRDY # signal must be activated to 
terminate the cycle. When these two events occur a 
burst cycle will take place. Note that this cycle will 
occur regardless of the state of the KEN # input. The 
KEN # input's function is discussed in the next section. 


With this definition we see that several cases are includ- 
ed as "burstable". 
Some examples of burstable cycles 


are listed in Table 3. These cycle's length is shown in 
bytes to clarify the case listed. 


Burst Bus Cycle 
Size 
(bytes) 


All Code Fetches 
16 


Descriptor 
Loads 
8 


Cacheable 
Reads 
16 


Floating Point Operand 
Loads 
8 


Bus Size 8(16) Writes 
4 (max) 


• 


The last case shows that write cycles are burstable. In 
this case a write cycle is transfe~ed on an 8 or 16 bit 
bus. If BRDY If is returned to terminate this cycle the 
CPU will generate another without activating ADS If. 


Using the burst write feature has debatable perform- 
ance benefit. Some systems may implement special 
functions which benefit from the use of burst writes. 
However, the Intel486 CPU does not write cache lines. 
Therefore, all write cycles are 4 bytes long. Also, most 
of the devices which use dynamic bus sizing are read 
only. This fact further 
reduces the utility of burst 
writes. 


Due to these facts, the design example used here does 
not implement burst write cycles. In fact, the BRDY If 
input is only asserted during main memory read cycles 
and cache hit cycles. RDY If is used to terminate all 
memory write cycles. RDY If is also used for all cycles 
which are not in the memory subsystem or are not ca- 
pable of supporting burst cycles. The RDY If input is 
used, for example, to terminate an EPROM or I/O cy- 
cle. 


The primary purpose of the KEN If input is to deter- 
mine whether a cycle is to be cached. Only read data 
and code cycles can be cached. Therefore, these cycles 
are the only cycles affected by the KEN If input. 
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Figure 5 shows a typical burst cycle. In this sequence 
the value of KEN If 
is important 
in two different 
places. First, to begin a cacheable cycle KEN If must be 
active the clock before BRDY If is returned. Second, 
KEN If is sampled the clock before BLAST If is active. 
At this time the CPU determines whether this line will 
be written to the cache. 


The state of KEN If also determines when read cycles 
can be bursted. Most read cycles are initiated as 4 byte 
long from the CPU's cache unit. When KEN If is sam- 
pled active the clock before BRDY If or RDY If is re- 
turned, the cycle is converted to a 16byte cache line fill 
by the bus unit. This way, a cycle which would not 
have been bursted can now be bursted by activating 
BRDYIf. 


Some read cycles can be bursted without activating 
KEN If. The most prevalent example of this type of 
read cycle is code fetches. All code fetches are generat- 
ed as 16-byte cycles from the CPU's cache unit. So, 
regardless of the state of KEN If, code fetches are al- 
ways burstable. In addition, several types of data read 
cycles are generated as 8-byte cycles. These cycles, 
mentioned previously, are descriptor loads and floating 
point operand loads. These cycles can also be bursted at 
any time. 


It's obvious that the use of the KEN # input affects 
performance. The design example used here illustrates 
one way to use this signal effectively. 


The primary concern when using KEN # is generating 
it in time for zero wait state read cycles. Most main 
memory cycles will be zero wait state if a second level 
cache is implemented. In this example, the main memo- 
ry is one wait state during most read cycles. Any Cache 
access will take place with zen? wait states. KEN # 
must, therefore, be valid during the first T2 of any read 
cycle. 


Once this requirement is established, a problem arises. 
Decode functions are inherently asynchronous. There- 
fore, the decoded output which generates KEN # must 
be synchronized. If not, the setup and hold times of the 
CPU will be violated and internal metastability will re- 
sult. With synchronization, the delay required to gener- 
ate KEN # will be at least three clocks. In this example 
4 clocks are required. In either case the KEN # signal 
will not be valid before BRDY # is returned for zero or 
one wait state cycles. 


This problem is resolved if KEN # is made normally 
active. Figure 6 illustrates this function. In this diagram 
KEN # is active during the first two clocks of the burst 
cycle. If this is a data read cycle, KEN # being active at 
this time causes it to be converted to a 16 byte length. 
The decode and synchronization of KEN # takes place 
during the first two T2 states of the cycle. If the cycle 
turns out to be nori-cacheable, KEN # will be deacti- 
vated in the third T2. Otherwise KEN # will be left 
active and the data retrieved will be written to the 
cache. 


Some memory devices may be slow enough that 16-byte 
cycles are undesireable. In this case more than three 
wait states will exist. The KEN # signal can be deacti- 
vated prior to returning RDY # or BRDY # if three or 
more wait states are present. As a result these slow 
cycles will not be converted to 16-byte cache line fills. 


The internal cache causes other effects which impact 
the memory subsystem design. Perhaps the most obvi- 
ous of these is the effect on bus traffic. The fact that the 
internal cache uses the write-through policy dramati- 
cally increases the number of write bus cycles. Figure 7 
illustrates this effect. The top chart shows the bus cycle 
mix for an application executed with the Inte1386DX • 
CPU. The bottom chart shows the same application 
executed with the Inte1486 CPU. The percentage of 
write bus cycles jumps to 70% from 30% when this 
application is executed with the Intel486 CPU. 


It seems intuitively obvious that many of these write 
cycles would be consecutive. In fact, 70% of all write 
cycles are consecutive. Furthermore, 
50% of all write 
cycles occur three in a row. It is obvious from these 


, 
statistics that optimizing the memory subsystem for 
write cycles can improve performance. But it is impor- 
tant to optimize the memory system for consecutive 
write cycles. Improving individual write cycle latency 
will not buy much performance if subsequent write cy- 
cles suffer. 
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A technique called write posting proves ideal for this 
purpose. This technique allows consecutive write cycles 
to be overlapped. It also allows write cycles to be over- 
lapped with second level cache cycles and reduces over- 
all write miss latency. 


Using the write posting technique adds complexity to 
the system logic. It is therefore valid to ask what per- 
formance improvement is gained by using this tech- 
nique. This question is especially pertinent when we 
consider the logic already implemented in the Intel486 
CPU to improve write performance. The internal In- 
tel486 write buffers decouple the processor execution 
unit from the external bus. 


Analysis has shown that, in general, 6% degradation in 
performance can be expected for every additional wait 
state added to write cycles. This analysis was per- 
formed by measuring the CPU clocks required to exe- 
cute several applications. 


The same analysis has shown that write posting reduces 
average write latency to 2.5 clocks. Without write post- 
ing average write latency is 4 clocks. From this data we 
can conclude that approximately 9% performance im- 
provement can be obtained by using write posting. This 
improvement may increase due to other affects. These 
affects, such as overlapping write cycles with cache 
reads, are discussed in subsequent sections. 
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Figure 8. KEN IF Logic for Second· Level Cache 


The Intel486 CPU bus interface unit integrates several 
functions 
which improve 
the memory 
access rate. 
These features must be supported by the memory sub- 
system to provide the intended performance benefit. 
They are supported by the memory subsystem example. 
The example also includes logic support for a second- 
level cache. An overview of the subsystem is presented 
in this section. Details of the function and logic design 
of this subsystem are presented in later sections. 


This subsystem follows a modular design. Only minor 
changes to particular logic sections are needed to imple- 
ment variations. For instance, the PLD which gener- 
ates the CAS"" signal needs only minor changes to sup- 
port Static Column mode DRAMs. It is also simple to 
implement a non-interleaved DRAM controller based 
on this design. 


Other 
possible 
optmllzations 
will be pointed 
out 
throughout 
the discussion. This first section summa- 
rizes the features and functions present in the design 
example presented in this section. 


Two common design techniques are employed in inter- 
facing the Intel486 CPU to DRAMs. The first, inter- 
leaving, is used to support the burst bus feature. The 
second, write posting, is used to reduce write cycle la- 
tency. Both techniques improve performance, and with- 
out them, performance is degraded by the access re- 
quirements of currently available DRAMs. 


Interleaving can be implemented in several ways. Here, 
alternate 32-bit DRAM banks are accessed.The bank 
accessed is determined by the value of A2. In this way, 
even DWORDs (A2=O) are stored in one bank while 
odd DWORDs (A2= 1) are stored in the other. When 
data is retrieved from memory during a cache line fill, 
cycles are overlapped to allow single clock DWORD 
accesses. Timing of this operation is detailed in the next 
section. 


A multiplexor alternates data flow between the DRAM 
banks and the appropriate data path is selected accord- 
ing to the value of A2. The multiplexor prevents bus 
contention. 


With write posting, bus cycles are again overlapped to 
reduce latency. Figure 9 illustrates how this technique 
is applied within the write cycle. The ROY II signal 
terminates the cycle in the clock after ADS II becomes 
active. This creates a zero-waitstate write cycle, the 
fastest possible. 


When the cycle terminates, however, data must still be 
written 
to 
memory. 
The 
delay 
allows 
additional 
DRAM access time. Figure 9 shows that data is actual- 
ly written to memory two clocks after ROY # is re- 
turned to the CPU. The CASI1 signal completes the 
write cycle four clocks after it is started by the CPU. 


Write data and address registers support the posted 
write function by holding write data and address after 
RDYI1 is returned to the CPU. These registers are re- 
quired to allow the CPU to start another cycle immedi- 
ately following the first (see Figure 9). ADS# 
is acti- 
vated in the clock after RDY II is returned to the CPU. 
This cycle starts before the first is complete, and the 
cycles overlap by two clocks. 
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In effect the write cycle completes in two clocks. Write 
cycles can be overlapped in this manner indefinitely. 
The timing and logic required to support this function 
is described in Section 5.3. 


Address registers also support invalidation with the 
AHOLD signal. They are required if AHOLD is acti- 
vated when bus cycles are in progress to hold the cur- 
rent address while the bus cycle completes. 


The efficient CPU interface and invalidation support 
make this DRAM subsystem well-suited for use with 
an optional cache. The memory system includes specif- 
ic functions designed to support the optional 485Turbo- 
cache module. The subsystem supports 256K X 4 and 
IMbyte 
X I DRAM 
configurations. The minimum 
memory configuration is 2 Mbytes with 256K X 4 de- 
vices; the maximum is 16 Mbytes with IMbyte X I 
devices. Additional banks can be added to increase the 
memory capacity. 


The control logic for this example is implemented with 
EPLDs.The modular approach allows quick modifica- 
tion so that the example can be tailored for specific 
implementation requirements. 


The control state machine is distributed among the var- 
ious EPLDs, and each functional block receives control 
input from other blocks. In addition most of the func- 
tional blocks are implemented as state machines. 


Figure 10 is a top level block diagram of the memory 
system. This diagram depicts the sections of logic that 


will be described subsequently . We will first discuss the 
address path logic. 


Unlike processors without on-chip caches, the address 
bus of the Intel486 processor is bidirectional. The ad- 
dress pins serve as inputs whenever external memory is 
changed by DMA or another CPU. The address is driv- 
en into the CPU to invalidate the corresponding cache 
entry if present. 


Invalidation of the Intel486 CPU's internal cache can 
be performed in several different ways. This example 
supports invalidation cycles during a memory access. 


As described in the previous section, AHOLD is used • 
to perform the invalidation function. AHOLD tristates 
the Intel486 address bus. Address registers must be 
used to hold the address to allow the current bus cycle 
to be completed. These registers hold the current ad- 
dress when AHOLD is activated. 


The registers shown in Figure II hold the entire row 
and column address, as well as the current byte enables 
and control definition. These signals are latched at the 
rising clock edge of the first T2 of a bus cycle. They 
must be held from this edge to allow zero wait state 
write cycles. 
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Registers with enable inputs are needed. The enable in- 
put can select the CLK edge appropriate for latching 
the address and control state. The control logic gener- 
ates the enable signal ALD which disables the CLK 
input of the registers during a bus cycle. When ALD is 
active (High) the current row and column addresses are 
held in the registers. 74AS823 registers have enable in- 
puts and are used in this example. 


An additional address register is required for posted 
write cycles. This register holds the write column ad- 
dress. The address is latched only on write cycles and is 
held until the write cycle completes at the DRAM. 


Separate write and read address paths are implemented 
with a 3 to I address multiplexor. The read address 
path is required to meet the timing of a three CLK read 
cycle. In this case the read address must propagate 
through the address mux one CLK sooner than the 
write address. If the initial read access is 4 CLKs long 
the read and write address paths can be combined. See 
section 5.1 for a complete description of read cycle tim- 
ing. The third address path is for the row address. 


A delay line is used to meet the row address DRAM 
hold time requirement(tRA,H). The RAS# signal is de- 
layed 20ns to create the DRAS# 
signal. This signal is 
used 
as the 
multiplexor 
path 
select input. 
When 
DRAS# 
is inactive (high) the multiplexor always se- 
lects the row address path. When DRAS# 
is active 


(low) the mux enable signal (MENO# 
or MENI#) 
controls whether the read path or the write path is se- 
lected. 


The comparator and register combination is connected 
to the row address path to generate the HIT # signal. 
This signal indicates that the current cycle's address is 
in the same DRAM row as that of the previous cycle 
and also determines whether RAS# 
will be deactivat- 
ed. 


In this example a standard component designed specifi- 
cally for this purpose is used. This component contains 
a register and a comparator. The register in this compo- 
nent holds the previous row address. When a bus cycle 
occurs to a new DRAM row, the new row address is 
latched. The RALE 
signal enables the row address 
latch. 


The timing of this component meets the requirements 
of a 33 MHz CPU clock. Discrete registers and com- 
parators can be used to improve the timing of the 
HIT # signal, if desired. 


The last important address logic component is the burst 
address generator. This state machine generate A3 and 
A2 during burst accesses and is needed to achieve zero 
wait state performance during burst cycles. It predicts 
the value of A2 and A3. Section 5.6 contains a com- 
plete description of the burst cycle timing. 
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Note that because interleaving is used, A3 is the lowest 
order DRAM address. Two A3 equivalent signals are 
generated. One for Bank 0 (BOAO)and one for Bank I 
(BIAO). These signals are connected directly to the 
DRAM devices to meet critical timing requirements. 
The signals must also reflect the lowest order row ad- 
dress during miss cycles. As a result A13 is, therefore, 
an input to this logic. It is the lowest order row address 
when IMBxl DRAMs are used. 


A2 must also be predicted during burst read accesses. 
For this purpose, the burst address logic creates the 
DATASEL signal. DATASEL reflects the value of A2 
for each access of a burst cycle and is used to control 
the data multiplexor as shown in Figure 12. 


During burst cycles, the data multiplexor alternates be- 
tween the bank 0 and bank I data paths. A2 must alter- 
nate states each clock for interleaving to function prop- 
erly. The Intei486 CPU's burst address sequence is de- 
fmed such that A2 changes state on every access. 


A2 also selects the bank to which data is written. Data 
path logic is not involved in steering data during writes. 
Figure 12 shows separate data registers for each bank. 
Separate registers are only required to divide the data 
paths. These registers hold the same write data on every 


write cycle. The CAS# 
and WE# 
(write enable) sig- 
nals control doubleword and byte steering. 


Because of write data timing, the data registers must 
have the enable function. This function, can be used to 
select the clock upon which data is latched. The proces- 
sor clock can be used as the register clock input to 
guarantee proper data setup and hold times. 


As Figure 12 indicates, the MRDY # signal enables the 
write data registers and terminates memory write cy- 
cles. Data is therefore latched during the last clock of 
any write cycle. 


MRDY # 
is restricted 
to 
write 
cycles 
while 
the 
MBRDY IF signal is used for read cycles. The need for 
these signals illustrates the convenience of the CPU's 
dual-ready inputs. The MBRDY # signal enables the 
output of the data path multiplexor to prevent bus con- 
tention. 


These ready signals are combined with similar system 
logic signals 
to 
form 
the 
processor 
RDY # 
and 
BRDY IF inputs. I/O, peripheral and other non-burst 
devices can use the RDY # input. Burst devices, such 
as a second level cache controller must also use the 
BRDY IF input. The MBRDY IF and MRDY # signals 
are, therefore, used only with the DRAM control logic. 
They are isolated from the rest of the system by combi- 
natorial logic. 
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Second level cache strategies for the Intel486 CPU are 
diverse and application dependent. The example de- 
scribed illustrates a second level cache strategy that is 
ideal for single CPU systems. 


The 485Turbocache second level cache used in this ex- 
ample is optional and is used to complement the In- 
tel486 internal cache to improve the performance when 
running complex applications and operating systems. 
Some users will not require the extra performance. 
Since the cache is optional, O.E.M.'s or end-users can 
decide whether it should be included. System board de- 
sign and manufacturing costs are thus eased since one 
system board supports multiple performance require- 
ments. 


The 485Turbocache 
is a completely self contained 


cache module. Optionality is accomplished by includ- 
ing control logic, tag ram and data ram in one package. 
A socket is added to the system board in much the 
same manner as a math coprocessor socket. In systems 
which, for example, run UNIX, the cache module is 
simply plugged in. 


This option must, of course, be supported by the system 
logic. Specifically, the memory control logic is directly 
interfaced to the cache module. The DRAM controller 
example described here is particularly well- suited for 
this cache configurations. 


The support included in the 485Turbocache module's 
memory control logic for the 485Turbocache module is 
illustrated in Figure 13. Since the 485Turbocache is a 
write-through cache, provision must be made for read 
cycles. When read data is found in the second level 
cache, the cycle is called a cache hit. At the time this 
cycle is determined to be a cache hit, it has already been 
started in the DRAM controller. This cycle must be 
aborted by the DRAM controller. 


The BRDYO# 
signal from the 485Turbocache module 


provides a convenient cache hit indication. This signal 
is included in the decoder function. When a cache hit 
occurs, the DRAM 
controller aborts the cycle. The 


memory chip select signal is not activated and the first 
level control logic is reset aborting the cycle. The con- 
trol logic then waits for another cycle to start. This 
function is very similar to the back-off function. 
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Like the Intel486 internal cache, the 485Turbocache 
module supports non-cacheable memory by decoding. 
The SKEN /I input is similar to the Intel486 CPU's 
KEN /I input. This function is also supported by the 
decode logic. Note that, as with the KEN /I 
signal, 
SKEN /I must be synchronized to the CPU clock. 


Separate cache enable inputs also allow areas of memo- 
ry to be noncacheable in the Intel486 CPU internal 
cache yet cachable in the second level cache. This fea- 
ture is convienient for BIOS. 


Memory control logic generates the signals that control 
the memory devices, multiplexors, and registers de- 
scribed earlier. These control signals can be generated 
in a variety of ways. This example employs a distribut- 
ed state machine. 


Since this example is a prototype, PLDs were the logi- 
cal choice for the controller implementation. Because 
the number of terms in a PLD is limited, the state ma- 
chine implementation 
must be distributed. 
Function 
distribution was determined based on this constraint. 
Figure 14shows a block diagram of the controller, with 
each block made up of one or two PLDs. 


There are two levels of logic in the controller shown in 
Figure 14.The first is made up of two PLDs, one which 
tracks bus cycles and another 
which generates the 
MRDY /I signal. The first level signals to PLDs in the 
second level that a cycle has started. The second level is 
made up of several PLDs which generate the actual 
control signals such as RAS /I and CAS /I. 


Implementing the controller in this manner has two 
important advantages. First, more decode time is al- 
lowed. The cycle start signal, CIP /I, is used by the 
second level logic to sample the decode output. CIP /I 
is valid in the first T2 of any bus cycle. As a result, 
decode does not need to be valid until the end of this T2 
bus state. Without this function, the decode output 
must be valid at the end of every TI bus state. In this 
case, the time allowed for decode at 33 MHz is very 
short. With 7-ns PLDs, the time allowed for decode 
would be 7ns. With 5-ns PLDs, this time is still only 
9ns. The advantage of the extra clock period is clear. 


The second advantage of the two level approach is simi- 
larly clear. The CIP /I signal indicates the start of a bus 
cycle to all second-level PLDs. Without 
this signal 
ADS/I 
would have to be connected to these devices, 
and the resulting load on ADS /I would be prohibitive. 
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Invalidation 
within bus cycles is another 
case that 
makes decode design difficult. The AHOLD 
signal 
must be used to implement this function. As its name 
implies, AHOLD 
can be active in any 
clock. 
If 
AHOLD is active in the first clock (TI) of a bus cycle, 
the CPU address lines are tristated in T2. Unless de- 
code is latched at the beginning of TI, it will not be 
valid for the DRAM cycle. 


The two-level approach allows decode to be a transpar- 
ent function. The decode circuit is shown in Figure IS. 
The 8SCS08 shown here includes a flow-through latch 
function. Using this function, the decode outputs can 
be latched. The DALE# 
signal is generated at the beg- 
gining of the first T2 of any bus cycle. This signal acti- 
vates the latch input of the 8SCS08. 
In this manner, 
decode is held during TI. If AHOLD is active in TI, 
the decode outputs may not be valid in T2. In this case, 
the cycle must not be started until the CPU address is 
redriven. Cycle-tracking PLD handles this function. By 
delaying the cycle start signal, the DRAM cycle is de- 
layed. When AHOLD is deasserted, the CPU redrives 
the address again. At that time, CIP # is activated and 
the cycle begins. If AHOLD 
is active in any other 
clock, the bus cycle can continue normally. 


The first level of interface with the memory subsystem, 
the cycle tracking PLD handles many other functions, 
most of which relate to synchronization. Refresh syn- 
chronization 
is one example, as is determining 
the 
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RAS # precharge duration. CIP # is not the only signal 
which supports the AHOLD function. Address regis- 
ters, controlled by the PLD, generate the ALD signal 
to disable the registers during bus cycles. These and 
other functions of the control logic are described com- 
pletely in Section S.II. 


The PLDs in the next level of logic perform more spe- 
cific functions. RAS# and CAS# are generated at this 
level, and the PLDs that generate these signals are de- 
voted solely to this function. The RAS # PLD gener- 
ates four RAS# 
signals, RASO#-RAS3#. 
These sig- 
nals are identical but drive different DRAM modules to 
reduce the load on the RAS# signal. 


The RAS # function is designed to support page or 
static column mode memory devices. To support these 
devices, RAS # must be left active between accesses to 
the same row. The RAS# state machine is designed so 
that RAS is deactivated only for a refresh or page miss 
cycle. This module generates RAS# 
for both DRAM 
banks. 


For the CAS# function, the PLD's are responsible for 
implementing burst accesses. During write cycles, the 
CAS# signals determine which DRAM bank is written 
to. All even doublewords (A2 = 0) are stored in bank 0 
while odd doublewords (A2 = 1) are stored in bank 1. 
When data is retrieved from memory, cycles can be 
overlapped to allow zero wait state burst accesses. 
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Address generation is another important consideration 
in burst accesses. The address for the last three accesses 
of a burst must be generated by logic because the CPU 
cannot generate these addresses in time to allow zero- 
wait state accesses. The burst address logic shown in 
Figure 14 is actually two PLDs which generate the 
burst address for bank 0 and bank I, respectively. The 
burst address consists of two signals-the 
lowest order 
DRAM addresses from each PLD. 


Because of timing constraints, these signals are con- 
nected directly to the DRAM devices. The burst ad- 
dress PLD must generate the burst address, provide the 
multiplexor function for row and column addresses and 
generate the write address. The burst address signals 
must, therefore, reflect the value of A13 during miss 
cycles. During burst read and write cycles, these signals 
reflect A3. 


BOOMAOand BOIMAOare the burst address signals for 
bank O. Two identical signals are used to divide load- 
ing. BIOMAO and BllMAO are the burst address sig- 
nals for bank I. A detailed description of the burst ad- 
dress function is given in Sections 5.6 and 5.16. 


The DSEL PLD main function is to generate the data 
select signal. As described above, this signal is used 
during a burst to switch the data path multiplexer. It 
reflects the value of A2 during burst read cycles only 
and is one component of the burst address. The DSEL 
PLD also generates the RALE# 
signal to control the 


row address register described above. 


BRDY # terminates all read cycles. MBRDY # is gen- 
erated by the MRDY PLD and is separated from the 
RDY # signal to facilitate posted writes by preventing 
data bus contention. When a write cycle is immediately 
followed by a read, the read cycle must be delayed. This 
delay is implemented by delaying MBRDY # until the 
previous write cycle is complete. MBRDY # is com- 
bined with other burst ready inputs using combinatorial 
logic. 


WIP# 
(write in progress) indicates to the MRDY PLD 


that a write is taking place, and MBRDY # is not gen- 
erated unless this signal is inactive. WIP # tracks the 
state of the CAS # state machines. 


The WE PLD generates WIP # and other signals asso- 
ciated with the write function. The MEN # signals con- 
trol the address multiplexors and activate the write ad- 
dress path during write cycles. The WE# 
signals are 
used to create the DRAM W inputs and to implement 
byte steering. They are combined with latched CPU 
byte enables using combinatorial 
logic. In this way, 
DRAM W inputs are not active for unselected bytes. 
Data bus contention on unselected bytes is prevented 
by controlling the write data register output enables. 


By implementing byte steering in this way the CAS # 
logic is simplified. The CAS# 
timing path is critical 


during burst read cycles, and by placing the byte steer- 
ing logic in the write enable path, CAS # timing restric- 
tions are eased. 


The MRDY # signal terminates all write cycles. The 
logic used to generate this signal is unusual because it 
uses the ADS # input and is therefore at the first level. 
This configuration is needed to implement zero wait 
state write cycles. 


MRDY # must be active by the end of the first T2 to 
terminate a write cycle and maintain zero wait-state 
performance. To meet this restriction, it must be active 
during any write cycle, or before decode is available 
because the CPU RDY # signal must not be activated 
during non-memory write cycles, MRDY # is inhibited 
by the decode output, MEMCS #, in combinatorial log- 
ic. 


In this section we will explore the function of the mem- 
ory subsystem in detail. Each of the signals will be de- 
scribed, and bus cycles will be illustrated to show the 
memory logic function. 


The bus cycle description in this section is specific to 
this example. Signals such as KEN # and RDY #, for 
example, are shown as they are driven by this particular 
control logic. The signals are not restricted to the tim- 
ing shown here., 


KEN # is an input to the proc- 
essor, 
indicating 
whether 
the 


next bus cycle is cacheable or 
not. 
This 
signal is a logical 


AND of SKEN # and CKEN # 
signals. 
PBRDY # is the burst ready in- 
put to the processor. This is a 
logical AND 
of the BRDY # 


signal from the system and the 
BRDYO# 
from the second lev- 


el cache. 
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5.2 
Data Path Control 


DATASEL 
DATASEL reflects the value of 
A2 during burst accesses. It is 
used to control the data multi- 
plexor for bank 0 and bank I 
data paths. 


MRDY # 
MRDY # enables the write data 
registers that are used to sup- 
port write posting and termi- 
nates memory write cycles. 


MBRDY # 
MBRDY # is used for read cy- 
cles and enables the output of 
the data path multiplexor. 
WEO# lWEI # 
WEO# and WEI # signals en- 
able the outputs of data write 
registers used for write posting. 
Both the signals are active dur- 
ing a write and CAS# 
deter- 
mines the correct bank to which 
the data is written. 
WBEOO#-WBE03# 
WBEOO#-WBE03# 
are 
a 
combination of write enable and 
byte enable signals. They con- 
trol which byte is written into 
bank 0 during a write cycle. 
WBElO#-WBE13# 
WBEIO#-WBE13# 
control 
which byte is written into bank 
I during write cycles. 


ALD 
ALD disables the clock input to 
the registers that hold the row 
and 
column 
addresses 
corre- 
sponding to the current bus cy- 
cle. 
MENO#,I# 
MENO#, MENU 
control sig- 
nals are inputs to the address 
multiplexors and are used in se- 
lecting the read or write paths 
to the respective banks. 
RALE# 
RALE# 
enables the row ad- 
dress latch, allowing a new row 
address to be latched for succes- 
sive bus cycles. 


DALE# 
DALE# 
activates the latch in- 
puts of the decode logic in the 
first T2 of a bus cycle and holds 
the decode during the bus cycle. 
BOOMAO/BOIMAO 
BOOMAOand BOIMAO are the 
burst address signals for bank O. 
They correspond to the value of 
A3 during burst read cycles. 
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BlOMAO/BllMAO 
BIOMAO and BllMAO are the 
burst address signals for bank I. 
They correspond to the value of 
A3 during burst read cycles. 


HIT # is active if the row ad- 
dress for the current 
memory 
cycle is the same as the previous 
memory cycle. 
WIP# 
indicates that a write cy- 
cle is in progress and a read to 
the DRAM needs to be delayed 
till WIP# 
becomes inactive. 
CIP # indicates a memory cycle 
is in progress. If the current cy- 
cle is not to DRAM, CIP# 
is 
deactivated else it remains ac- 
tive till the end of the bus cycle. 
RASO-3# 
go active for a valid 
row address. It remains active 
between accesses to the same 
row and is de-activated only for 
page miss and refresh cycles. 
DRAS# 
is the delayed RAS# 
signal to accomodate the RAS # 
hold time requirements. 
RFRQ indicates that a refresh 
of the DRAM is required. This 
signal is activated every 15.6 us. 
RFACK 
is asserted 
as a re- 
sponse to RFRQ and indicates 
that the DRAM 
controller 
is 
ready to perform the refresh cy- 
cle. It is active during idle cy- 
cles or after the current cycle is 
complete. 
PCHG determines the timing of 
refresh cycles and RAS # pre- 
charge count. 
CASO# and CASI # signals are 
active when a valid column ad- 
dress is present on the bus and 
control the bank to which the 
data is written into. 
MEMCS# 
is active when 
a 
read or a write is performed to 
the DRAM. It is the synchro- 
nized output of the address de- 
coder. 
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CT indicates that a new cycle 
had started while a cycle was in 
progress 
or the refresh cycle 
was taking place. It is de-acti- 
vated when the pending cycle is 
recognized. 
SKEN # It is an input to the 
second level cache and is similar 
to the KEN # signal input to 
the processor. 
CKEN # is the output of the 
second level cache. It is activat- 
ed twice for a valid line fill - 
first to enable a 485Turbocache 
cache line fill and the second 
time to validate it. 
LA2 and 
LA313 are latched 
versions of address lines A2 and 
A13. LA313 is the lowest order 
DRAM address line. The multi- 
plexor output reflects A3 when 
RAS# 
is low and A13 when 
RAS # is high. 


M # indicates the occurrance of 
a write miss. 
BRDYO# 
is a burst ready sig- 


nal driven by the second level 
cache. It is activated when a 
read hit occurs in this cache. 


Timing Diagram 16 shows a burst read cycle. At the 
start of the bus cycle, RAS # is inactive. This case is a 
rare occurence because RAS# 
is normally active. Un- 


less a cycle is the first bus cycle after a reset or refresh 
cycle, RAS# will be active in TI. 


It is useful to examine this case because it demonstrates 
a complete DRAM cycle. The basic function of most of 
the control logic is illustrated. 


The cycle begins with the activation of ADS #. The 
controller samples this signal and activates both ALD 
and CIP #. The CPU address registers are disabled by 
ALD. Therefore, the previously latched address is held 
throughout the bus cycle. The latched address is valid 
in the first T2 of the bus cycle. 
• 


The row address comparison is made with this address. 
As a result, the HIT # signal is not valid until the rising 
edge of the second T2. At this rising clock edge, the 
CIP#, 
MEMCS# 
and HIT# 
signals are sampled. If 
MEMCS# 
is sampled active, the RAS# signal is acti- 
vated. 


The delay line holds the DRAS# 
signal high for 20 ns 
after RAS# is activated. In this way the row address is 
maintained to meet tRAH, the row address hold time. 
When DRAS# 
is activated, the address multiplexers 
switch to the column address path. The MEN # signals 
are not active, and the read path is selected. 


In the third T2 of the bus cycle CAS # is asserted. This 
cycle begins with A2 low and the first access is to bank 
O.Due to the access time of the DRAM two clocks are 
required to retrieve data from memory. MBRDY# 
is 
asserted in the fourth T2 of the bus cycle, and this 
action completes the first access of the burst read. The 
access is completed in five clocks. The minimum time 
for this access is two clocks indicating that three wait- 
states were added to the first cycle. 


The timing diagram reveals two important points about 
burst cycle implementation. 
First DRAM 
access re- 
quires two clocks. Second, the burst address from the 
CPU is not available until the clock after MBRDY # is 
sampled active. These circumstances make implement- 
ing zero-wait-state burst cycles difficult. The DRAM 
bank interleaving alleviates this difficulty. 


The first advantage of interleaving is revealed in the 
second and third T2 states. Access to both the first and 
second memory doublewords can be made simulta- 
neously. This function requires that the burst address 
be predicted. As mentioned above, the burst address 
from the CPU is not available until several clocks later. 
The burst address for both the first and second accesses 
is generated in the second T2. Therefore, CAS# 
for 


both banks can be asserted in the next T2 state. 


The second advantage of interleaving is seen in fifth T2 
of the burst cycles in which DATASEL switches the 
data multiplexer. The second doubleword is driven on 
the CPU data bus. In this CLK, the burst address for 
the third access of the cycle is generated. CASOO# and 
CASOI# are also deasserted to begin the third access. 
Note that this access is started before the second access 
is completed. The cycle overlap shown allows new data 
to be driven on the CPU data bus every clock. This way 
zero-wait-state access is achieved. 


Timing is even more critical during page hit cycles. Fig. 
17 shows the timing of this cycle. Because of the func- 
tion of RAS#, 
this cycle is more common than the 
cycle discussed above. The row address is the same as 
in the previous cycle. Therefore, the RAS # signal is 
left active. 
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When a burst' read starts with RAS# 
active, fewer 
clocks are required to complete the first access. This 
reduction improves performance. As a result, however, - 
some timings become more critical. One of these is the 
time allowed to generate the burst address. 


The CAS # signals are asserted in the second T2 of the 
bus cycle. MBRDY # is also asserted at this time. To 
meet the address access time of the DRAMS, the burst 
address must be generated in the second T2. The rest of 
the read column address must also be available at this 
time. Two logic functions are needed to meet this tim- 
ing requirement. First, read and write address paths 
must be separate to allow the read address to be avail- 
able in the first T2. Second, the burst address path logic 
must latch the CPU A3 signal directly. In this way, the 
logic can generate the necessary address in time. The 
burst address state machine must track the state of A3 
at the begining of every cycle. The state machine func- 
tion is described in Section 5.11. 


The timing of KEN # must also be considered in this 
example. KEN # must be valid at the beginning of the 
second T2 of the cycle. If it is not,-the cycle will not be 
cached, and a 16-byte access cannot be generated. If 
KEN# 
is active, a 16-byte burst access will be generat- 
ed, and the cycle will be cached as long as KEN # is 
active in the second to last T2. 


At first glance this timing may not appear critical. 
KEN # is a decode function, and decode is valid at the 
clock edge called for. The KEN # input to the CPU 
must be synchronized 
to the CPU clock, however. 
Since decode is not synchronous, a two-clock synchro- 
nizer delay is required, and this delay is the reason that 
KEN # is normally active in this example. 


From the time CAS # is activated, this cycle is exactly 
the same as in the previously described burst cycle. It is 
terminated when BLAST# is asserted, and MBRDY# 
is deasserted when BLAST # is sampled active. 


As described in Section 4.1, a posted or delayed write 2 
function is employed in this example to reduce write 
cycle latency. Latency is reduced since write cyles are 
overlapped with other cycles including other Write cy- 
cles or reads from the second level cache. Write cycles 
normally make up 70 percent of all cycles, and overlap- 
ping can increase performance accordingly. 


Figure 18 illustrates the posted write implementation. 
In this example cycles begin when RAS# 
is inactive. 


As with read cycles, this case is rare in practice. 
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The cycle begins like a read. The CPU drives ADS# 
active, and the decode is sampled. RAS# is activated if 
the cycle is in DRAM space. In the second T2 of the 
cycle, however, the latched version of WIR # (LW/ 
R #) is sampled active at the rising edge of the second 
T2. In response, the control logic begins several write 
cycle functions at this clock edge. 


The CAS# state machine for the appropriate bank en- 
ters the write sequence. The MEN# and WE# signals 
are asserted. MRDY # is also asserted, terminating the 
cycle at the CPU. The MEN# 
signals activate the 
write address path. This address is not present at the 
multiplexor outputs, however, until the next clock at 
which the write pipeline register latches the write ad- 
dress. 


The write data is latched at the same clock edge. The 
write data registers are enabled by MRDY # which 
simultaneously terminates the CPU cycle. Note that 
data is latched in both the bank 0 and bank I registers. 


The WEQ# and WEI # signals are also both active. 
The CAS# signals determine which bank is written to. 
These signals are asserted within two clocks after 
MRDY #. This action completes the write cycle. Note 
that, while five clocks are required clocks are required 
to complete the cycle, the CPU cycle is terminated in 
three CLKs. The wait state is only required if RAS# is 
inactive at the start of the cycle. 
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In Figure 18the next bus cycle starts immediately after 
RDY# 
is sampled. In this case, CAS# is activated 
during the second clock of the next bus cycle. This 
overlap of cycles is similar to the pipelining feature 
used by many processors except that the Intel486 proc- 
essor bus is not involved in the posting function. All 
logic for this function is implemented in the memory 
controller. 


Figure 19 is a more typical Intel486 processor bus se- 
quence which clearly illustrates the advantages of the 
posting technique. Four write cycles have occurred to- 
gether without idle bus clocks occurring between cy- 
cles. Since all writes access the same DRAM row, 
RAS# is active throughout the sequence. 


Without the extra clock to activate RAS#, MRDY# 
can be asserted in the clock after ADS# is asserted. 
These cycles, therefore, have no wait-states. As before, 
the write cycle is not complete when MRDY # is as- 
serted but instead when CAS# is asserted two clocks 
after MRDY # to terminate the CPU bus cycle. 


At zero wait-states, each write cycle still requires four 
clock cycles. The last two clocks of each write cycle 
overlap with the next cycle. The net effect on the CPU 
bus is the same as a string of two-clock write cycles, as 
illustrated in Figure 19. 


The first write in this figure is to bank O.The fa11ing 
edge of CASO# clocks the data into the bank 0 
DRAM. This edge is denoted by WI in the diagram. 
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CASO# is asserted in the same clock that MRDY# 
terminates the second write (W2), which accesses bank 
1. CASl # is activated in the same clock as MRDY # 
for the third write (W3). 


The second and third writes happen to be to the same 
DRAM bank. As we see, no timing modification is re- 
quired in this case. Write cycles can be completed with 
zero wait states in either case. This is important since 
writes often occur in sequence on the Intel486 bus, but 
not necessarily to sequential addresses. Write posting 
supports zero wait-state write cycles to sequential and 
non-sequential addresses. 


This fact is also important if the design is to be modi- 
fied. For example while, interleaved DRAMs may not 
be required in systems with a permanent second level- 
cache, the write posting technique may still be used in 
the system. The benefits of this technique still apply 
since write cycles may still be overlapped as described. 
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The DRAM control logic is optimized for write cycles, 
as warranted by the Intel486 processor's bus character- 
istics. Over 70 percent of all cycles are writes. By em- 
ploying the posted write technique, system performance 
is increased. 


The posted write technique poses some special prob- 
lems, however. Page miss, refresh and consecutive 
write-read cycles require special consideration. We will 
begin by discussing the consecutive write-read case. 
Page miss and refresh cycles will be discussed in sec- 
tions 5.9 and 5.10. 


When a read cycle immediately follows a write, the 
read cycle must be delayed as illustrated in Figure 20. • 
The read cycle is delayed to allow the write to com- 
plete. Only read cycles to DRAM, i.e. (cache misses) 
need be delayed. Cache hits and write cycles overlap 
easily because the cache is on the CPU side of the 
DRAM controller. 
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Write cycles cannot overlap DRAM read cycles, how- 
ever, primarily because of data bus contention. The 
DRAMs used here have common data I/O pins. In this 
case read and write data paths cannot be active at the 
same time. 


To prevent data bus contention, the first data access of 
the read is delayed. In Figure 20 the first read access is 
to the same bank as the write. In addition, the read 
cycle accesses the same DRAM row. Two functions are 
required to ensure that the write is completed. First, the 
write address must be held until CAS # is asserted. Sec- 
ond, the data mux outputs must not be enabled until 
the CPU tristates the bus. 


The first function is accomplished by the MEN # sig- 
nals. The MEN'# state machine tracks the CAS # func- 
tion for the appropriate bank. When the write for that 
bank is complete, MEN # is deactivated. In this way, 
the read address path is not enabled until the CLK 
after CAS# 
becomes active. Normally, the read ad- 
dress would be valid in the first T2 of the read cycle; 
however it must be delayed one clock to allow the write 
to complete. Note that if one or more idle CLKs inter- 
venes between these cycles, no delay occurs. 


The second function is accomplished with the WIP # 
signal which is active until all write cycles are com- 
plete. A read cycle to either bank will be delayed if it 
immediately follows a write. The first access of the read 
is delayed by MBRDY #, which is not asserted until 
the WIP # signal is deasserted. WIP # is deasserted 
once all pending writes are complete. 


As described previously, page miss cycles occur when 
the CPU generates a cycle which changes the DRAM 
row address. The RAS # signal must be deasserted to 
change the ROW address in the DRAMS. Any time 
RAS # is deasserted, it must remain high for the pre- 
charge time (tRP). A delay is added to every page miss 
cycle to satisfy this requirement. 


For read cycles this function simply requires extra wait 
states as illustrated in Figure 21. 


The bus cycle starts with RAS # low or active. The row 
address generated by the CPU is different than in the 
previous cycle, and the row address comparator deas- 
serts HIT #. This signal is valid in the first TI. HIT # 
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is sampled by the RAS /I PLD at the rising edge of the 
second TI. In response, RAS /I is immediately deassert- 
ed and held inactive for two clocks. This time satisfies 
the RAS /I precharge requirement. 


Four wait states are added to process the miss cycle. 
These clocks are added to every read cycle which ac- 
cesses a new DRAM row. The delay is accomplished, 
again, with the MBRDY /I signal. MBRDY /I will not 
be asserted when RAS/I 
is inactive. Once RAS/I 
is 
sampled active, MBRDY /I is asserted. From here, the 
cycle proceeds as described in section 5.6. 


Write miss cycles are more complex than read miss 
cycles, due mainly to the write posting technique. The 
added complexity results in lower latency than in a 
non-posted memory system, however. Figure 22 illus- 
trates how this improvement is achieved. 


The write cycle in Figure 22 also begins with RAS/I 
active. The HIT /I signal is deasserted in the first T2 at 
the same time that MRDY/I 
is asserted. MRDY/I 
could be inhibited at this point to prevent write cycle 


termination. The wait-states added to meet RAS /I pre- 
charge time would then be added to this cycle. Five 
wait states are required to meet the precharge time. 


The average number of write cycle clocks can be re- 
duced, however, if another method is used. MRDY /I 
can be allowed to terminate the cycle. In this case, any 
necessary wait-states will be added to the next cycle. 


This method improves the average in two ways. First, 
some write miss cycles will not require wait-states. This 
is the case when the next cycle occurs four or more 
clocks after a write miss. In addition, wait states will be 
reduced when the next cycle occurs in two or three 
clocks. Second, three wait-states are required to com- 
plete the next cycle when it follows immediately as il- 
lustrated in Figure 22. 


The first cycle in this figure is a page miss. It is termi- 
nated at the CPU without wait-states. Because HIT /I is 
not active in the first T2, RAS /I is deasserted. At this 
point, additional clocks are added to perform the miss 
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function. Part of the time required for RAS # pre- 
charge is overlapped with the next cycle. The two clock 
overlap reduces the number of wait-states required in 
the next cycle. Therefore, the average write cycle laten- 
cy is reduced. 


The CAS # before RAS # refresh function is used in 
this example. This function uses internal counters in 
the DRAM 
devices to generate the refresh address. 
When the CAS # input is activated prior to RAS #, the 
internal counter 
is incremented. 
The output 
of the 
counter is then used as the address of the row to be 
refreshed. 


Each refresh cycle refreshes one row of the DRAM 
array. The refresh cycles are distributed such that one 
occurs every 15.6 ,""S, with every row being refreshed in 
8 ms. Refresh cycles are initiated by the RFRQ signal. 
This signal is activated every 15.6 ,""S by a counter. 


RFACK is asserted in response to RFRQ. This signal 
indicates that the DRAM controller is ready to per- 
form the refresh cycle. It also signals the counter circuit 
that RFRQ can be deasserted. 


The function of RFRQ and RFACK is very similar to 
that of the CPU's HOLD and HLDA signals. RFRQ is 
sampled at the end of each cycle and during idle cycles. 
RFACK is activated in the clock after RFRQ is sam- 
pled, except immediately after write cycles. 


Again, the posted write function must complete before 
the refresh cycle begins. If WIP # is active when RFRQ 
is sampled, RFACK will not be immediately asserted. 
RFACK will be asserted after WIP# 
is deactivated as 
illustrated in Figure 23. 


Another cycle can start between RFRQ and RFACK. 
The cycle start PLD tracks this case. CIP# 
will not be 
asserted for any cycle that starts during this interval. 
Once the refresh cycle is complete, this cycle can be 
started. 
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The functions described in the previous section are gen- 
erated by the control logic. The controller, as outlined 
in Section 4.0, is made up of several PLDs. These devic- 
es generate the control signals described in Section 5.0. 
The function of the logic is determined by the state 
machine defmition. These state machines are distribut- 
ed in the different PLDs of the controller. 


In this section, we will explore the implementation of 
the control logic. The discussion will focus on the state 
machine definition. Certain conventions are followed 
throughout the discussion. These conventions are based 
on the state machine compiler used to generate the 
PLD equations. This compiler uses the exclamation 
point (!) to indicate the low or "0" condition of a signal. 
It uses the number symbols (#) 
to indicate that the 
signal is active low. For example, !ADS# indicates that 
the ADS signal is both low and active. The # symbol 
indicates that a signal is active when low. So symbol 
!ALD means that the ALD signal is not active. These 
symbols are used to indicate state transitions as shown 
in Figure 24. The state transition in Figure 24 depends 
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lIlRDY*+ 
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RESET 


on three signals: ADS#, 
ALD, and RAS#. The equa- 
tion indicates that if both ADS# 
and ALD are active 
or if RAS# 
is not active at the next clock edge. the 
transition from SOto SI takes place. In the transition 
between SO and Sl, the Y# 
signal is activated. The 
definition of states indicates which outputs are changed 
in the transition. These conventions are used to de- 
scribe the control state machines in the next section. 


The cycle tracking logic is contained in one PLD. The 
five state machines implemented in this PLD start and 
end DRAM cycles, control refresh timing and control 
the address registers. These state machines, along with 
the MRDY # state machine comprise the first level of 
control logic. All other control state machines depend 
• 
on this first level to generate signals at the proper time. 


CIP # - Cycle in Progress 
ALD - Address Latch Disable 
CT - Cycle Track 
RFACK - Refresh Acknowledge 
PCHG - RAS Precharge Count 
M # - Write Miss Indicator 


The primary cycle tracking state machine is shown in 
Figure 25. This state machine generates the CIP# 
and 
M # signals. CIP # indicates that the CPU has started a 
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cycle. When it is active, the rest of the logic samples the 
CPU control and MEMCS# 
signals. If the current cy- 
cle is not to DRAM, it will be ignored and CIP# 
will 


be deactivated. 


This Junction is defined by the SOand Sl states in Fig- 
ure 25. As shown, CIP# 
is activated when either 
ADS# or cr are sampled active. If the cycle is not to a 
DRAM address, the MEMCS# 
signal will not be ac- 


tive in the next clock. In this case, CIP # is deactivated 
to wait for the next ADS#. 
If the cycle is to DRAM, 


CIP # stays active until the end of the bus cycle. The 
bus cycle is terminated by one of three circumstances. 
All write cycles are terminated with the MRDY # sig- 
nal. Read cycles are terminated by BRDY# 
and by 
BLAST #. The cycle can be aborted by BOFF #. Any 
of these three events causes CIP # to be deactivated (S1 
to SO). 


Two special cases are also handled by this state ma- 
chine. When AHOLD is active in the same clock as 
ADS#, 
MEMCS# 
is not valid. In this case, the CIP# 


signal is not activated until AHOLD is deasserted. The 
state machine remains in SOwhen AHOLD is active. 


The second case is a write miss cycle. During a write 
miss, CIP # must be active for the cycle to complete. 
CIP# 
is active in this case after MRDY# 
is returned 


to the CPU. Cycles that start during the time CIP# 
is 


active must be tracked by the cr state machine. The 
M # signal indicates to the cr state machine that the 
cycles must be tracked. 


The state in which M # and CIP # are both active is S2. 
This state is entered when MRDY# 
and RAS# 
are 


active and HIT# 
is inactive. By using MRDY# 
to 


qualify this transition, S2 is entered only during write 
cycles. Therefore, M # is only activated during write 
miss cycles. Note that any cycle will be recognized by 
the CT state machine when M # is active. 
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The cr state machine is shown in Figure 26. This state 
machine tracks cycles that start while the CIP # state 
machine is busy. It tracks CPU cycles that start during 
refresh cycles as well as the two cases mentioned above. 


This state machine tracks one cycle. Any cycle that 
starts while CIP# 
is busy is not terminated immediate- 


ly. The MRDY# 
and MBRDY# 
signals are delayed 


until the previous cycle is finished. Therefore, anytime 
cr is active, there is only one cycle pending. 


cr is deactivated when the pending cycle is recognized 
by the CIP # state machine. This event is indicated by 
CIP # active and M # inactive. When this event occurs, 
the cr state machine transitions to SOdeactivating cr. 


The ALD signal is also active only during DRAM cy- 
cles. Therefore, its state machine is very similar to that 
of CIP #. 
As with CIP #, 
ALD 
is asserted when 


ADS # 
is sampled active. If the cycle is not to a 


DRAM address, ALD is deasserted. When a DRAM 
cycle is terminated, ALD is also deasserted. The SO-to- 
S1 transition is quite similar to that of CIP # . 


The difference between the two state machines is re- 
vealed during write miss cycles. The SI-to-S2 transition 
is made if a write miss occurs. ALD must be held active 
during a write miss until RAS # is active. In this way 
the row address is held even if another cycle occurs. 
The combination of CIP # being active while PCHG is 
inactive indicates that RAS # will be active in this 
clock. ALD must be deactivated in this clock to allow 
the next address to be latched. ALD is re-activated if 


RAS Precharge 
and Refresh 
Counter 


RESET 
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another cycle has started during the write miss process. 
CIP*, and MEMCS*, are sampled during SOfor this 
purpose. 


The PCHG state machine provides two functions. It 
determines the time RAS*, is inactive during a miss or 
refresh cycle, and it determines the timing of refresh 
cycles. Figure 27 shows the state transitions of the 
PCHG state machine. Because the timing of this signal 
is not obvious, Figure 28 has been included. It shows a 
refresh cycle which occurs following a write cycle. 


After RAS*' is active the PCHG signal is activated. 
State Sl is maintained then until RAS*' is deactivated. 
RAS *' is only deactivated during a miss or refresh cy- 
cle or, of course, if RESET is asserted. During a miss 
cycle the transition to SOis made deactivating PCHG. 
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RAS*, is then deactivated, resulting in two CPU clocks 
of RAS *' precharge time. 


States S1and S2 define the timing of refresh cycles. The 
transition to this sequence is made when RAS*, is sam- 
pled inactive while EP is active. EP indicates that the 
RAS *' state machine has entered the refresh sequence. 


RFACK initiates the refresh sequence. It indicates that 
the control logic is ready to accept a refresh request. 
The RFRQ signal is sampled at the end of a DRAM 
cycle or during idle clocks. Note that RFRQ cannot be 
recognized during a write miss. 


RFACK 
is deactivated after RAS*, is deactivated at 
the beginning of the refresh sequence (See Figure 27 
and Figure 28). 
• 


0.'" 
n",~ 'IF LOgIC 


The RASiII logic for both memory banks occupies one 
PLD. 
Four RASiII signals are generated: RASOill- 
RAS3 ill. These signals are generated to divide loading. 
Their timing is identical. The state machine for RAS is 
relatively simple and is shown in Figure 29. 


States SI and S2 are used to implement RASiII function 
for normal cycles. After RESET, the state machine 
waits for the first bus cycle. The first bus cycle is sig- 
naled by the CIPiII signal. When CIPiII, MEMCSiII 
and PCHG 
are sampled active, RASiII is asserted. 
RASiII stays active until a miss or refresh cycle occurs. 


A miss cycle is indicated when the HIT ill signal is driv- 
en inactive. It is qualified by CIPiII and MEMCSiII 
being active. In this way, RASiII is only deactivated 
during DRAM cycles. 


vnce KA~'" 
ISaeassertea aunng a miss cycle, It stays 
high until PCHG is sampled active. This function im- 
plements 
the 
RAS ill precharge 
time. 
CIP ill and 
MEMCSiII will still be active during read miss cycles. 
Therefore, RAS ill will be asserted in the next clock. 
For write miss cycles the WIP ill signal must be used to 
restart RASiII. With a write miss, a non-DRAM cycle 
can occur before RAS ill is asserted. WIP ill is the only 
valid indication that a DRAM cycle has occurred in 
this case. WIPiII is combimid with MEMCSiII to create 
the CSWIP ill term which indicates a valid RAS ill cy- 
cle. 


When a refresh cycle occurs, the RAS ill state machine 
transitions to S3. S3 and S4 are devoted to the refresh 
function. When RFACK 
is sampled active, the tran- 
sition occurs. The refresh sequence shown in Figure 28 
illustrates the function of these two states. Note that 
after a refresh cycle, RAS ill is left inactive. The tran- 
sition from SI to S4 allows for refresh cycles that start 
when RASiII is inactive. 


RAS 
EP 
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Two separate PLDs implement the CAS;II function. 
These PLDs generate the CAS;II signals for bank 0 and 
bank I, respectively. The state machines which gener- 
ate these signals are separate and independent. Each 
generates two CAS;II signals. CASOO;lland CASOI;ll 
for bank 0, and CASIO;ll and CASll;ll 
for bank!. 


These signals drive separate DRAM modules due to 
drive requirements. 


Figure 30 shows the state diagram for the bank 0 
CAS;II function. The states on the left side of the dia- 
gram implement the write function. The states on the 
right implement the read function. As with RAS;II, the 
state machine waits until CIP;II indicates that a cycle 
has started. When CIP;II is active, the state of the 
latched version of WIR;II determines which sequence is 
started. 


CASO 
C1 
C2 
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If the cycle is a read, S4 is entered. If the cycle is a 
write, LA2 is sampled to determine if the cycle is to 
bank O. If LA2 is low, SI is entered. Note that this 
function is the same for the bank I state machine. The 
only difference is the state of LA2, which starts the 
write sequence. 


During a write cycle, CAS# is held inactive until the 
clock after ROY # is asserted. The state machine also 
waits in SI during a write miss cycle. CAS# is asserted 
during S2. In this state, several events can occur. First, 
the CPU may not start another bus cycle. Second, it 
may start a bus cycle other than a DRAM cycle. Third, 
it may initiate a read cycle, and fourth, it may begin a 
write cycle to bank 1. If any of these events occur, SOis 
entered. If another write cycle starts to the same bank, 
however, S3 is entered. 


The case of sequential writes to the same bank involves 
S2 and S3 only. An unlimited number of write cycles 
can occur in the same bank. If the DRAM row is same, 
they will occur without wait-states. If a write miss oc- 
curs, RAS # will be deasserted, and the transition from 
S3 to SI takes place. 


During read cycles, the CAS # signals for bank 0 and 
bank 1 are activated at the same time. Therefore, the 
state machines enter S4 at the same clock. At this 
point, however, the state of LA2 determines which 
state machine enters S5. In S5, CAS# is deasserted to 
prepare that bank for the next access. If S6 is entered, 
the data from that bank has not yet been accessed. 
CAS # must be held active, in this case, until the data is 
sampled by the CPU. From S6, the next transition will 
be to S5 to continue the cycle, or SOto terminate the 
cycle. If this bank was accessed first, the cycle will ter- 
minate from this state. 


The read sequence is much simpler if static column 
mode DRAMs are used. The state sequence for static 
column mode is shown in Figure 31. The write se- 
quence in this diagram is exactly the same as for the 
page mode CAS# 
control logic. The read function, 


however, requires only two states. From SO,the tran- 
sition is made to S4 any time that a DRAM read cycle 
starts. Note that LA2 is not used to qualify this tran- 
sition. Therefore, the CAS# 
signals for bank 0 and 


bank I are active at the same time. 
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The posted write implementation requires logic support 
for a few key functions. These functions are required 
mainly to support posting with interleaved memory. 
Three types of signals are generated to implement these 
functions: 


Multiplexer Select - These signals control the address 
multiplexers when RAS# 
is active. During write cy- 
cles, they must be active to select the write address 
path. These signals stay active during read cycles which 
are immediately preceded by a write. They are deacti- 
vated, when the write cycle is complete. Once they are 
deactivated the read cycle may proceed as the read path 
is selected. 


Write Enable - These signals are combined with the 
byte enable CPU outputs (BEO# - BE3#) to create the 
WBE# 
signals. The WBEOO#-WBE03# 
signals con- 
trol which byte is written in bank 0 during a write cy- 
cle. The WBElO#-WBE13# 
signals perform the same 
function for bank I. 


Write In Progress - This signal is active when a write 
cycle has been started by either DRAM bank. It is ac- 
tive when either COl # or CII # is active. COl # and 
C11# are state outputs from the CAS # state machine 
which indicates that a write cycle is being performed. 
COI# is generated for bank 0 and C11# for bank I. 
WIP# 
is only required for interleaved memory sys- 
tems. The COI # (or C11#) output would be sufficient 
for a non-interleaved (single bank) system. 


The state machines which generate these signals are 
shown in figure 32. The state diagram for the MENO# 
signal is shown. This signal enables the address multi- 
plexer for bank O. MENO# 
is activated whenever a 
write cycle occurs to an address with A2 low (0). The 
MEN I # function is the same except that it is activated 
when A2 is high (I). The CIP#, 
MEMCS# 
and 
LWR # signals are used to indicate a valid write cycle. 


The MEN # signals are deactivated when the write cy- 
cle is complete. The cycle is complete when CAS # for 
that bank is sampled active. For bank 0, COI# is used 
to indicate that a write is in progress. MENO# is held 
active when COI# is active. When CASOO# is sampled 
active, CIP# 
is checked to determine if another valid 
write to the same bank has occured. If so, MENO# 
stays active until CASOO# is sampled active. This func- 
tion keeps the write address path open during consecu- 
tive writes to the same bank. 


The WE# 
state machine is very similar to that of the 
MEN # 
state machine. 
When a write cycle starts, 
WEO# is activated in the same manner as MENO#. 


The write enable signals, however, must stay active one 
clock longer than the MEN # signals. Therefore, the 
WE# 
signal is not deactivated until COI# is sampled 
inactive. 


WIP # is generated in part by combinatorial logic so 
that it can be active in the same clock as the COI# and 
C11# 
signals. WIP# 
must be active in this clock to 
ensure that a write miss is completed before a refresh 
cycle takes place. WIP# 
must also be held active one 
clock after COI# and C02# are sampled inactive. This 
timing ensures the proper sequence for subsequent read 
cycles. The logic equation and state machine for WIP # 
are shown in Figure 32. 


The burst address logic generates the BIMAO and 
BOMAOsignals. These signals are connected directly to 
the low order address inputs of the DRAMs. Because 
of the direct connection, these signals must perform 
several different functions. They must multiplex the 
low order row and column addresses, multiplex the 
write and read addresses and generate the burst address 
during read cycles. 


These functions are performed separately for each bank 
by two PLDs. Each PLD generates two identical sig- 
nals to reduce the drive requirements. These signals are 
connected directly to two bytes of the DRAM array. 
The signals are generated partly by combinatorial logic 
and partly by the state machine. 


The logic equations and state diagram for this function 
are shown in Figure 33. The state machine generates 
the burst address for read cycles. The logic equations 
handle the multiplexing functions. 


The burst address is generated after a burst read cycle 
has started. Note that the Intel486 CPU cache need not 
be enabled for burst cycles to occur. Cycles such as 
64-bit floating-point operand reads will burst if BRDY 
is returned to the processor. SOand S3 track the state of 
the A3 CPU address output. When a burst read cycle 


, starts, S1 or S2 is entered. The BOMAOaddress output 
will then change its state when MBRDY# 
and DATA- 
SEL are both low. This function is the burst address for 
bank O. The BIMAO address output changes its state 
when MBRDY# 
is low and DATASEL is high. This 
function is the burst address for bank!. The only differ- 
ence in the two PLDs is the value of DATASEL used 
to determine the time of which the burst address chang- 
es its state. 


The SOand S3 states are required only to ensure that 
the burst address outputs are valid during the T2 of any 
read cycle. Figure 17 shows the timing of a burst read 
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hit cycle. In the first access of this cycle, the burst ad- 
dress must be valid in the first T2 to satisfy the address 
access time requirements of the DRAM. The value of 
A3 is sampled with ALD to statisfy this requirement. 
In this way, the burst address state machine always 
starts from the correct value of A3. If another wait 
state is added to this access, this function is not re- 
quired. 


The logic equations which provide the multiplexor 
function are very simple. The first term of the equations 
shown in Figure 33 enables the write path. The write 
enable signals are used to enable this path. When WEO 
is active, for example, the value of the multiplexor out- 
put is passed through to the DRAM. The second term 
allows the row address A13 to be passed to the DRAM 
during a read page miss. This term is also qualified by 
the write enable signals. In this way, the write address 
is not disabled early during a read miss. The third term 
enables the burst address output from the state machine 
onto the address pins. 


We have discussed an example memory subsystem for 
the Intel486 CPU. The material has been presented as a 
design guide for systems under development or as an 
optimization for existing systems. We have discussed 
several key functions which will be summarized in this 
section. We will also discuss some important timing 
restrictions. The key functions discussed include an ex- 
ternal 'or second level cache, posted write cycles, and 
interleaved DRAM banks. 


The interleaving technique is used to support the burst 
bus feature of the Intel486 CPU. The use of this tech- 
nique allows the DRAM to supply a DWORD every 
clock during burst cycles. Interleaving proves to be 
very useful in Intel486 CPU memory designs. Without 
its use DRAM timings such as tPC (page Mode Cycle 
time) and tCP (CAS Precharge time) would prevent 
zero wait state access at 33 MHz. 


Data registers are also used to improve average write 
cycle latency. These registers hold write data during 
posted write cycles. Write posting can improve average 
write latency to under 3 clocks for many applications. 
This improvement is important in Intel486 CPU based 
systems because 65% to 70% of all bus cycles are 
writes. Without using a latency improvement technique 
such as write posting average write latency will be 
above 5 clocks. 


The write posting technique also improves memory per- 
formance 
in other 
ways. Write cycles, particularly 


DRAM page misses, can be overlapped with read hit 
cycles in the second level cache. This fact greatly reduc- 
es the delay caused by read cycles which immediatly 
follow write cycles. 


Analysis of this memory subsystem design has shown 
that use of these features has resulted in a low latency 
response to the CPU. Over several important applica- 
tions the following characteristics have been recorded. 
The average clock cycles required to complete the first 
read is 3.5 clocks. Subsequent cycles of a burst are al- 
ways processed in one clock. Write cycles average 2.5 
clocks. These average counts result from the following 
DRAM access rates. Read accesses from the cache al- 
ways occur in zero wait states. 


DRAM 
First 
Subsequent 
Write 
Function 
Access 
Burst 
Cycles 
Burst 
Accesses 


Page Hit 
3 
1 
2 


Page Miss 
7 
1 
5" 


NOTE: 
·Writemiss latencies occur only duringcycles subsequent 
to a writemiss cycle. 


A few DRAM timing restrictions must be mentioned. 
These timings become critical at 33 MHz. These tim- 
ings are critical due primarily to the latency of the first 
cycle of a read page hit. Since three clocks are used the 
following timing restrictions exist. 


tRP = RAS # precharge time 


At 33 MHz 


tRAC = 71.5 ns 


tCAA = 37.5 ns 


tCAC = 34 ns 


tRP = 60.6 ns 


tRAC = 101.5 ns 


tCAA = 51 ns 


tCAC = 61.5 ns 


tRP = 80 ns 
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APPENDIX 
A 
PLD CODES AND SCHEMATICS 


Many design examples in this manual use PLDs (Pro- 
grammable Logic Devices) which can be programmed 
by the user to implement random logic. A PLD device 
can be used as a state machine or a signal decoder, for 
example. The advantages of PLDs include the follow- 
ing: 
I. PLD pinout is determined by the designer, which 
can simplify board layout by moving signals as re- 
quired. 


2. PLDs are inexpensive as compared to dedicated bus 
controllers. 


Intel EPLDs (Erasable Programmable Logic Devices) 
have the following additional advantages: 
I. Programmability/erasability 
allows 
EPLD 
func- 
tions to be changed easily, simplifying prototype de- 
velopment. 


2. Since EPLDs are implemented in CMOS technolo- 
gy, they can consume an order of magnitude less 
power than bipolar PLDs. Power-conscious applica- 
tions can benefit greatly from using EPLDs. 
3. Since the EPROM cell size is an order of magnitude 
smaller than an equivalent bipolar fuse, EPLDs can 
implement more functions in the same package. 
This higher integration can result in a lower overall 
component count for a design. The added flexibility 
can also mean that an extremely low number of 
"raw" (unprogrammed) devices need to be stocked 
versus bipolar PLDs. 
4. Once an EPLD design has been tested, plastic OTP 
(One-Time Programmable) 
versions of the device 
can be used in a production environment. 


PLDs have the following tradeoffs: 
I. Most PLDs do not have buried (not connected to 
outputs) registers. For some state machine applica- 
tions, this means using an otherwise available output 
pin to store the current state. 


2. The drive capability of CMOS EPLDs may be insuf- 
ficient for some applications. While the trend is 
towards use of CMOS throughout a system, in cases 


where high current levels are required, some addi- 
tional buffering may be required with EPLDs. 


A PLD consists logically of a programmable AND ar- 
ray whose output terms feed a fixed OR array. Any 
sum-of-products 
equations, within the limits of the 
number of PLD inputs, outputs, and equation terms, 
can be realized by specifying the correct AND array 
connections. Figure A-I shows an example of two PLD 
equations and the corresponding logic array. Note that 
every horizontal line in the AND array represents a 
multi-input AND gate; every vertical line represents a 
possible input to the AND gate. An X at the intersec- 
tion of a horizontal line and a vertical line represents a 
connection from the input to the AND' gate. 


The sum-of-products is then routed to a confignrable 
macrocell. The macrocell in Figure A-2 can be config- 
ured as a combinational output or registered output. 
The output can be active high or active low. A separate 
AND term controls the output buffer. 


Designing with PLDs consists of determining where Xs 
must be placed in the AND array and how to configure 
the macrocell. This task is simplified by logic compil- 
ers, such as iPLS II (Intel's Programmable Logic Soft- 
ware II) or ABEL. Logic compilers accept input in the 
form of sum-of-product equations and translate the in- 
put into a JEDEC programming file that can be used 
by programming hardware/software. 


Intel PLDs are described in the Programmable Logic 
Handbook 
Three Intel PLDs have been used in this 
manual to implement state machine and decode func- 
tions. These PLDs include: 


• 85C220---fast 20-pin superset of 16 x 8 type bipolar 
and CMOS PLDs. 


• 85C224-fast 
24-pin superset of 20 x 8 type bipolar 
and CMOS PLDs. 
• 85C508-fast 
address decode PLD with integral 
transparent latches. 


The 85C220 and 85C224 PLDs are both available at 
clock speeds to support fast state-machines in Intel486 
systems. The 85C508 provides a fast Enable-to-Output 
time with a minimal system setup time. 
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title 
'DRAM 
CONTROLLER 
- PLD 
1, 
INTEL 
CORPORATION' 
" Cycle 
Tracking 
Logic 
" Implemented 
with 
Intel 
85C224 
EPLD 


x 
.x. ; 
" ABEL 
'don't 
care' 
symbol 
c 
.C.;" 
ABEL 
'clocking 
input' 
symbol 


CLK 
BLAST- 
MEMCS- 
AHOLD 
HIT- 
BOFF- 
ADS- 
RFRQ 
RESET 
BRDY- 
MRDY- 
RAS- 
EP 


RFACK- 
CIP- 
M- 
CT 
PCHG 
Ql 
ALD 
adlst- 


pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 


"i486 
CPU 
input 
CLK" 
"i486 
CPU 
BLAST* 
output 
"Memory 
Chip 
Select 
"Address 
HOLD 
input 
to 
i486" 
"DRAM 
Page 
Hit 
Signal 
"Backoff 
input 
to 
i486" 
"Address 
Status 
output 
of 
i486" 
"Refresh 
Request 
Signal 
"System 
Reset 
"Processor 
Burst 
Ready 
pin 
"Memory 
ready 
"Row 
Address 
Strobe 
"Refresh 
indicator 
- count 
on 
RAS- 
low 


" Refresh 
acknowledge 
" ADS- 
active 
indicator 
- Cycle 
OK 
" Miss 
Indicator· 


" Cycle 
Track 
Output 
" Precharge 
state 
indicator 
" Precharge 
state 
indicator 
" Address 
Latch 
Disable 
" State 
Variable 


pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 


state 
[1, 
1]: 
if 
RESET 
then 
[1, 
1] 
else 
if AHOLD 
* 
!RFACK- 
* EP 
then 
[1, 
1] 
else 
if 
!ADS- 
* CT 
then 
[0, 
1] 
else 
[1, 
1]; 


state 
[0, 
1): 
if 
RESET 
* 
IBOFF- 
* MEMCS- 
then 
[1, 
1] 
else 
if 
HIT- 
& lRAS- & IMRDY- 
then 
[0, 
0] 
else 
if 
(IMRDY- 
* 
(IBRDY- 
& lBLAST-») 
then 
[1, 
1) 
else 
[0, 
1]; 


state 
[0, 
0]: 
if 
RESET 
* 
IBOFF- 
then 
[1, 
1] 
else 
if 
IPCHG & ICT 
* 
lADS-) 
then 
[0, 
1] 
else 
if 
IPCHG & ICT 
then 
[1, 
1] 
else 
[0, 
0]; 


state 
[0, 
0]: 
if 
RESET 
then 
[0, 
0] 
else 
if 
lRAS- 
then 
[1, 
0] 
else 
if 
RAS- 
& 
IRFACK- 
then 
[0, 
1] 
else 
[.0, 0]; 


state 
[1, 
0]: 
if 
RESET 
then 
[0, 
0] 
else 


• 


f RAS- 
& 
!EP then 
[0, OJ else 
f RFACK- 
& EP & !RAS- then 
[1, 1] else 
f RAS- & EP then 
[0, 1] else 
[1, 0]; 


state 
[0, 
1] : 


state 
[1, 1] : 


state_diagram 
[CT] 


state 
[OJ: 


state 
[1]: 


if RESET 
then 
[1] else 
if 
ICIP- & RFRQ & !MRDY- & !HIT- then 
[0] else 
if 
ICIP- & RFRQ & (iBRDY- & iBLAST-) * 
RFRQ & CIP- & ADS- 
then 
[0] else 
[1]; 


state 
[1, 0]: 
if RESET 
then 
[0, 1] else 
if 
!CIP- & MEMCS- 
then 
[0, 1] else 
if HIT- 
& !MRDY- then 
[1, 1] else 
if 
!HIT- & !MRDY- then 
[0, 1J else 
if 
!BRDY- & iBLAST- then 
[0, 1] else 
[1, 0]; 


state 
[1, 1]: 
if RESET 
then 
[0, 1] else 
if 
!CIP- & (iPCHG * MEMCS-l 
then 
[0, 1] else 
[1, 1]; 
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title 
'DRAM CONTROLLER 
- PLD 2, INTEL CORPORATION' 
" This 
PLD generates 
CASO 
" Implemented 
with 
the 
Intel 
85C220 
EPLD 


SC2 
device 
'E0320' ; 


x 
.x.; 
ABEL 
'don't 
care' 
symbol 
c 
.c. ; 
ABEL 
'clocking 
input' 
symbol 


" 
Inputs 


CLK 
pin 
1; 
"i486 CPU 
input CLK" 
RFACK 
pin 
2; 
"Refresh Acknowledge 
CIP- 
pin 
3 ; 
"Cycle OK 
LA2 
pin 
4; 
"Latched A2 
HIT- 
pin 
5; 
"DRAM Page Hit Signal 
BOFF- 
pin 
6; 
"Backoff 
input to i486' 
LW_R- 
pin 
7 . 
"Latched W/RlI 
RAS- 
pin 
8; 
'Row Address 
Strobe 
RESET 
pin 
9; 
'System 
Reset 
RDY- 
pin 
12; 
•Processor 
RDYlI 
MEMCS- 
pin 
13; 
"Memory Chip 
Select 
BRDY- 
pin 
18; 
•Processor 
BRDYlI 
BLAST- 
pin 
19; 
•Processor 
BLASTlI 
. Output 


CASOO- 
pin 
14; 
"CASO byte 
0,2 
C1 
pin 
15; 
"state variable 
C2 
pin 
16; 
'state variable 
CAS01- 
pin 
17 ; 
"CASO byte 
1,3 


state 
[1, 1, 1, 1]: 
if RESET 
II lBOFF- then 
[1, 1, 1, 1] else 
if 
!RFACK & lCIP- & lLA2 & LW_R- & !MEMCS- then 
[1, 1, 0, 1] else 
if 
lRFACK & lCIP- & lLW_R- & lRAS- 
& IHIT- & lMEMCS- II (RFACK & RAS-) 
then 
[0, 0, 1, 1] else 
[1, 1, 1, 1]; 


state 
[1, 1, 0, 1]: if RESET 
II lBOFF- then 
[1, 1, 1, 1] else 
if 
lRAS- & RDY- 
then 
[0, 0, 0, 0] else 
[1, 1, 0, 1]; 


state 
[0, 0, 0, 0]: if RESET 
II lBOFF- then 
[1, 1, 1, 1] else 
if lCIP- & lLA2 & LW_R- & lMEMCS- then 
[1, 1, 0, 0] else 
if CIP- 
II (ICIP- & (MEMCS- II lLW_R-» 
II (lCIP- & LW_R- 
& LA2) 
then 
[1, 1, 1, 1] else 
[0, 0, 0, 0]; 


state 
[1, 1, 0, 0): if 
!RAS- then 
[0, 0, 0, 0) else 
[1, 1, 0, 0]; 
. 


state 
[0, 0, 1, 1): if RESET 
II lBOFF- then 
[1, 1, 1, 1) else 
if 
lBRDY- & IBLAST- & lRFACK then 
[1, 1, 1, 1] else 
if 
lBRDY- & BLAST- & lLA2 then 
[1, 1, 1, 0] else 
if 
lBRDY- & BLAST- & LA2 then 
[0, 0, 1, 0] else 
if RFACK 
then 
[0, 0, 1, 0) else 
if BRDY- & lRFACK then 
[0, 0, 1, 1]; 


state 
[1, 1, 1, 0]: if RESET 
then 
[1, 1, 1, 1] else 
if 
lBOFF- then 
[1, 1, 1, 0) else 
if 
lBRDY- & BLAST- 
then 
[0, 0, 1, 1] else 
if 
IBRDY- & lBLAST- then 
[1, 1, 1, 1] else 
[1, 1, 1, 0); 


• 


intel~ 


state 
[0, 0, 1, 0] : 
f 
RESET 
1I 
IBOFF- then 
[1, 1, 1, 1] else 
f 
IBRDY- & BLAST- 
then 
[1, 1, 1, °] else 
f 
IBRDY- & IBLAST- 
1I BRDY- 
then 
[1, 1, 1, 1] ; 


test_vectors 


([CLK,RFACK,CIP-,LA2,HIT-,BOFF-,LW_R-,RAS-,RESET,RDY-,MEMCS-, 
BRDY-, BLAST-] 
-> 
[CASOO-,C1,C2,CAS01-]I 
· 
C 
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[c, x, x, x, x, x, 1, x, 
1, x, x, x, x] 
-> [x, x, x, x] 
[c, x, 
1, 0, x, x, 1, x, 1, x, x, x, x] 
-> 
[1, 1, 1, 1] 
[c, 0, 1, 0, x, 
1, 1, 0, 0, 1, 0, 1, 1] 
-> 
[1, 1, 1, 1] 
[c, 0, 1, 0, x, 
1, 1, 0, 0, 1, 0, 1, 1] 
-> 
[1, 1, 1, 1] 
[c, 0, 1, 0, x, 1, 1, 0, 0, 1, 0, 1, 1] 
-> 
[1, 1, 1, 1] 
[c, 0, 0, 0, 0, 1, 1, °, 
0, 1, 0, 1, 1] 
-> 
[1, 0, 1, 1] 
[c, 0, 0, 0, 0, 1, 1, 0, 0, 0, 0, 1, 1] 
-> 
[1, 0, 1, 1] 
[c, 0, 1, 0, 0, 1, 1, °, °, 
1, 0, 1, 1] 
-> 
[0, 0, °, 
0] 
[c, 0, 0, 0, 0, 1, 1, 0, 0, 0, 0, 1, 1] 
-> 
[1, 0, 0, 1] 
[c, 0, 1, x, 0, 1, 1, 0, 0, 1, 0, 1, 1] 
-> 
[0, 0, 0, 0] 
[c, 0, 0, 0, 0, 1, 1, 0, 0, 0, 0, 1, 1] 
-> 
[1, 0, 0, 1] 
[c, 0, 1, x, 0, 1, 1, 0, 0, 1, 0, 1, 1] 
-> 
[0, 0, 0, 0] 
[c, 0, 1, 1, x, 1, 1, 0, 0, 1, 0, 1, 1] 
-> 
[1, 1, 1, 1] 


end 
PG_MODE_DRAM_CTRL_2; 


240799-37 


title 
'DRAM CONTROLLER 
- PLD 
3, INTEL 
CORPORATION' 
• This 
PLD generates 
RAS 
• Implemented 
with 
the 
Intel 
85C220 
EPLD 


x 
.X.; 
ABEL 
'don't 
care' 
symbol 
c 
.C.;· 
ABEL 
'clocking 
input' 
symbol 


CLK 
pin 
1; 
M- 
pin 
2; 
CIP- 
pin 
3; 
MEMCS- 
pin 
4; 
HIT- 
pin 
5; 
RFACK 
pin 
6; 
PCHG 
pin 
7; 
WIP- 
pin 
8· 
RESET 
pin 
9; 
Q1 
pin 
12; 


"i486 
CPU 
input 
CLK" 
"Write 
Miss 
Indicator 
"Cycle 
OK 
"Memory 
Chip 
Select 
"DRAM 
Page 
Hit 
Signal 
"Refresh 
Acknowledge 
"RAS precharge 
count 
"Write 
in Progress 
"System 
Reset 
"RAS refresh 
count 


RAS2- 
pin 
13 ; 
RAS1- 
pin 
14; 
EP 
pin 
15; 
EP1 
pin 
16; 
RASO- 
pin 
17; 


RAS3- 
pin 
18; 
CSWIP- 
pin 
19; 


state_diagram 
[RASO-, RAS1-, EP] 


" RAS 
byte 
0,2 
" state 
variable 
" state 
variable 
" RAS 
byte 
1,3 


if RESET 
then 
[I, I, 0] else 
if 
lCIP- & lCSWIP- & IPCHG then 
[0, 0, 0) else 
if RFACK 
& WIP- 
then 
[I, I, 1] else 
[I, 
I, 
0]; 


if RESET 
if RFACK 
if 
lCIP- 


else 
[0, 


then 
[I, I, 0) else 
then 
[0, 0, 1) else 
& HIT- & !MEMCS- 
then 
0, 
0]; 


if RESET 
then 
[I, I, 0] else 
if 
lRFACK & IPCHG 
then 
[I, I, 0) else 
if RFACK 
& lWIP- 
# 
lRFACK & PCHG 
then 
[0, 0, 1] else 
if RFACK & WIP- & lQ1 then 
[1, I, 1); 


state 
[1, 1, 1] : 
if RESET 
then 
[1, I, 0] else 
if 
IPCHG then 
[0, 0, 1] else 
[I, I, 1] ; 


state 
[0, I, 0] : 
goto 
[I,I,0] ; 
state 
[0, 1, 1] : 
goto 
[I,I,0] ; 
state 
[1, 0, 0] : 
goto 
[1,1,0] ; 
state 
[I, 0, 
1] : 
goto 
[1,I,0) ; 


state_diagram 
[RAS2-,RAS3-, EP1] 


if RESET 
then 
[I, 1, 0) else 
if 
lCIP- & lCSWIP- & IPCHG then 
[0, 0, 0) else 


• 


state 
[0, 1, 0] : 
state 
[0, 1, 
1] : 
state 
[1, 0, 0] : 
state 
[1, 0, 
1] : 


equations 


if 
RESET 
if 
RFACK 
if 
ICIP- 
else 
[0, 


then 
[1, 1, 0] else 
then 
[0, 0, 1] else 
& HIT- & IMEMCS- 
then 
0, 
0]; 


if RESET 
then 
[1, 1, 0] else 
if 
!RFACK & IPCHG 
then 
[1, 1, 0] else 
if RFACK & IWIP- * 
!RFACK & PCHG 
then 
[0, 0, 1] else 
if RFACK & WIP- & IQ1 then 
[1, 1, 1]; 


if RESET 
then 
[1, 1, 0] else 
if 
IPCHG then 
[0, 0, 1] else 
[1, 1, 1]; 


goto 
[1,1,0]; 
goto 
[1,1,0] ; 
goto 
[1,1,0]; 
goto 
[1,1,0]; 


([CLK,M-,CIP-,MEMCS-,HIT-,RFACK,PCHG,WIP-,Q1,RESET] 
-> 
[RASO-,RAS1-,EP,RAS2-,RAS3-,EP1]l 
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[e, 
x, x, x, x, x, 
1, x, x, 
1] 
-> 
[x, x, x, x, 
x, x] ; 
[e, 
x, x, x, x, x, 
1, x, x, 
1] 
-> 
[1, 1, 0, 1, 1, 0] ; 
[e, 
x, x, x, x, x, 
1, x, x, 
1] 
-> 
[1, 1, 0, 1, 1, 0]; 
[e, 
x, 
x, x, x, x, 
1, x, x, 
1] 
-> 
[1, 1, 0, 1, 1, 0] ; 
[e, 
x, 
x, x, x, x, 
1, x, x, 
1] 
-> 
[1, 1, °t 
1, 1, 0] ; 
[e, 
x, x, x, x, x, 
1, x, x, 
1] 
-> 
[1, 1, 0, 1, 1, 0] ; 
[e, 
x, 
x, x, x, x, 
1, x, x, 
1] 
-> 
[1, 1, 0, 1, 1, 0] ; 
[e, 
1, 1, x, 
x, 
0, 0, 0, 0, 0] 
-> 
[1, 1, 0, 1, 1, 0] ; 
[e, 
1, 1, x, x, 
0, 0, 0, 0, 0] 
-> 
[1, 1, 0, 1, 1, 0] ; 
[e, 
1, 0, 0, 0, 0, 0, 0, 0, 0] 
-> 
[0, 0, 0, 0, 0, 0] ; 
[e, 
1, 0, 0, 0, 0, 0, 0, 0, 0] 
-> 
[0, 0, 0, 0, 0, 0] ; 
[e, 
1, 1, x, x, 
0, 0, 0, 0, 0] 
-> 
[0, 0, 0, 0, 0, 0] ; 
[e, 
1, 0, 0, 0, 0, 0, 0, 0, 0] 
-> 
[01,0, 0, 0, 0, 0] ; 
[e, 
1, 1, x, x, 
0, 0, 0, 0, 0] 
-> 
[0, 0, 0, 0, 0, 0] ; 
[e, 
1, 0, 0, 0, 0, 0, 0, 0, 0] 
-> 
[0, 0, 0, 0, 0, o 1 ; 
[e, 
1, 1, x, x, 
0, 0, 0, 0, 0] 
-> 
[0, 0, 0, 0, 0, 0] ; 
[e, 
1, 1, x, x, 
0, 0, 0, 0, 0] 
-> 
[0, 0, 0, 0, 0, 0] ; 
[e, 
1, 1, x, x, 
0, 0, 0, 0, 0] 
-> 
[0, 0, 0, 0, 0, 0] ; 


end 
SC_MODE_DRAM_CTRL_3; 


240799-39 


intel~ 


title 
'DRAM 
CONTROLLER 
- PLD 
4, 
INTEL 
CORPORATION' 
" This 
PLD 
generates 
MRDY 
and 
MBRDY 
" Implemented 
with 
Intel 
85C224 
EPLD 


x 
.X.;" 
ABEL 
'don't 
care' 
symbol 
c 
.C.;" 
ABEL 
'clocking 
input' 
symbol 


" 
Inputs, 


CLK 
pin 
1; 
M- 
pin 
2; 
CIP- 
pin 
3; 
MEMCS- 
pin 
4; 
HIT- 
pin 
5; 
RFACK 
pin 
6; 
ADS- 
pin 
7 ; 


W_R 
pin 
8; 
RESET 
pin 
9; 
dum1 
pin 
10; 
BOFF- 
pin 
11; 
WIP- 
pin 
14; 
CAS- 
pin 
15; 
BLAST- 
pin 
22; 
RAS- 
pin 
23; 


" 
Output 


dumO 
pin 
16; 
MT 
pin 
17; 
MRDY- 
pin 
18; 
DALE- 
pin 
19; 
LWR 
pin 
20; 
BRDY- 
pin 
21; 


state_diagram 
[MRDY-] 


"i486 
CPU 
input 
CLK" 
"Miss 
Indicator 
"Cycle 
OK 
"Memory 
Chip 
Select 
"DRAM 
Page 
Hit 
Signal 
"Refresh 
acknowledge" 
"CPU ADS- 
"CPU W/R 
"System 
Reset 


"CPU 
Backoff 
input 
"Write 
in Progress 
"Column 
Address 
Strobe 
"CPU Burst 
Last 
output 
"Row Address 
Strobe 


"BRDY 
state 
miss 
tracking 
"Memory 
RDY 
(modified 
with 
other 
RDYs) 
"Decode 
Latch 
enable 
"Internally 
latched 
W/R* 
for RDY 
"Processor 
BRDY- 


if 
(IRFACK & lADS- & W_R & lRAS- & M-) 
* 
(ICIP- & LWR & 


IMEMCS- & lRFACK & M-) 
then 
[0] else 
[1]; 


state_diagram 
[BRDY-, 
MT] 


state 
[1, 1]: 
if 
lCIP- & IHIT- & IMEMCS- & !LWR & lRFACK & WIP- & lRAS- 


then 
[0, 1] else 
if 
lCIP- & IMEMCS- & HIT- 
& !LWR * 
lCIP- & !MEMCS- & RAS- & lLWR then 
[1, 0]; 


state 
[1, 0] : if RESET 
then 
[1, 1] else 
if WIP- 
& 
!RFACK 
& 
!CAS- then 
[0, 1] ; 


state 
[0, 1] : if RESET 
* 
!BOFF- 
* 
!BLAST- 
then 
[1, 1] else 
[0, 1] ; 


state_diagram 
[DALE-] 


state 
[0] : 
if RESET 
then 
[0] else 
if 
IADS- 
then 
[1] else 
[0]; 


state 
[1] : 
if RESET 
* 
!BOFF- 
then 
[0] else 
if 
lCIP- 
then 
[0] else 
[1] ; 


• 


intel~ 


if RESET 
then 
[0] else 
if 
!ADS- & W_R 
then 
[1] else 
[0]; 


if RESET * 
!BOFF- then 
[0] else 
if 
!ADS- & !W_R then 
[0] else 
[1]; 


([CLK,M-,CIP-,MEMCS-,HIT-,RFACK,ADS-,W_R,RESET,WIP-,BOFF-,BLAST-,RAS-] 
-> 
[MRDY-,DALE-, LWR, BRDY -] ) 
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[e, 
x, x, x, x, x, 
1, x, 
1, x, x, x, x] 
-> [x, x, x, x] ; 
[e, 
x, 
1, 
1, x, x, 
1, x, 
1, x, x, x, x] 
-> 
[1, 0, 
0, 1] ; 
[e, 
1, 1, 1, x, 
0, 1, x, 
0, 
0, 1, 1, 1] 
-> 
[1, 0, 
0, 1] ; 
[e, 
1, 1, 1, x, 
0, 
1, x, 
0, 0, 
1, 1, 1] 
-> 
[1, 0, °, 
1] ; 
[e, 
1, 1, 1, x, 
0, 0, 1, 0, 0, 
1, 1, 1] 
-> 
[1, 1, 
1, 1] ; 
[e, 
1, 0, 
0, 0, 0, 
1, x, 
0, 0, 1, 1, 1] 
-> 
[0, 0, x, 
1] ; 
[e, 
1, 0, 
0, 0, °, 
1, x, 
0, 1, 1, 1, 0] 
-> 
[1, 0, 1, 1] ; 
[e, 
1, 1, 0, 
0, 0, 0, 
1, 0, 1, 1, 1, 0] 
-> 
[0, 1, 1, 1] ; 
[e, 
1, 0, 0, 
0, 0, 
1, x, 
0, 1, 1, 1, 0] 
-> 
[1, 0, x, 
1] ; 
[e, 
1, 1, x, 
0, 0, 0, 
1, 0, 
1, 1, 1, 0] 
-> 
[0, 1, 1, 1] ; 
[e, 
1, 0, 0, 
0, 0, 
1, x, °, 
1, 1, 1, 0] 
-> 
[1, °, x, 
1] ; 
[e, 
1, 1, x, 
0, 
0, 1, x, °, 
1, 1, 1, 0] 
-> 
[1, 0, x, 
1] ; 
[e, 
1, 
1, 1, x, 
0, 1, x, 
0, 
1, 1, 1, °] 
-> 
[1, 0, x, 
1] ; 


end SC_MODE_DRAM_CTRL_4; 


240799-41 


title 
'DRAM CONTROLLER 
- PLD 
6, INTEL 
CORPORATION' 
, This 
PLD generates 
AO 
for Bank 
0 
, Implemented 
with 
the 
Intel 
85C224 
EPLD 


x 
.X.;' 
ABEL 
'don't care' 
symbol 
c 
.C.; 
ABEL 
'clocking 
input' 
symbol 


CLK 
BRDY- 
CIP- 
MEMCS- 
LA313 
DATASEL 
RAS- 
LW_R 
RESET 
BLAST- 
A3 
ALD 
duml 
WEO- 
dum2 


pin 
1; 
pin 
2; 
pin 
3; 
pin 
4; 
pin 
5; 
pin 
6; 
pin 
7; 
pin 
8; 
pin 
9; 
pin 
10; 
pin 
11; 
pin 
14; 
pin 
15; 
pin 
22; 
pin 
23; 


BOOMAO 
BOA 
CSO- 
dun 
dum 
B01MAO 


pin 
21; 
pin 
20; 
pin 
19; 
pin 
18; 
pin 
17; 
pin 
16; 


'i486 CPU 
input 
CLK' 
'Burst 
Ready 
'Cycle 
OK 
'Memory 
Chip 
Select 
'Latched 
A3 and 
A13 
'Bank 
Select 
for Reads 
'Row address 
strobe 
'CPU W/R 
latched- 
'System 
Reset 
'CPU BLAST- 
output 
'CPU A3 
line 
'Address 
Latch 
disable 


state 
[1, 1]: 
if RESET 
then 
[1, 1] else 
if CIP- & lALD & !A3 then 
[0, 1] else 
if 
lCIP- & lALD & lA3 then 
[0, 1] else 
if 
lCIP- & ILW_R & !MEMCS- & WEO- 
then 
[1, 0] else 
[1, 1]; 


state 
[0, 1]: 
if RESET 
then 
[1, 1] else 
if CIP- & lALD & A3 
then 
[1, 1] else 
if 
lCIP- & lALD & A3 then 
[1, 1] else 
if 
lCIP- & lLW_R & IMEMCS- & WEO- 
then 
[0, 0] else 
[0, 1]; 


state 
[1, 0] : if RESET 
1/ 
(IBRDY- & IBLAST-) 
then 
[1, 1] else 
if 
IBRDY- & !DATASEL 
then 
[0, 0] else 
[1, 0] ; 


state 
[0, 0] : if RESET 
1/ 
(lBRDY- & IBLAST-) 
then 
[1, 1] else 
if 
IBRDY- & IDATASEL 
then 
[1, 0] else 
[0, 0] ; 


equations 


• 


title 
'DRAM CONTROLLER 
- PLD 7, INTEL CORPORATION' 
• This PLD generates 
DATASEL and WE 
• Implemented with 
Intel 85C220 EPLD 


x 
.X.; 
ABEL 
'don't care' symbol 
c 
.C.;· 
ABEL 
'clocking input' symbol 


CLK 
BRDY- 
CIP- 
MEMCS- 
LA2 
CASOO- 
CASIO- 
LW_R 
RESET 
BLAST- 
BOFF- 
HIT- 


DATASEL 
RS- 
RALE- 
WE- 
BSEL 


pin 
1; 
pin 
2; 
pin 
3; 
pin 
4; 
pin 
5; 
pin 
6; 
pin 
7; 
pin 
8; 
pin 
9; 
pin 
12; 
pin 
13; 
pin 
19; 


pin 
14; 
pin 
15; 
pin 
16; 
pin 
17; 
pin 
18; 


state 
[1, 0): 


state 
[0, 0] : 


state 
[0, 1) : 


state_diagram 
[WE-] 


state 
[1): 


state 
[0): 


·i486 CPU input CLK· 
·Burst Ready 
·Cycle OK 
·Memory Chip Select 
·Latched A2 
·CAS output BankO 
·CAS output Bank1 
·Latched CPU W/R 
·System Reset 
·CPU Burst Last output 
·CPU Backoff 
input 
·DRAM page hit signal 


·Bank select for reads 
·State variable 
·Enable Row Address Latch 
·Write Enable posted writes 
·Se1ects read or write data path 


if RESET then 
[1, 1] else 
if ICIP- & !LA2 & ILW_R & IMEMCS- then 
[0, 0) else 
if iCIP- & LA2 & lLW_R & IMEMCS- then [1, 0) else [1, 1); 


if RESET * IBOFF- * 
(IBRDY- & IBLAST-) then 
[1, 1] else 
if IBRDY- & BLAST- then 
[0, 0) else 
[1, 0]; 


if RESET * 
IBOFF- * 
(!BRDY- & IBLAST-) then 
[1, 1) else 
if IBRDY- & BLAST- then 
[1, 0] else 
[0, 0); 


goto 
[1,1]; 


if RESET then 
[1) else 
if LW_R & ICIP- & iMEMCS- then 
[0) else 
[1]; 


if RESET * 
IBOFF- then 
[1) else 
if LW_R & lCIP- & IMEMCS- then 
[0) else 
if CASOO- 
+ CAS10- then 
[1); 


if RESET then 
[0] else 
if lCIP- & HIT- & IMEMCS- then 
[1] else 
[0); 


if RESET * 
IBOFF- then 
[0) else 
if 
!HIT- then 
[0) else 
[1); 


title 
'DRAM CONTROLLER 
- PLD 8, INTEL CORPORATION' 
• This 
PLD generates 
CASl 
(CAS for Bank 
1) 


sc8 
device 
'E0320' ; 


x 
.X.; 
ABEL 
'don't 
care' 
symbol 
c 
.c. ; 
ABEL 
'clocking 
input' 
symbol 
· Inputs 


CLK 
pin 
1; 
·i486 CPU 
input CLK 
RFACK 
pin 
2; 
·Refresh Acknowledge 
CIP- 
pin 
3; 
·Cycle OK 
LA2 
pin 
4; 
·Latched A2. 
HIT- 
pin 
5; 
'DRAM Page Hit Signal 
BOFF- 
pin 
6; 
•Backoff 
input to i486' 
LW_R- 
pin 
7; 
·Latched W/RII 
RAS- 
pin 
8; 
·Row Address 
Strobe 
RESET 
pin 
9; 
·System Reset 
RDY- 
pin 
12; 
·Processor 
RDYII 
MEMCS- 
pin 
13; 
·Memory Chip 
Select 
BRDY- 
pin 
18; 
·Processor 
BRDYII 
BLAST- 
pin 
19; 
·Processor 
BLASTII 
· OUtput 


CAS10- 
pin 
14; 
·CAS1 byte 
0,2 
C1 
pin 
15; 
·state variable 
C2 
pin 
16; 
·state variable 
CAS 11- 
pin 
17; 
·CAS1 byte 
1,3 


state_diagram 
[CAS10-,CAS11-, 
Cl, C2) 


state 
[1, 1, 1, 1): if RESET 
II !BOFF- then 
[1, 1, 1, 1) else 
if 
!RFACK & ICIP- & LA2 & LW_R- & !MEMCS- then 
[1, 1, 0, 1) else 
if 
IRFACK & lCIP- & !LW_R- & !RAS- 


& IHIT- & !MEMCS- II (RFACK & RAS-) 
then 
[0, 0, 1, 1) else 
[1, 1, 1, 1); 


state 
[1, 1, 0, 1): if RESET 
II IBOFF- then 
[1, 1, 1, 1) else 
if 
IRAS- & RDY- 
then 
[0, 0, 0, 0] else 
[1, 1, 0, 1]; 


state 
[0, 0, 0, 0]: if RESET 
II IBOFF- then 
[1, 1, 1, 1] else 
if lCIP- & LA2 & LW_R- & !MEMCS- then 
[1, 1, 0, 0] else 
if CIP- 
II (ICIP- & (MEMCS- II !LW_R-)) II (ICIP- & LW_R- 
& 
lLA2) then 
[1, 1, 1, 1) else 
[0, 0, 0, 0]; 


state 
[0, 0, 1, 1]: if RESET 
II IBOFF- then 
[1, 1, 1, 1] else 
if 
IBRDY- & IBLAST- & !RFACK then 
[1, 1, 1, 1] else 
if 
IBRDY- & BLAST- & LA2 then 
[1, 1, 1, 0] else 
if 
lBRDY- & BLAST- & !LA2 then 
[0, 0, 1, 0] else 
if RFACK 
then 
[0, 0, 1, 0] else 
if BRDY- & IRFACK then 
[0, 0, 1, 1]; 


state 
[1, 1, 1, 0]: if RESET 
then 
[1, 1, 1, 1] else 
if 
!BOFF- then 
[1, 1, 1, 0] else 
if 
IBRDY- & BLAST- 
then 
[0, 0, 1, 1] else 
if 
IBRDY- & IBLAST- then 
[1, 1, 1, 1] else 
[1, 
1, 
1, 
0]; 


• 


Ap·447 
infell') 


state 
[0, 
0, 
1, 
0): 
f 
RESET * 
IBOFF- 
then 
[1, 1, 
1, 
1] else 
f 
IBRDY- & BLAST- 
then 
[1, 
1, 
1, 
0] else 
f 
IBRDY- & !BLAST- * BRDY- 
then 
[1, 1, 
1, 
1) 


([CLK,RFACK,CIP-,LA2,HIT-,BOFF-,LW_R-,RAS-,RESET,RDY-,MEMCS-, 
BRDY-, BLAST-) 
-> 
[CAS10-,C1,C2,CAS11-]l 


C 
R 
C 
L 
F 
I 
K 
A 
P 
C 
K 


L 
H 
A 
I 
2 
T 


B 
L 
R 
o 
W 
A 
FRS 
F 


R 
R 
M 
E 
D 
E 
S 
Y 
M 
E 
C 
T 
S 


B 
B 
R 
L 
D 
A 
Y 
S 
T 


title 
'DRAM CONTROLLER 
- PLD 
11, 
INTEL 
CORPORATION' 
" This 
PLD 
generates 
the mux 
enables, 
write 
enables 
and 
WIP* 
" Implemented 
with 
the 
Intel 
85C220 
EPLD 


SC11 
device 


x 
.X.; 
c 
.c. ; 


" 
Inputs 


CLK 
pin 
1; 
LA2 
pin 
2; 
CIP- 
pin 
3 ; 
MEMCS- 
pin 
4; 
RESET 
pin 
5; 
LW_R 
pin 
6; 
C01 
pin 
7 ; 
CAS01- 
pin 
8; 
C11 
pin 
9; 
CAS 11- 
pin 
19; 


" Output 


WIP- 
pin 
12; 
MENO- 
pin 
13; 
WEO- 
pin 
14; 
LWIP- 
pin 
15; 
dum 
pin 
16; 
WE1- 
pin 
17 ; 
MEN1- 
pin 
18; 


state_diagram 
(WEO-] 


ABEL 
'don't 
care' 
symbol 
" ABEL 
'clocking 
input' 
symbol 


"i486 
CPU 
input 
CLK" 
"Latched 
A2. 
"Cycle 
OK 
"Memory 
Chip 
select. 
"System 
Reset 
"latched 
CPU W/R* 
"Write 
indication 
BankO 
"CAS output 
Bank 
0 
"Write 
indication 
Bank1 
"CAS output 
Bank 
1 


"Write 
in Progress 
"Mux enable 
Bank 
0 
"Write 
enable 
Bank 
0 
"Latched 
WIP- 


state 
(1] : 
if RESET 
then 
(1] else 
if 
ICIP- & LW_R & !MEMCS- & lLA2 then 
[0]; 


state 
(0] : 
if RESET 
then 
(1] else 
if 
!C01 then 
[0] else 
if C01 
then 
[1]; 


state_diagram 
[WE1-] 


state 
(1] : 
if RESET 
then 
(1] else 
if 
ICIP- & LW_R & IMEMCS- & LA2 
then 
(0]; 


state 
(0] : 
if RESET 
then 
[1] else 
if 
tCll then 
[0] else 
if C11 
then 
(1]; 


if RESET 
then 
[1] else 
if 
IC01 * 
IC11 then 
(0] else 
[1]; 


if RESET 
then 
[1] else 
if 
ICIP- & LW_R & IMEMCS- & ILA2 then 
(0]; 


• 


f RESET 
then 
[1) else 
f 
ICOl & CASOl- 
then 
[0) else 
f 
ICIP- & LW_R & IMEMCS- & ILA2 & ICASOl- then 
[0) else 
f 
ICAS01- then 
[1); 


state 
[1): 
if 
RESET then 
[1) else 
if 
ICIP- & LW_R & IMEMCS- & LA2 then 
[0); 


state 
[0): 
if 
RESET then 
[1) else 
if 
IC11 & CAS11- 
then 
[0) else 
if 
ICIP- & LW_R & IMEMCS- & ILA2 & ICAS 11- then 
[0) else 
if 
ICASll- then 
[1); 


lWIP- = 
ILWIP- * 
ICOl * 
ICll; 


end SC_MODE_DRAM_CTRL_ll; 


title 
'DRAM 
CONTROLLER 
- PLD 
12, 
INTEL 
CORPORATION' 


• This 
PLD 
generates 
the 
WRE 
write 
enables 
" Implemented 
with 
the 
Intel 
85C220 
EPLD 


SC12 
device 
'E0320'; 


x 
.x. ; 
" 
ABEL 
'don't 
care' 
symbol 
c 
.C. ; 
" 
ABEL 
'clocking 
input' 
symbol 
· Inputs 


ncO 
pin 
1; 
"no 
connect 
WEO- 
pin 
2; 
"Write 
Enable 
Bank 
0 
WE1- 
pin 
3 ; 
"Write 
Enable 
Bank 
1 
BEO- 
pin 
4; 
"Byte 
Enable 
0 
BE1- 
pin 
5; 
·Byte 
Enable 
1 
BE2- 
pin 
6; 
·Byte 
Enable 
2 
BE3- 
pin 
7; 
•Byte 
Enable 
3 
nOc1 
pin 
8; 
. 
nc2 
pin 
9; 
· Output 


WREOO- 
pin 
12; 
WRE10- 
pin 
13; 
WRE01- 
pin 
14; 
WREll- 
pin 
15; 
WRE02- 
pin 
16; 
WRE12- 
pin 
17; 
WRE03- 
pin 
18; 
WRE13- 
pin 
19; 


equations 


!WREOO- 
!WEO- 
& 
!BEO-; 


IWRE10- 
lWE1- 
& 
IBEO-; 


!WRE01- 
!WEO- 
& 
!BEl-; 


IWREll- 
IWE1- 
& 
!BE1-; 


lWRE02- 
lWEO- 
& 
IBE2-; 


!WRE12- 
!WE1- 
& 
!BE2-; 


!WRE03- 
IWEO- 
& 
!BE3-; 


IWRE13- 
lWE1- 
& 
!BE3-; 


• 


title 
'DRAM 
CONTROLLER 
- PLD 
15, 
INTEL 
CORPORATION' 
" This 
PLD 
combines 
ready 
signals 
" Implemented 
with 
the 
Intel 
85C220 
EPLD 


MEMCS- 
JRDY- 
MRDY- 
BRDY- 
ALD 
CKEN- 
SKEN- 
BRDYO- 
M- 
CIP- 


WEN- 
RDY- 
MRDYCS- 
MALD- 
durn10 
PBRDY- 
KEN- 
DRDY- 


.x. ; 
.C. ; 


pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 


pin 
12; 
pin 
13; 
pin 
14; 
pin 
15; 
pin 
16; 
pin 
17; 
pin 
18; 
pin 
19; 


" ABEL 
'don't 
care' 
symbol 
" ABEL 
'clocking 
input' 
symbol 


"Memory 
Ready 
for 
Write 
Cycles 
"Processor 
BRDY 
for 
Read 
Cycles 
"Address 
latch 
disable 
"KEN* 
output 
of 
485Turbocache 
"KEN* 
input 
to 
485Turbocache 
"Burst 
Ready 
Out 
from 
485Turbocache 
"Miss 
Indicator 
"Cycle 
OK 


"Write 
enable 
for 
write 
latches 
"Processor 
RDY 
input 


"Processor 
BRDY* 
input 
"Processor 
KEN* 
input 


KEN- 
= 
SKEN- 
& CKEN-; 


PBRDY- 
= 
BRDY- 
& BRDYO-; 


title 
'DRAM CONTROLLER 
- PLD 
16, 
INTEL 
CORPORATION' 
" This 
PLD generates 
the 
chip 
select 
outputs. 
" Implemented 
with 
the 
Intel 
85C220 
EPLD 


x 
.X.; 
ABEL 
'don't care' 
symbol 
c 
.C.; 
ABEL 
'clocking 
input' 
symbol 


01 
02 
03 
04 
05 
BRDYO- 
CDIS 
M_IO 
D_C 


dum7 
KSEL- 
MEMCS- 
CS54- 
CS5l0- 
JMCS- 
EPCS- 
RAMCS- 


pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 
pin 


"Burst 
Ready 
Output 
of 
485Turbocache 
"Jumper 
select, 
disables 
cache 
enable 
of 
485 
"Processor 
M/IO* 
output 
"Processor 
D/C* 
output 


pin 
12; 
pin 
13; 
pin 
14; 
pin 
15; 
pin 
16; 
pin 
17; 
pin 
18; 
pin 
19; 


"Active 
to enable 
cacheing 
in 485 
"Memory 
Chip 
Select 
" 


RAMCS- 
= MEMCS- 
* 
!MEMCS-; 


!MEMCS- 
= 
(101 & M_IO & BRDYO-) * 
(02 & 104 & M_IO & BRDYO-); 


!EPCS- 
= 
(!02 * 
!03) & M-IO; 


!CS510- 
=105 & !M_IO & D_C; 


05 & !M_IO & D_C; 


(!EPCS- 
* 
!CS510- * 
!CS54-); 


• 


intel~ 


title 
'DRAM CONTROLLER 
- PLD 
17, 
INTEL 
CORPORATION' 
, This 
PLD 
generates 
AO 
for Bank 
1 
, Implemented 
with 
the 
Intel 
85C224 
EPLD 


SC17 
device 
'E224' ; 


x 
.X.; 
, ABEL 
'don't care' 
symbol 
c 
.C. ; 
, ABEL 
'clocking 
input' 
symbol 
, Inputs 


CLK 
pin 
1; 
'i486 CPU 
input 
CLK' 
BRDY- 
pin 
2; 
'Burst 
Ready 
CIP- 
pin 
3; 
'Cycle 
OK 
MEMCS- 
pin 
4; 
'Memory 
Chip 
Select 
LA313 
pin 
5; 
'Latched 
A3 and 
AD 
DATASEL 
pin 
6; 
'Bank Select 
for Reads 
RAS- 
pin 
7; 
'Row address 
strobe 
LW_R 
pin 
8; 
'CPU W/R 
latched- 
RESET 
pin 
9; 
'System 
Reset 
BLAST- 
pin 
10; 
'CPU BLAST- 
output 
A3 
pin 
11; 
'CPU A3 
line 
ALD 
pin 
14; 
'Address 
Latch 
disable 
dum1 
pin 
15; 
WE1- 
pin 
22; 
'Write 
enable 
dum2 
pin 
23; 
. Output 


B10MAO 
pin 
21; 
'Bank 
1 AO 
B1A 
pin 
20; 
CSO- 
pin 
19; 
dun 
pin 
18; 
dum 
pin 
17 ; 
B11MAO 
pin 
16; 
'Bank 
1 AO 


state_diagram 
[B1A, CSO-) 


state 
[1, 1): 
if RESET 
then 
[I, 1] else 
if CIP- & lALD & !A3 then 
[0, 1) else 
if 
ICIP- & lALD & IA3 then 
[0, 1) else 
if 
lCIP- & lLW_R & lMEMCS- & WE1- 
then 
[1, 0) else 
[1, 1); 


state 
[0, 1): 
if RESET 
then 
[I, 1) else 
if CIP- & IALD & A3 then 
[I, 1) else 
if 
ICIP- & !ALD & A3 then 
[I, 1) else 
if 
ICIP- & lLW_R & IMEMCS- & WE1- 
then 
[0, 0] else 
[0, 1); 


state 
[1, 0) : if RESET 
II 
(IBRDY- & IBLAST-) 
then 
[I, 
1) else 
if 
IBRDY- & !DATASEL 
then 
[0, 0) else 
[1, 0) ; 


state 
[0, 0) : if RESET 
II 
(IBRDY- & IBLAST-) 
then 
[I, 
1) else 
if 
lBRDY- & IDATASEL 
then 
[I, 
0] else 
[0, 0) ; 


equations 
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With the introduction of 33 MHz Inte1386TMmicro- 
processors, desk top system designers needed to ac- 
count for a previously limited phenomena, high speed 
signal effects. As outlined in the Intel Application 
Note, 33 MHz Intel386™ 
System Design Considera- 


tions [1], as edge rates become a more significant por- 
tion of the clock cycle transmission line effects take 
hold. When extending system designs to 50 MHz the 
effects intensify themselves. What were once minor re- 
flections become large spikes. Driver characteristics, 
signal routing, load characteristics take on greater sig- 
nificance and electrical simulations (such as SPICE) be- 
comes essential. One of the most critical signals is the 
clock. Not only is signal quality a key issue, but the 
skew between different instantiations of the clock can 
cause serious system problems. In this application note, 
the key issues in the design of a 50 MHz clock for an 
Intel486 system will be discussed. These will be illus- 
trated with the clock design for a CPU cache subsystem 
using the 82495DX cache controller 
and 82490DX 


Cache SRAM in an Intel486TM DX CPU-Cache mod- 
ule. 


The Intel486 DX CPU-Cache chip set forms the core of 
a CPU-second level cache module. Along with a memo- 
ry bus controller, this chip set provides a CPU-like in- 
terface for many types of memory buses. The Intel486 
DX CPU-Cache chip set is described in the following 
Intel literature [2, 31. 


Previous designs have viewed signal traces as simple 
lumped components 
connected 
by direct 
wiring as 


shown in Figure I-I. However, we know that at very 
high frequencies, long traces behave like transmission 
lines as shown in Figure 1-2. When does the simple 
lumped load model break down? Simply put, you must 
consider transmission line effects when the signal prop- 
agation time becomes comparable to the rise or fall 
times of the signal. This can be restated with simple 
rules of thumb either in the frequency domain as: 


signal transit time > 1/8th the signal rise or fall time 


Thus, the designer must know the signal rise and fall • 
times, the characteristic impedance of the interconnect, 
and the source and load impedances. Several excellent 
treatments of these subjects exist in the open literature 
[refsl so it will not be repeated here. For signals with 
I ns-3 ns rise and fall times (10% to 90%) the maxi- 
mum length of interconnect before transmission line ef- 
fects are significant range from about 2.1 inches to 9.1 
inches for 50n traces. 


Driving 
Receiving 
Component 
Component 


,"---[>--b_1 


Figure 1·1. The simple representation 
for components 
follows 
the lumped load approach. 


Connections 
between 
components 
are simply modeled 
as a wire. 
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Figure 1-2. With faster edge rates, the interconnection 
between 
components 
must now be modeled 
as a transmission 
line. 


2.0 
CLOCK DISTRIBUTION IN 
SYSTEMS 


The primary goals of a system clock distribution is to 
deliver a clock signal to each component's input pins 
which meets the system's requirements for: component 
to component skew; acceptable waveform shape (rise 
time, fall time, overshoot, undershoot, high and low 
times, duty cycle); and stability (cycle to cycle). With 
the complexities involved in distributing a system clock 
at these high frequencies, system clock design plays a 
major role in the overall system architecture. 


With the inherent design complexities of 50 MHz, de- 
signers must trade-off the exploitation of 50 MHz com- 
ponents and subsystems with the benefits they bring to 
the system performance. For example, the system de- 
signer may decide not to use a 50 MHz memory bus 
with the 50 MHz Intel486 CPU-Cache subsystem. This 
decision would be based on the trade-off of the addi- 
tional components which would need to run at 50 MHz 
against the system throughput benefits of greater mem- 
ory bus bandwidth. 


Another 
trade-off 
decision 
might. be to 
keep the 
50 MHz confined to as few components as possible, in a 
relatively small physical area, in order to reduce the 
high speed design requirements. Since the operation of 
the Intel486 microprocessor and the speed of its exter- 
nal interface are tied together, the designer must look 
for other areas to produce a hierarchy of component 
speeds. 


An important benefit of a look through second level 
cache has always been a level of isolation between the 
CPU and memory buses. However, in the past, this 
benefit has mainly been realized as reduced transac- 
tions based on cache hits. Now, this second level cache 


provides a frequency 
isolation 
function in addition to 
transaction isolation. Figure 2-1 below shows a possible 
system implementation with a hierarchy of buses rang- 
ing from the high frequency, 50 MHz, CPU bus down 
to the lower speed, 8.33 MHz EISA bus. This type of 
design keeps the critical 50 MHz signals easily confined 
to a small physical area-paramount 
to reducing trans- 


mission line effects. 


8.33 MHz EISA Bus 
33 MBytes/sec 


Figure 2·1. System 
Design with 
Bus Frequency 
Hierarchy 


Previous clock designs, even with multiple frequency 
components, have largely been implemented with fully 
synchronous circuitry. While syn~hronous designs typi- 
cally have the highest performance and simplify the 
logical design of the system, at 50 MHz they will test 
the designer's ability to route and generate acceptable 
quality clocks. Figures 2-2 through 2-4 show three pos- 
sible methods of implementing clocking schemes for the 
triple frequency system design shown in Figure 2-1. 


intelQP 


Figure 2-2 shows the traditional method of clock gener- 
ation. A 50 MHz oscillator (a 100 MHz differential 
PECL oscillator may be used with some clock driver 
chips) feeds a clock generation circuit to produce a 
number of 50 MHz clock signals. These are routed to 
the 50 MHz components (Inte1486 DX CPU-Cache 
subsystem and memory bus controller) and the 25 MHz 
clock generation 
circuitry. 
Here multiple 
25 MHz 
clocks are generated synchronously (but delayed) from 
the master 50 MHz clock. Some of the newer clock 
generation 
chips also contain 
subharmonic 
outputs 
which could also produce the 25 MHz clocks. These 
25 MHz clocks are routed to the 25 MHz components 
(the memory module and bus interface unit) and the 
8.33 MHz clock generation circuits. In this scheme, the 
8.33 MHz 
clock will also be synchronous 
to the 
25 MHz and 50 MHz clocks, but delayed from both. 
Note, however, that the delay becomes less critical with 
the lower frequency signals. 
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Figure 2-2 also shows a variant of this scheme where 
the oscillator output is routed to additional clock gener- 
ation circuits rather than routing the 50 MHz clocks. 
This may have an advantage depending on the oscilla- 
tor design and the logic levels used to route the oscilla- 
tor output. 


A second method of clock distribution relies on the 
generation of a 25 MHz master signal and its routing to 
25 MHz components and the local generation of asyn- 
chronous 50 MHz clocks as shown in Figure 2-3. The 
advantage of this scheme is the simplified routing of 
high speed signals if multiple 50 MHz clocks are re- 
quired (for example for a second CPU module in a mul- 
ti-processing system). The disadvantages are increased 
cost due to multiple oscillators and the reduced per- 
formance of a CPU/memory 
bus that is not fully syn- 
chronous. 


CLOCK 


GENERATION 


CIRCUITS 
+ 2 


CLOCK 


GENERATION 


CIRCUITS 
+ 3 


CLOCK 


GENERATION 


CIRCUITS 


CLOCK 


GENERATION 


CIRCUITS 
+ 3 
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A new type of clock driver circuit provides a third 
method of clock generation. This new class of clock 
chip has an internal phase lock loop circuit which syn- 
chronizes the output clock to the input clock. Also, 
since the chip's internal clock is running much faster 
than the input or output clock, multiple (usually 2x) 
frequency components are available. Figure 2-4 shows 
an example of this clock generation method. Here, a 
25 MHz oscillator is used with a clock generation cir- 
cuit to produce the 25 MHz clock for the memory and 
I/O bus modules. This 25 MHz clock is locally stepped 
up to 50 MHz using this new type of clock chip pro- 


CLOCK 
GENERATION 
CIRCUITS 


viding a local, synchronous, 50 MHz clock which has a 
minimal, controlled skew referenced to the 25 MHz 
clock. There are several advantages to this scheme: first 
a lower frequency crystal is required; second, if multi- 
ple 50 MHz modules are required, such as a second 
CPU module for a multi-processing system, a central 
50 MHz signal need not be routed and deskewed across 
a large physical distance. A drawback to this scheme is 
it's relatively new and there are a limited number of 
clock chip vendors. Also, as with other clock genera- 
tion methods, it is important 
to verify the 50 MHz 


clock meets the stability specification of the compo- 
nents. 


CLOCK 
GENERATION 
CIRCUITS 
x 2 


CLOCK 
GENERATION 
CIRCUITS 
x 2 


CLOCK 
GENERATION 
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With high frequency systems, controlling component to 
component skew will be one of the greatest challenges 
of system clock designs. There are numerous sources of 
clock skew as outlined below: 
1) Intrinsic 
Skew-Clock 
skew generated from the 
clock chi~ 
A) Rising edge delay to falling edge delay skew on 
the same output (often called pin skew-tps)' 
B) Skew between the delay from one output to an- 
other output on the same chip (often called out- 
put skew-toJ. 
C) Output to output skew between different chips. 
2) Extrinsic skew-Clock 
skew generated from rout- 
ing/loading- 
A) Skew generated by different rise or fall times. 
B) Skew from different tap points. 
C) Skew from flight time differences as a result of 
trace length or loading differences. 


D) Clock skew generated from poor signal wave- 
form integrity. 


Reference 4 covers the different components of skew 
generated by the inherent limitations within the chip 
design. These skews can only be changed by choosing 
different clock chips. The second class of skew, extrin- 
sic skew, are under the control of the system designer. 


Figure 2-5 illustrates some of these skew components. 
Signals A and B have the same loading, however, the 
resulting waveforms at the destination have a skew. 
This is due to the intrinsic buffer skew, tos. Signal C has 
twice the capacitance of signal A and the extra delay 
and signal rise time result in an even greater skew be- 
tween these two signals. Finally, signal D must pass 
through a transmission line and the resultingjlight time 
delay causes an even greater skew between itself and 
signal A. 
• 


Designing a clock distribution requires the careful ex- 
amination of a series of issues: 
I) Choosing a clock driver. 
2) Determining the layout and loading of the clock sig- 
nal on the board. 
3) Choosing the type of termination. 
4) Simulating the expected performance of the clock 
signal on an electrical simulator (such as SPICE). 


This application note will work through each of these 
steps using the design of the CPU subsystem shown in 
Figure 3-1 below. Here, a 50 MHz Intel486 DX CPU- 
Cache module is interfaced to a 25 MHz memory bus 
with a memory bus controller and a clock generation 
and distribution chip. The example will assume the In- 
tel486 DX CPU-Cache subsystem is running with the 
25 MHz memory bus clock synchronous to the 50 MHz 


CPU core clock. The 82495DX and 82490DX support 
this directly (divided synchronous mode). 


The 82495DX and 82490DX components provide a 
highly efficient 128K Byte, 256K Byte or 512K Byte 
second level cache. One 82495DX cache controller and 
8 (or 9 for parity) 82490DXs implement a 256K Byte 
cache core, requiring only a memory bus controller to 
interface to a high speed system memory bus. The In- 
tel486 DX CPU-Cache module is a multi-package mod- 
ule (MPM) which implements a 256K Byte cache with 
parity along with an Intel486 DX microprocessor. This 
board can interface to the rest of the system through a 
special 240-pin impedance matched connector. 


The Intel486 DX CPU-Cache chip set, operating at 
50 MHz, has the clock requirements shown below in 
Figure 3-2. Skew between the 82495DX and Intel486 
CPU can not exceed 0.5 ns (measured at 1.5V), while 
skew between the 82490DX and either the 82495DX or 
Intel 486DX CPU may not exceed 1.0 ns. 
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Figure 
3-2. Clock 
requirement 
for the 50 MHz Intel486 
OX CPU-Cache 
chip set and module. 
The 7 ns and 20 ns valid 
and period 
times 
are 
minimum 
values 
and the 2 ns rise and fall times 
are maximum 
values. 
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Figure 3-3 shows a conceptual model of the clock dis- 
tribution to the Intel486 DX CPU-Cache subsystem. A 
number of elements are outlined: a clock driver chip; a 
series of clock traces on the motherboard which lead to 
the module's connector; the impedance matched con- 
nector itself; the traces on the module; and, finally the 


CLOCK 
DRIVER 
AMP 
CONNECTOR 
BOARD 
TRACES 


components themselves. For a first order investigation, 
the module traces shown actually reflect the branching 
and loading of the traces implemented on the module. 
For the module, the maximum clock skew is specified 
at the connector inputs. 


MODULE 
TRACES 
• 
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put clock is synchronized to the input clock resulting in 
very little input to output delay. 
There are two basic types of chips available: buffered 
master 
clocks and phase locked loop synchronized 
clocks. Figure 3-4 shows the differences between these 
two types of components. With the buffer version, an 
incoming clock signal, either from an oscillator or a 
master clock, is buffered and driven back out on multi- 
ple outputs. Expect to see a significant delay between 
input and output signals. With a PLL circuit, the out- 


Table 3-1 shows some of the driver chips which may be 
used for clock generation. Clearly, there is no standard 
in any parameter, type, package, voltage levels, number 
of outputs, etc., making the choices very difficult. A 
good strategy may be to complete both a primary and 
alternate design for safety. Each parameter needs to be 
examined for implications on your design. 


Voltage 
Controlled 
O.cillalor 


_. 
€: 
Manufacturer(1) 
Device 
Technology 
Type 
Package 
Max 
tos 
Rise/ 
Max 
Level 
Clock 
Output 
Fall 
Delay 
In/Out 
Out@2 


Texas 
74ACT 
CMOS 
Buffer 
20-Pin DIP 
NA(3) 
1 ns 
NA 
NA 
TIll 
CMOS 
4@ 1xand 
Instruments 
11208 
LCC/Plastic 
4@ 1x 


or Ceramic 


IDT 
49FCT805A 
CMOS 
Buffer 
20-Pin DIP, 
NA 
0.7 ns 
NA 
5.8 ns 
TIll 
CMOS 
5@1xand 
49FCT806A 
SOIC, 
5@ 1x 


CERPACK, 
LCC 


Silicon 
SC3501 
BiCMOS 
Buffer 
52-Lead 
80 MHz 
0.5 ns(4) 
2 nsl 
NA 
PECLor 
10 @ 1x 
Connection 
PGFP 
2 ns 
TIL/TIL 
5@ 1xorO.5x 
5 @ 0.5 or 0.25x 


Motorola(5) 
MC88915 
NA 
PLL 
28-Pin PLCC 
80 MHz 
0.5 ns 
2.5(6) 
0.5 ns 
TIL/CMOS 
5@ 1xand 
@ 
1@2xand 


50pF 
1 @ 0.5x 


Gazelle 
GA 1110 
GaAs 
PLL 
16-Pin DIP 
50 MHz 
0.5 ns 
1.5 
0.5 ns 
TILITIL 
6@ 1x 


PLCC Due 


Gazelle 
GA 1210 
GaAs 
PLL 
16-Pin DIP 
100 MHz 
0.5 ns 
1.5 
0.5 ns 
TILITIL 
2 @ 1x and 
PLCC Due 
4@2x 


National Semi. 
CGS74 
FACT 
Buffer 
14-Pin PDIP 
70 MHz 
0.7 ns 
2 
12.4 ns 
TIL/TIL 
8 @ 1x 
CT2525 
SOIC 
or CMOS 


NOTES: 
1. Check with manufacturers 
for the most accurate 
information. 


2. Number 
or clock 
outputs 
specified 
as a fraction 
of the input clock frequency. 
For example 
5 @ 0.5x indicates 
there 
are 5 copies 
available 
at % the input frequency. 


3. NA indicates 
the data is not available 
on the data sheet. 


4. 0.25 ns skew within 
a group of 5 outputs. 


5. This device 
requires 
1 external 
resistor 
(1.0 KIl) and 2 external 
capacitors 
(0.01 I£F and 5.0 nF). 
6. Rise and fall times 
are 0.2 Vcc 
to 0.8 Vcc. 
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Multiple 50 MHz 
Outputs for 
Intelo486 OX 
CPU-Cache Module 
and its MBC 


Packaging is important because of the board area avail- 
able and the need (or inability) to use surface mount. 
Maximum skew and rise/fall times are important in 
meeting the Intel486 DX CPU-Cache chip set specifi- 
cation. Also, a skew of 0.5 or 0.7 ns uses most of the 
skew budget at the clock driver and leaves little margin 
for routing generated skew. 


When considering the rise and fall time specification 
(which many manufacturers do not specify in their pre- 
liminary data sheets) it is important to understand the 
effect of additional loads. Only the Motorola 88915 
specified the rise and fall times at a given capacitance. 


Delay will be an important consideration for the syn- 
chronization required. The memory bus controller may 
be sensitive to skew between the memory bus clock and 
the CPU core clock. 


Finally, the number of outputs available, their drive 
capability and the relative frequency is critical. Insuffi- 
cient numbers of copies will require multiple chips, in- 
troducing synchronization and chip to chip skew issues. 
It is critical to determine the maximum load any single 
output can drive while maintaining the clock require- 
ments shown in Figure 3-2. 


A technique for driving additional load without suffer- 
ing the skew resulting from using another clock driver 
output is to connect together multiple clock outputs. 


Note, however, this technique may not work with cer- 
tain clock driver chips. Consult the data sheet or the 
clock manufacturer before using this technique. 


For the design example an input 25 MHz clock must be 
stepped up to 50 MHz and distributed to the module 
and the memory bus controller. The Intel486 DX CPU- 
Cache module requires four separate clock inputs. The 
memory bus controller requires at least one input and 
preferably more. One possible design is shown in Figure 
3-5. Here a PLL clock chip is used as a clock doubler to 
generate a synchronous .50 MHz clock. This 50 MHz 
signal is then buffered with another clock driver chip to 
produce multiple 50 MHz clocks. 


The most critical aspects of signal routing are maintain- 
ing equal route lengths and loads for equal flight times. 
Figure 3-6 shows several methods of routing clock sig- 
nals. These include distributing the loads along clock 
route at various tap points, building a star configura- 
tion and designing a series of branches. The tapped lay- 
out, while often minimizing routing, suffers from un- 
equal flight times of the clock to the various tap points. 
Thus, it would be difficult to control skew between the 
first and last loads. With the star or branched layouts, 
the flight distance of each leg can be carefully matched. 
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Our analysis and experiments show that right angles in 
traces, vias and other similar discontinuities produce 
rather minimal effects. They should be avoided if possi- 
ble, but 
are not 
disastrous. 
At 
frequencies 
above 
50 MHz, however, these layout parameters will become 
much more crucial. 


This leaves the routing up to the connector and the 
routing to the MBC as the critical components remain- 
ing. The Intel486 DX CPU-Cache module clocks are 
designed such that equal traces to the connector will 
result in near zero flight time skews. Thus, routing to 
the MBC will be the most challenging aspect. Applying 
the same ideas discussed here will result in minimal 
flight time skews. 
Figure 3-7 shows the routing of the clocks on the In- 
tel486 DX CPU-Cache module. The layout is carefully 
designed to eliminate any flight time skew. 
• 
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Termination of the clock signals is a critical design pa- 
rameter. The module's traces and connector are well 
matched at son. If the impedance of the board traces 
from the clock chip to the connector are controlled to 
son, the remaining variable will be the output imped- 
ance of the clock driver chip. Unfortunately, determin- 
ing this may not be a trivial procedure. The first step 
would be to contact the clock manufacturer and deter- 
mine if they supply the clock driver's output impedanc- 
es or I/O models of the driver. If not, Appendix A 
outlines three methods of estimating the driver's output 
impedance. 


There are two basic problems with matching a driver's 
output impedance. First, the driver may have a differ- 
ent pull-up to pull-down output impedance. This drasti- 
cally complicates termination. Second, output imped- 
ances are typically not specified on the clock chip data 
sheets. Thus they are not guaranteed by the manufac- 
turer's testing programs. 


If the driver's output impedance matches the imped- 
ance of the clock traces, no termination is required. 
However, this is not likely to be the case. If termination 
is required, series termination will probably be the best 
choice. Reference 5 outlines the various approaches and 
concludes that series termination 
is the best design 
choice for star configuarations. 


Ideally, series termination into a son transmission line 
with a star configuration should be son minus the driv- 
er's impedance. In practice a combined impedance of 
35n-50n 
may yield acceptable results. The driver it- 
self can range from 7n-35n. 
After determining the 
output impedance of the clock chip being used, the ap- 
propriate series termination can be chosen. For exam- 
ple, if the clock chip being used has a IOn output im- 
pedance, then a 25n-40n 
series resistor may be need- 
ed. If the clock chip's output impedance is 35n, then 
series termination may not be necessary. Note that if 
your clock routing is based on a taped configuration, 
then series termination may not be adequate. 


The initial design choices will need simulation with a 
circuit simulator such as SPICE for final verification. 
There are a number of issues which need to be ad- 
dressed: 
I. Methods to model the transmission lines. 
2. Generation of a netlist from layout database. 
3. Modeling the clock driver. 
4. Modeling the connector if the module is being used. 
5. Input 
models 
of 
the 
Intel486, 
824950X 
and 
824900X 
components, clock inputs or the input 
model of the module's clock pins. 


The transmission line can be modeled in one of two 
ways, depending on the simulator chosen. First, if the 
simulator has a transmission line model built into it, 
this will greatly simplify the task. Care should be taken 
to verify the accuracy of this model by contacting the 
simulator's vendor. If the transmission line model is 
available, the line segment can be simply modeled as a 
separate component 
with parameters 
of length and 
characteristic impedance as show below 


where first_node 
and second_node 
are the netlist 
names of the segments connections, 1.2 in is the length 
of the segment in inches and son is the characteristic 
impedance. If the simulator does not have a built in 
transmission line model, discrete inductors and capaci- 
• 
tors can be used. The values of the inductors and capac- 
itors will vary with the board's 
characteristics 
and 
should be supplied by the board manufacturer. Intel's 
analyses have shown that at least one inductor and ca- 
pacitor pair per '/3 inch of PC trace yields sufficient 
accuracy. The example below shows a PC trace mod- 
eled by a capacitor and inductor. 


11first-Jlode 
seconLnode 
25 nh 
Cl seconLnode 
vss 
1.0 pF 


The netlist can be automatically generated by most PC 
board design tools. In addition, most tools will supply a 
switch which allows PC traces to be broken into trans- 
mission line or L-C segments. If designing with the In- 
tel486 OX CPU-Cache module, reference 6 will give a 
detailed model of the module's clock traces. 


When attempting to model the clock driver, first con- 
tact the manufacturer for I/O models. If these are not 
available, a simple model must be constructed. First, 
use the method outlined in Appendix A to estimate the 
output impedance of the driver. This can be used in the 
simple model shown in Figure 3-8. The last element of 
this simple model is the voltage source. The rise time of 
this element is equivalent to the rise time measured on 
an unloaded clock driver chip. Also, note that for many 
drivers, the output resistance and driver dv/ dt may dif- 
fer between the rising and falling transitions. 


If the Intel486 OX CPU-Cache module is being used, 
the connector must also be accurately modeled. AMP 
provides a SPICE model for al1module customers (Ref- 


erence 7). This model emulates the impedance match- 
ing characteristics of the connector and faithfully re- 
produces any mutual inductance effects. 


Finally, the input pins of the Intel486 CPU, 8249SDX 
and 82490DX components must be modeled. The Intel 
reference 8 accurately specify models for these pins. 


With all of the pieces now modeled, the task turns to 
simulating the network and evaluating waveforms for 
skew and quality. In addition to meeting the clock spec- 
ifications shown in Figure 3-2, ringback and settling 
time must be evaluated for spurious levels which would 
violate the clock input limits. 


An important consideration when modeling the input 
circuits of typical Intel components is the inclusion of 
diodes to the supplies. Most CMOS components includ- 


-1.00000 


0.0000 
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ing 
the 
Intel486 
microprocessor, 
8249SDX 
and 
82490DX contain these diodes to prevent damage to 
the input buffers due to electrostatic discharge. When 
first simulating the clock circuits it is advisable to not 
include these input diodes. The reason is their clamping 
action could hide damaging amounts of energy being 
driven into the input pins of the components. Final sim- 
ulations could include these diodes in order to more 
closely model the actual waveform shapes. 


Figure 3-9 shows the result of a simulation of the clock 
signals 
delivered 
to 
the 
Inte1486, 
8249SDX 
and 
82490DX components on an Intel486 DX CPU-Cache 
module. The simulation is for a hypothetical clock driv- 
er modeled as shown in Figure 3-8 and unloaded wave- 
form as shown for UCLOCKE in Figure 3-9. This sim- 
ulation shows the clock signals meeting all of the input 
clock specifications outlined in Figure 3-2. 


Figure 3·9. SPICE Simulation 
of Clock Waveforms 
at the Inputs 
to the Inte1486™, 
82495DX 
and 82490DX 
Components 


This application note has discussed techniques for the 
design of clocks in high speed, 50 MHz Intel486 sys- 
tems. At these frequencies, edge rates and trace lengths 
cause routes to behave as transmission lines with their 
associated 
characteristics. 
For 
the system designer, 


clock routing issues become important enough to affect 
the overall system design. 


A second level cache such as implemented 
by the 
82495DX and 82490DX components provides the po- 
tential for a frequency isolation function by decoupling 
the memory bus speed and the CPU bus speed. Thus, 
hierarchies of buses can be produced, isolating the diffi- 
cult 50 MHz signals. 


A number of clock generation and distribution tech- 
niques were discussed including a traditional centrally 
generated master clock, local asynchronous clocks and 
the new technique of locally regenerated synchronous 
clocks. This latter techniques is made possible by a new 
class of clock chip which have internal phase locked 
loops. 


The sources of clock skew were outlined and broken 
into two major categories: intrinsic skew, which are at- 
tributable to the clock generation chip; and extrinsic 
skew, which is caused by the routing and loading of the 
clock lines themselves. 


To address intrinsic skew, various clock drivers were 
compared and contrasted. There is no clear choice, as 
different clock chips fit various system configurations. 


To address extrinsic skew, clock signal layout and ter- 
mination were discussed. Either a star or tree clock 
layout with balanced loads is recommended. 
Series 
source termination is also recommended. The value of 
the termination resistor will depend on the characteris- 
tic impedance of the PCB traces and the output imped- 
ance of the clock driver. 


Finally, electrical simulation with a tool such as SPICE 
becomes essential at these speeds. Methods to model 
transmission lines, generate a netlist, model the clock 
driver and module connector and inputs to the Intel486 
DX, 82495DX and 82490DX components were dis- 
cussed. 
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APPENDIX 
A 
METHODS TO MEASURE THE OUTPUT 
IMPEDANCE 
OF A CLOCK DRIVER 


In this method, the driver's impedance is determined by 
measuring the output I-V. Figure A-I illustrates this 
method for determining the pull-up impedance. The 
50n load line is drawn and the intercept with the out- 
put driver characteristic is also the intercept of a second 
load line drawn through the initial voltage of the driver. 


Figure A·1. Extraction 
of Output 
Driver Pull-Up 
Impedance 
with the I-V Method 


The second method of estimating the clock driver's out· 
put impedance is diagramed in Figures A-2 and A-3. 
The 10% to 90% rise time is measured for a range of 
load capacitances. Plotting the resulting data will yield 
a line whose slope is proportional to the driver's output 
impedance. A derivation of the slope (and the constant 
k) is shown below: 


where to = the unloaded rise time (the y intercept in 
Figure A-3) 


dt, = kR 
dC 


let 
T = RC 


. 
VI!) = VI (1 - e-tlT) 


0.1 = (1 
- e-110%IT) 
then t10% = 0.105T 


0.9 = (1 
- e-190%IT) 
then t90% = 2.303T 


t90% - 
t1O% = (2.303T 
- 
0.105T) 
= 2.198 
RC 


Figure A·2. Test Circuit and Response 
for the Derating 
Curve Method 
of Extracting 
the Output 
Driver's 
Impedance 


Figure A-3. The derating 
curve of an output 
driver. The slope Is proportional 
to R. 


• 


Another method uses the clock chip driving an open 
transmission line of known impedance as shown in Fig- 
ure A-4. The voltage will initially rise to the value Vx at 
point A as shown in Figure A-5. This is the result of the 
voltage divider across the driver and transmission line 


Once the wave reaches the endpoint of the transmission 
line it reflects back to the source. The wave will contin- 
ue back and forth several times until the voltage at 
node A eventually rises to its final value. Vp. With 
Vx,vp and ZL known Rs can be determined. 


Figure A-4. Test Circuit Used to Extract 
the Output 
Impedance 
with the 
Open Transmission 
Line Method 


Voltage 
at point 
A 


Figure A·5. Output at Node A of a Driver 
Into an Open Transmission 
Line 
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1.0 OVERVIEW 
OF APPLICATIONS 
NOTE 


The design flexibility of Intel's high-performance sec- 
ond-level cache solution (82495DX/82490DX) allows a 
wide spectrum of implementations to match a diversity 
of system requirements. These implementations range 
from a simple write-through design suitable for single 
processor systems through to a comprehensive write- 
back design optimized for multiple processors. This 
Applications Note focuses upon the design process re- 
quired to build the memory bus controller (MBC) that 
interfaces the second-level cache to the system's memo- 
ry bus. 


The example chosen as a vehicle to demonstrate this 
process is an MBC for a uni-processor Intel486 DX 
system. 
A design is presented 
that 
combines 
the 
50MHz Intel486 DX CPU and 256KB-second level 
cache module with an MBC that interfaces to a 25MHz 
Intel486 DX-like memory bus. The resulting design can 
plug into the Intel486 DX socket of an existing system. 
Schematics and EPLD codes are included. 


2.0 
INTRODUCTION 
TO A SECOND 
LEVEL CACHE SOLUTION 


Figure 2-1 shows a block diagram of an Intel486 DX 
microprocessor tightly coupled to a second level cache 
utilizing the 82495DX cache controller 
and several 
82490DX dual-ported intelligent cache SRAMs. This 
cache subsystem was designed as a set of components 


and they interface gluelessly to the Intel486 DX micro- 
processor 
to provide 
zero wait state 
operation 
at 
50MHz. They take full advantage of on-chip silicon 
flexibility to provide a degree of capability and per- 
formance otherwise unachievable with discrete imple- 
mentations. The interface to the memory bus was de- 
signed with flexibility in mind and features may be se- 
lectively implemented via the MBC design to provide a 
wide range of price/performance 
implementations. 


2.1 
Main Components 
of the Cache 
Subsystem 


The 82495DX cache controller forms the heart of the 
cache subsystem. The 82495DX includes the tag array 
with line state information to provide hit or miss deci- • 
sions. It handles the CPU bus requests completely and 
coordinates with the memory bus controller when an 
access needs the memory bus; CPU accesses which can 
be serviced locally by the cache subsystem are filtered 
out from the memory bus traffic. The 82495DX moni- 
tors the memory bus and performs snoop operations 
when other bus masters are using the memory bus. 


The 82490DX dual-ported 
intelligent cache SRAMs 


implement the data storage and data paths. They in- 
clude latches, mllltiplexors and other logic which allow 
operation in lock-step with the 82495DX to efficiently 
serve both hit and miss accesses. They support zero 
wait state hit accesses and permit concurrent CPU and 
memory bus operations. 


The Memory Bus Controller (MBC) is a user-definable 
array of logic that interfaces the CPU-Cache subsystem 
to a user-definable memory bus. The MBC adapts the 
CPU-Cache subsystem's generic protocol to a specific 
memory bus protocol. During the definition of this so- 
lution it was decided NOT to include a specific inter- 
face to a specific memory bus into the component im- 
plementation since such integration suffers from inflexi- 
bility and bandwidth limitations. The characteristics of 
the memory bus is pivotal to the performance and cost 
targets of a system solution, so if the memory bus were 
"hardwired" 
into the cache controller, it might be too 
costly for small systems and too slow for larger sys- 
tems. With the memory bus interface implemented sep- 
arately, it can be a complex ASIC for a high-bandwidth 
multi-processor system, or a few EPLDs for a basic 
implementation. Moreover, this key design point may 
be made into a product differentiator in the market- 
place. The focus of this Applications Note is a step-by- 
step design of a 'typical' MBC - this Note is the first in 
a series of MBC implementation descriptions and cov- 
ers the process of designing a Memory Bus Controller. 


3.0 
MEMORY 
BUS CONTROLLER 
DESIGN 
ALTERNATIVES 


This section focuses upon the architectural model for 
an MBC and introduces some of the design flexibility of 
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Intel's solution. The implementation 
alternatives are 


discussed and a framework for decision-making is built. 


The architectural model for the MBC is shown in Fig- 
ure 3-1. 


Microprocessors are quickly achieving operating fre- 
quencies which can be very difficult for a memory sub- 
system to meet. A key part of the MBC model therefore 
includes the ability for the memory bus to operate asyn- 
chronously or at a different frequency from the CPU's 
clock frequency. This decoupling of the memory bus 
frequency from the CPU frequency allows a faster mi- 
croprocessor to be included with no impact upon the 
memory bus design; the MBC must, of course, compre- 
hend this higher CPU frequency. 


Most of the MBC operates at the CPU's clock frequen- 
cy, CLK. While it is possible to design the memory bus 
to operate at the same frequency as the 50MHz CPU 
clock, such a fast design is difficult to implement. The 
MBC architecture allows these high frequencies to be 
contained 
within 
the 
CPU-Cache-MBC 
core. 
The 


memory bus can operate in clocked or strobed modes. 
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In clocked mode the memory bus clock, MCLK, may 
be synchronous or asynchronous to CLK. The MBC 
defines synchronizers for the few signals which cross 
the two timing domains - these are not required in a 
synchronous design where MCLK is equal to or is a 
sub-multiple of CLK. 


An MBC is implemented in a few basic blocks as shown 
in Figure 3-2. The 82495DX cache controller requests 
control of the memory bus by signalling the MBC. The 
MBC is responsible for arbitrating and granting the bus 
to the 82495DX. Once granted, the MBC is responsible 
for executing the requested cycle, getting a snoop result 
from the other caches, and ending the cycle. The 
82495DX supports different modes of snooping, differ- 
ent modes of memory bus operation, and various spe- 
cial cycles. The MBC design dictates which of these 
features are used, and exactly how they are used. 


An MBC consists of a few basic blocks: a cycle control 
block (with synchronizers if necessary), a data path 
control block, a snoop logic block and a test block. The 
cycle control block must interface to some arbitration 
logic in order to take control of the memory bus and 
the snoop block must be able to communicate with the 
other caches when snooping is necessary. 


Cycle control logic is responsible for initiating and ter- 
minating a memory bus cycle. Cycle control logic also 


provides cycle attributes such as cacheability, whether 
it is allocatable, pipelining, and all aspects of the prog- 
ress of the current cycle. 


The MBC supports all types of memory bus interfaces; 
the memory bus interface can be clocked or strobed. 
Table 3-1 details the engineering trade-offs for these 
types of memory buses. 


Since cycle control logic interfaces memory bus signals 
to the 82495DX, and since the memory bus is not nec- 
essarily synchronous to the 82495DX CLK, it must 
also provide proper synchronization. Careful design of 
this synchronization 
logic can minimize or eliminate 
synchronization penalties. 


The data path control logic determines how data is 
written to, or read from, the 82490DX's. It handles the 
actual transferring of data to/from 
the memory data 
bus, including the CPU burst order and the control of 
data during allocation cycles. In systems with memory 
buses that are wider than the CPU bus, the data path 
control logic controls the different data requirements of 
the CPU and the 82490DXs. 


The memory data bus can be 32-bits (the same as the 
Intel486 DX CPU), 64-bits or 128-bits wide; this will 
require four, eight or sixteen 82490DXs to implement 
the cache memory. This data cache memory may be 
parity protected; an 82490DX may be configured as a 
parity device and one can be used for up to each eight 
data cache devices. 


,. 
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Advantages 
Disadvantages 


Clocked 
Design techniques 
for clocked 
systems are 
Must round-up delays to MClK 
period quanta. ie 
well known. 
30nsec means two 25nsec 
MClKs. 


Fast Arbitration 
using MClK 
state machines. 
Some cache-to-cache 
signals must be 
synchronized 
twice, at sender & receiver. 


Burst transfers 
proceed 
at MClK 
rate. 
Memory bus length is limited. 


MClK 
skew must be controlled. 


Strobed 
Delays determined 
by device speed and 
System design may require delay lines or non- 
physics, 
not by MClK 
quanta. 
conventional 
design techniques. 


Each signal only passes through 
synchronizer 
Arbitration 
slow because 
signals must be 
once, at the receiver. 
synchronized 
at the arbitor. 


Fewer limits on bus length. 
Burst throughput 
is slowed if each transfer 
requires acknowledgement 
from receiver. 


Fewer clock skew worries. 


There are two functions within the snoop control block. 
When the MBC is master on the memory bus it must 
snoop all other cache controllers - in this mode it is a 
snooper (similar to an employer, i.e. in control); when 
another master has control of the memory bus the 
snoop control block must respond to a snoop request - 
in this mode it is a snoopee (similar to an employee, i.e. 
being controlled). 


As a snooper the MBC snoop control logic must recog- 
nize the responses from the memory bus and alter the 
current cycle appropriately. This could mean aborting 
the cycle, delaying the cycle until a write-back is per- 
formed, or changing the master's tag state according to 
the returned snoop information. 


As snoopee the MBC snoop control logic must initiate 
a snoop into the 82495DX and provide a response to 
the rest of the system (the snoop logic must also recog- 
nize when its 82495DX is not capable of snooping and 
delay its snoop initiation). The MBC may initiate this 
snoop request in one of three modes: synchronous, 
clocked, 
and 
strobed. 
The 
snoop 
response of the 
82495DX is always synchronous to the CPU CLK. 


When initiating the snoop in synchronous snoop mode, 
all snoop information is latched by the 82495DX syn- 
chronous to the CPU CLK. The snoop is then per- 
formed on the next CLK edge and the response given 
on the CLK edge after that. This is the fastest possible 
method of snooping. 


In clocked snooping mode, information is latched by 
the 82495DX with respect to an external snoop clock 
source (which is typically slower than CLK). 
The 


82495DX will internally synchronize this information 
to CLK and provide a response. 


In strobed snooping mode, information is latched into 
the 82495DX with respect to the falling edge of a refer- 
ence signal. Thus, the snoop initiation is clock indepen- 
dent. The 82495DX again synchronizes this informa- 
tion with CLK. 


Note that choice of snooping modes is an orthogonal 
choice to the memory bus mode; the two are not linked 
providing additional design flexibility. 


The 82495DX and 82490DX provide two means of 
testing the cache subsystem, a built-in self-test and 
boundary scan test. The built-in self-test (BIST) can be 
initiated during RESET along with the CPUBIST. The 
boundary scan test uses separate and dedicated pins on 
the both components. More details can be found in the 
Intel486 DX CPU-Cache module hardware reference 
manual. 


This section reviews the key decision criteria 
that 


should be considered during the design of an MBC. 
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Certain choices will limit the options on later choices so 
it may be necessary to use an iterative approach to 
reach a fmal implementation plan. We shall start from 
the center of the design and work outwards. 


4.1 
Range of Memory Bus 
Implementations 


The MBC interface was defmed with a minimal set of 
memory bus implementation assumptions in order to 
maximize design choices. Table 4-1 lists a wide range of 
implementations 
possible using the 824950X 
cache 


controller and 824900X cache SRAMs. 


For desktop PCs, a 32-bit simple memory bus is ade- 
quate. For a server, workstation or small multipro- 
cessor of two CPUs, a faster 64-bit bus may be required 
to give adequate bandwidth for graphics frame buffers 
and intensive numeric calculations. Bus bandwidth re- 
quirements grow as the MIPS rating of each CPU in a 
system grows; for example, a bus adequate for twelve 
Inte1386™ OX CPUs may be too slow for six In- 
tel486TM OX CPUs, as they process far more data per 
second. 


A large multiprocessor system of six or more CPUs 
may need a wide and fast bus such as Futurebus +, 
with split-transaction capability to prevent bus bottle- 


necks from slowing the performance of every processor. 
Hierarchies of buses and caches can further allow more 
CPUs with reasonable performance increases as CPUs 
are added. Compatibility with existing buses is often 
crucial in product design, so that new faster compo- 
nents can plug into existing machines and I/O devices. 
Intel's flexible cache subsystem bus interface allows 
compatibility as well as extension. For proprietary bus- 
es, the "proprietor" 
can design an ASIC or EPLO 


based MBC incorporating the required features. 


The first stage in the definition of an MBC is the choice 
of the memory bus. A short review of Table 4-1 should 
give a first approximation to your memory bus feature 
• 
set. There are many degrees of freedom possible and an 
overriding consideration will be the best price/perform- 
ance for your target application. 


For this design example a well-known memory bus is 
chosen - the Intel486 OX CPU bus. The design will 
include a 50MHz Intel486 OX CPU-Cache module and 
MBC which can plug into the Intel486 OX socket of an 
existing host system, as shown in Figure 4-1 - 
this 


effectively produces a Intel486 OX 'super' CPU mod- 
ule with a very large internal cache. 
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update policy. This design example will implement 
both the write-back and write-thru policies and these 
will be selectable via a jumper. When write-thru is se- 
lected, the cache-line state transitions simplify to that 
shown in Figure 4-3. 


The 8K-bytes of internal Intei486 OX cache supports a 
write-through memory update policy. The 824950X/ 
824900X second level cache solution can support the 
full MESI cache protocol (see Figure 4-2) - 
which can 
be used for either a write-back or write-thru memory 
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NOTES: 
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Figure 4-2. MESI State Transitions 
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Figure 4-3. MESI Simplified 
to Support 
a Wrlte-Thru 
Policy 
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Use of the write-back policy is a little complicated since 
the host system that this design will plug into probably 
only supports the write-thru policy of the Intel486 DX. 
Therefore, the only snoops generated to the Intel486 
DX socket will be a result of non-caching device (e.g. 
DMA) writes via the EADS signal (no snoops occur on 
reads). Furthermore, in the case of a snoop hit to modi- 
fied data, there are no means by which the MBC can 
tell the the non-caching device to back-off to allow the 
intervening snoop write-back cycle. Three methods al- 
low this example design to work in write-back mode 
under these circumstances. These are described as fol- 
lows: 


Write-back mode can still be used to gain extra per- 
formance if the software used can assure that DMA-ac- 
cessed pages are either non-cacheable or write-through 
using the Inte1486 DX CPU paging mechanism with 
PCD = 1 or PWT = I. If this is possible, then DMA-ac- 
cessed cache-lines stay in either the I or S states. As 
such, snoop write-back cycles will never be required. 


Of course, if no DMA device or secondary bus master 
exists in the host system, then snoops will never be 
required. This would be the case for a typical PC sys- 
tem that does not use the floppy system and uses a 
CPU-accessed hard disk drive - 
such as an IDE drive. 


If control over the software is not possible and a DMA 
device does exist, then the design may be modified to 
support both DMA write and read snoops as well as 
snoop write-back cycles. This is described briefly in sec- 
tion 5.3.5.2. 


A wider, faster memory bus can otTer some perform- 
ance gains but at the cost of some implementation ex- 
pense. This design example will use the Intel486 DX 
CPU bus which is 32-bits and supports operating fre- 
quencies of 25MHz, 33MHz and now 50MHz. This de- 
sign implements a 25MHz memory bus but should be 
easily modified to run at 50MHz. This choice defines a 
synchronous interface so synchronizers will not be re- 
quired in the MBC implementation. 


Timely access to the memory bus is key for high per- 
formance but this bus is shared by other devices such as 
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DMA. The arbitration mechanism can favor the CPU 
if performance is a key design parameter or could favor 
the I/O driven DMA channels if 110 throughput is of 
higher importance in your design. This design example 
will use the centralized arbitration scheme defmed by 
BREQ, HOLD and HLDA. 


Modem 
microprocessor 
techniques used to improve 


performance include pipe1ining and split transfers. This 
design example will focus upon simplicity so neither of 
these features will be implemented. The design example 
will use the Intel486 DX CPU bus which supports 
burst reads but not burst writes (since the Intel486 DX 
CPU never has to flush a modified cache line). LOCK 
and PLOCK cycles are also implemented. 


This example design relies upon the EADS'" 
signal 


from the host system to initiate an invalidating snoop of 
the cache. Snoops are assumed to only occur when the 
host system has been given back a hold acknowledge 
(HLDA) from the MBC's arbitration control; i.e. when 
the host system thinks the Intel486 DX CPU/cache 
is 


in the HOLD state. Jumper options are provided to 
evaluate two modes of initiating snoops to the cache - 
synchronous and strobed. 


The choice of memory bus width will limit the choice of 
second level cache sizes. If the cache size you had 
planned is not available then the memory bus width 
decision should be re-evaluated. The current 82490DX 
cache SRAM supports cache sizes of l28K, 256K and 
512K-bytes - 
future implementations of the 82490DX 


component may extend this limit. 


Referring to Figure 4-4, there are four options for a 32- 
bit memory bus. With the use of the Intel486 DX CPU- 
Cache module, the cache size is selected as 256K. That 
leaves us with two options. 


In order for our module to look like an Intel486 DX 
CPU to the host system it is plugged in to, we shall 
choose four transactions per cycle - 
this is configura- 
tion 3 in Figure 4-4 with a line ratio of one, two lines 
per sector and 16 bytes per cache line. 
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Figure 
4·4. The Choice 
of Memory 
Bus Width 
and Cache 
Size are Coupled 


In order to simplify the example design, some addition- 
al assumptions were made regarding the host system: 
- 
It is assumed· that the host system has no parity- 
DPO:3 and PCHK # are not used. 


- 
All transfers on the memory bus are assumed to be 
to 32-bit devices - 
BS8# and BSI6# 
are no con- 
nects. 


- 
BOFF # is assumed to be a no-connect on the host 
system. 


- 
MKEN # from the host system is ignored. The 
cacheability attribute is generated by a decoder in 
the MBC. 


Of course, the reader is free to modify the MBC exam- 
ple design described here to support the functionality 
required by the signals above. 


Finally, use of the boundary scan logic (TDI, TDO, 
TCK, TMS pins) is not implemented. 


Now that the system design decisions have been made, 
this section details the logic design process for the 
MBC. 


As discussed earlier, this design implements a 25MHz 
memory bus. This choice is called a divided synchro- 
nous clock scheme and synchronizers are not required. 
The timing reference is derived from the 25MHz clock 
input of the Intel486 DX CPU socket. One Gazelle 
GAl110 and two Gazelle GAl210 phase locked loop 
(PLL) clock doubler chips are used to derive 50MHz 
reference clocks: one for the CPU-Cache Module and 
one for the MBC PLDs (refer to Figure 5-1). The 
GAI210 is able to provide the four 50MHz clocks into 
the CPU-Cache module with a maximum output-to- 
output skew of 500ps. 


All of the MBC's state machines operate at 50MHz. In 
order to interface to the 25MHz host system's Intel486 
DX interface, a delayed version of the 25MHz clock is 
generated by the GA 1110 in Figure 5-1: a delayed 
MCLK. This signal is used as an input to the MBC 
state machines as a qualifier for signals to and from the 
host interface. This MCLK scheme assures that the 
MBC outputs mimic a 25MHz Intel486 DX CPU inter- 
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PLL: 
GA 
1110 


I 
I 
Delayed MCLK 
: 
,-----\ 
I I 
25MHz 
~ 
• 
I. 


25 MHz SIGNALS ARE SAMPLED 4 
BY THE MBC WHEN THE DELAYED---! 
MCLK IS LOW 


face. It is used to maintain the MBC outputs active for 
two CLKs and helps to correctly sample the memory 
bus inputs to the MBC, such as MBRDY # (so they are 
not sampled active twice mistakenly). 


Finally, note that the use of PLLs is only required be- 
cause of the need to align the 50MHz clocks with the 
host system's clock. If the clocks are generated by the 
MBC and supplied to the host system instead, simpler 
low-skew clock butTers may suffice. 


The configuration options of the cache subsystem are 
selected by driving certain inputs of the 82495DX and 
82490DX during RESET. Note that many of the inputs 
are configuration inputs only during RESET and are 
used for operational inputs or outputs following RE- 
SET. An example of this is the KWEND#[CFG2) 
in- 
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put to the 82495DX that is forced active during reset 
by the XWND state machine but is used to indicate 
cacheabi/ity 
window end during normal operation. The 


configuration inputs used during reset are tabulated in 
the Appendix. 


Table 5-1 lists the configuration options for this design. 
These signals are used to configure different modes dur- 
ing operation. 
Most of the jumper options included 
here are intended for evaluation purposes. They may be 
removed if so desired. 


The operation of the CPU-Cache subsystem's interface 
is well defined (refer to Section 9) and since the memo- 
ry bus is an Inte1486 DX bus then this too is well de- 
fined. The logic task then, entails matching the proto- 
cols of the two bus specifications. 


Signal names as defined in the data sheets for the In- 
te1486 DX 
CPU, 
82495DX 
cache 
controller 
and 


82490DX SRAMs will be used throughout this discus- 
sion (refer to Section 8.0 for additional information). 
Signals from the memory bus are prefixed with an 'M'. 


In the illustrated state diagrams, the following conven· 
tion is used: During transitions, a hash (#) following a 
signal name indicates that the signal is not asserted; and 
a signal name without a hash indicates that the signal is 
asserted (e.g. BGT means that BGT# =0). Signals that 
appear inside states bubbles indicate that the signal is 
asserted 
(e.g. 
BGT 
inside 
a 
bubble 
means 
that 


BGT# =0 in that state). 


The delayed MCLK used by the MBC state machines is 
referred to as simply MCLK in the state machine de- 
scriptions. 


Signal 
Jumper 
Used by: 
Explanation 
Oumper setting) 
Normal 
operation 


MWBWT# 
J6 
82495DX 


, 


Write-thru 
mode (if installed) 
Installed 
Write-back 
mode (if open) 


DRCTM# 
J7 
82495DX 
Enable direct-to-modified 
transition 
Open 


(if installed) 


SLFTST# 
J2 
XCRDY state machine 
Enables CPU-Cache 
self test (installed) 
Open 


SNPDIS# 
J11 
Snoop state machine 
Disable snoops 
(installed) 
Open 


SNPNCA 
J5 
82495DX 
Non caching 
snoops (open) 
Open 


ENKEN# 
J3 
Address 
Decoder 
Enable caching 
(installed) 
Installed 


EPKEN# 
J8 
Address 
Decoder 
Enable BIOS caching 
(installed) 
Installed 


RFO# 
J10 
82495DX 
Enable Read for ownership 
Open 
Oumper installed) 


CPLOCKEN 
J13 
State machine 
Enable PLOCK cycles (open) 
Open 


ENALLC# 
J4 
Address 
Decoder 
Enable allocations 
(installed) 
Installed 


SNPCLK 
J14 
82495DX 
Select synchronous 
(installed) 
or 
Open 


strobed 
(open) snooping 
modes 


FLUSH# 
J12 
82495DX 
Enable flushing 
(installed) 
Installed 


EWRMRST# 
J9 
XWRMRST 
state machine 
Select warm reset option 
Open 


(see section 
5.3.9) 
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5.3.1 824950X 
CYCLES 
VERSUS 
Intel486™ 
OX 
CYCLES 


The 82495DX cache controller drives a sequence of 
output signals and expects a sequence of cycle progress 
signals as shown in Figure 5-2. 


CADS# 
indicates the start of the cycle address phase. 


CDTS # tracks CADS # and indicates the start of the 
cycle data phase. For READ cycles CDTS# 
indicates 
that the CPU data bus is in read mode and under the 
control of the MBC from the next CLK until CRDY #. 
It indicates to the MBC the earliest time that BRDY # 
can be returned 
to the 82495DX. For Write cycles 
CDTS # indicates that the first piece of data is available 
on the memory bus. 


At the beginning of the cycle, cycle information signals 
are presented to the MBC that indicate to the MBC the 
type and the length of the cycle. These signals along 
with the type of ready that is returned from the host 
system (RDY# 
or BRDY#) 
tell the MBC what type 
of cycle to run as shown in Table 5-2. 
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To handle the various types and phases of a 82495DX 
cycle, we start by partitioning the MBC into several 
state machines as shown in Figure 5-3. State machines 
XMBREQ and XHLDA handle the cycle requests and 
host system arbitration. State machine XMADS gener- 
ates BGT # and the Inte1486 DX signal MADS # to 
the 
host 
system 
once 
bus 
ownership 
is granted. 


XMBLAST generates the signal MBLAST # as well as 
MA[2:31 to the host system. This is required to convert 
multiple dword requests from the 82495DX into In- 
te1486 DX burst cycles or multiple single cycles. For 
example, a write-back cycle of 16 bytes needs to be 
separated into four separate dword write cycles to the 
host system. 


State 
machine 
XWND 
and 
a 
decoder 
provide 
KWEND#, 
SWEND# 
and cacheability information 
to the 82495DX. Locked and psuedo-locked cycles are 
handled 
by XMLOCK 
and 
XMPLOCK. 
Finally, 


XBRDY, XDISBRDY and XCRDY take care of the 
generation of BRDY # and CRDY #. 


These state machines will be built up step-by-step as we 
progress through the cycles - the completed and [mal 
state machines are covered in Section 7. 


I 82495DX, I 


CADS" 


CYCLE 
INFO· 
COTS" 
~ 


IHOST SYSTEMI 


82495DX 
HAS REQUESTED 
A CYCLE. BGT INDICATES 
SGT" 
MADS" 
BUS OWNERSHIP, 
+-- 
---i~ 
- - - - - -1---------1 


KWEND" 
+-- 


SWEND" 
+------ 


MBC STARTS HOST CYCLE; 
GENERATES CACHEABILITY 
&: ~NOOP INfO 


~, 
DATA TRANSFER OCCURS 
..J ~B~~Y .L. 
-' 


SIGNALS 
THAT DETERMINE 
MBC CYCLES 


Inputs from the 82495DX 
Inputs from the Intel486™DX 
Host System 


CWR,*. 
CMIO,*, 
MCACHE'*, 
CLEN[O:1],RDYSRC. 
MRDY '*. MBRDY,* 
MKEN'* 
(from decode 
of 82495DX 
addresses), 
KLOCK '* ,CPLOCK '* 


POSSIBLE 
82495DX 
CYCLES 
RESULTANT 
Intel486™DX 
CYCLES 


Posted Write 
Single Cycle Write 
Non-cacheable 
Read1 (one dword) 
Single Cycle Read 
Non-cacheable 
Read1 (2 or 4 dwords) 
Burst Read or Two or Four Single Cycle Reads 
Line Fill 
Burst Read or Four Single Cycle Reads 
Allocation 
Burst Read or Four Single Cycle Reads 
Write Back 
Burst Write2 or Four Single Cycle Writes 
I/O Cycle 
I/O Cycle 
Lock Read 
Lock Read 
Lock Write 
Lock Write 


1 A non-cacheable 
read occurs 
when 
MCACHE# 
or MKEN# 
= 1. 


2 The MBC will support 
burst writes 
even though 
the host system 
does not. 


Inte1486Tt.4 
OX, 824950X, 
824900Xs 


CADS" 
KWEND" 
CRDY" 
SNPADS" 
KLOCK" 
SWEND" 
BRDY" 
RESETCPU 
BGT" 
CPLOCK" 
IolKEN" $ 


MBRDY490_ 
SNPSTB 
RESET 
COTS" 
IolRO" 
BUFFER 
1ol00E" 
IolAOE" 


CYCLE 
INFO 


CLOCK 
CYCLE 
CONTROL 
I 
DATA 
CONTROL 
SNOOP 
GENERATION 
RESET 
a: 
I 
CONTROL 
CONrlGU- 
ARBITRA- 
CYCLE 
LOCK 
CACHEABILITY 
I 
...• 
RATION 
TION 
TRACKER 
TRACKER 
CONTROL 
I 
CPUCLK - 
I 
XBRDY 
XWRIolRST EJ I XIolBREQ I 
XIolLOCK I 
XWND 
I 
XCRDY 
IolCLK - 
...• 
XIolADS 
XIolPLOCK 
I 
XDISBRDY 
XIolBLAST 
CACHE- 
I 
ABILITY 
I 
DECODER 
I 


IolBCLKA 
IolPCD 
IolRDY" 
IolEADS 
(2SIolHz) 
IolBREQ 
IolADS" 
IolLOCK" 
IolWR" 
IolRST 
IolH~LD 
IolBLAST 
IolPLOCK" 
IolDC" 
IolBRDY" 


IolHLDA 
IolA(2:3] 
1ol1ol10" 


BE(3:0J" 


Inte1486Tt.4 
OX MEMORY BUS 
241166-11 
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5.3.2 
REQUESTS 
FOR THE MEMORY 
BUS 


We can define an MBC stae REQ (Memory Bus Re- 
quest) entered due to a write request (CDTS.CWR #) 
or a read request (CADS.CWR) as shown in Figure 
5-4. Entry into this state causes the MBREQ signal to 
the host system to be asserted. The assertion of the 
signal CDTS# 
is used to qualify writes so th~t the 


translation into REQ is independent of whether It was 
started by CADS# or SNPADS#. If the 82495DX is 
not pipelined, CDTS# is always asserted along with 
CADS#. For snoop write-back cycles, CDTS# lags 
SNPADS# by two to four CLK cycles. 


MBREQ is also used by other MBC state machines. We 
shall retain MBREQ active until the cycle is completed 
as indicated by MBLAST# active (MBLAST# indi- 
cates the last transfer in a cycle and is described in 


Figure 5-4. The XMBReq 
State Machine for the MBC 


section 5.3.3.3).An internal signal, IMBRDY, equal to 
(MRDY + MBRDY).MCLK # is useful to simplify the 
state machines. 


This design example uses a HOLD/HLDA arbitration 
scheme (state machine described later). If a hold is not 
being requested (MHOLD = 0) or not being granted 
(MHLDA = 0) to the Intel486 DX socket then the 
MBC grants the cycle to the 82495DX by driving 
BGT# 
active as shown in Figure 5-5. An address 


strobe is also generated (MADS = I) onto the memory 
bus at this time. Note that MADS# is active for one 
MCLK period. BGT# 
is maintained active until the 


cycle completes as indicated by MBLAST# active. 


Figure 5·5. The XMADS State 
Machine 
for the MBC 


The 
signal 
BGT# 
is 
also 
connected 
to 
the 
MSEL#[MTR4/MTR8#] 
input of the 82490DXs. 


If the CPU is not in HOLD, the cycle may be started in 
the same cycle as MBReq - this can save a clock cycle 
and will result in higher performance. To accomplish 
this, the transition out of the IDLE state of XMADS 
could be augmented to include this parallel operation: 
the additional term would be MHOLD#.MHLDA#. 
MCLK#.(CADS.CWR + CDTS.CWR#). This addi- 
tional term is added to XMADS in a later Figure. 


The MBC responds to the CPU generated address by 
responding with cycle progress signals and then con- 
trols the data flow int%ut 
of the 82490DX cache 
SRAMs. 


The MBC closes the cacheability window by driving 
KWEND# 
after BGT# 
is active for two CLKs as 
shown in Figure 5-6. At this point, the 82495DX will 
sample MKEN# 
and MRO#; 
these signals indicate 
the cacheability and read-only attributes of the ad- 
dressed memory cycle. These attributes can be deter- 
mined by decoding the current address. Based on those 
attributes the 82495DX executes allocations, line-fills, 
replacements, etc. From Figure 5-6, the time available 
to decode MKEN# 
and MRO# from the address bus 
is 63 ns (not including clock skew). 


The MBC will also close the snoop window by driving 
SWEND# 
at which point the 82495DX will sample 
MWBIWT# 
and DRCTM#; 
these signals are deter- 
mined by snooping the other caches in the system in a 
multiprocessor system. At this point the 82495DX up- 
dates its TAGRAM state related to the line access in 
progress. However, since there are no other caches to 
snoop in this design example, SWEND # may be as- 
serted at the same time as KWEND #. Cache line attri- 
bute information such as MWB/WT # is sampled ac- 
cording to the fixedjumper settings. 


The state machines for KWEND# and SWEND# are 
shown 
in 
Figure 
5-6. 
(In 
the 
schematics, 
the 
KWEND# 
and SWEND# inputs of the CPU-Cache 
module are driven by a single signal SKWEND#.) 


An address decoder PLD is used to generate MKEN # 
and MRO# Jor the 82495DX. This design example in- 
cludes PLD equations for the decoder corresponding to 
the cacheable areas of a typical PC address map. The 
MRO (read-only) attribute is required when caching 
the BIOS for proper operation in a PC system. MRO 
cache lines are cached in the 82495DX/82490DX but 
not in the Intel486 DX CPU. CPU writes to MRO 
protected cache lines in the 82495DX/82490DX cause 
a write thru cycle. 


The format of the MKEN # decode equation is as fol- 
lows: 


where ENKEN 
enables caching, EPKEN 
enables 


BIOS caching, and ENALLC enables write allocations. 
Note that MKEN # must be returned active during 
cache memory write misses in order for the 82495DX 
to generate an allocation cycle. 


5.3.4.2 
Supporting 
Multiple 
Transfer 
Requirements 


When CADS# is generated, the 82495DX will output 
CLENO, CLENI, CWIR#, 
and MCACHE#. 
These 
signals provide the MBC with enough information to 
determine the cycle length. CLENO and CLENI indi- 
cate the number of transfers (one, two or four) that the 
CPU expects if the cycle is returned non-cacheable. 
MCACHE# 
is driven as an indication that the cache 
controller could cache this access; the MBC can make 
the line cacheable by returning MKEN # active. The 
possible cycles were shown in Table 5-2. 
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Figure 5·6. KWEND and SWEND are Generated 
by the 
XWND State Machine 
on the MCLK Following 
BGT 


The XMBLAST state machine (Figure 5-7) is used to 
determine the number of transfers required by decoding 
the cycle information from the 82495DX. Using this 
information, it generates the lower order burst address- 
es (MA2, MA3) to the host system in the appropriate 
order after each transfer. It also generates the signal 
MBLAST # -this 
is driven into the host system and 
also used internally by other MBC state machines to 
indicate the end of the cycle. 


Finally, the XMBLAST state machine generates the 
signal IDISB (Internal Disable BRDY #). This signal is 
used during read-only cache linefills by the state ma- 


chine that generates BRDY# 
(discussed later). It is 
asserted 
during 
cacheable 
cycles in states 
LEN42 
through LEN44 for a one-dword read and in states 
LEN43 through LEN44 for a two-dword read. 


In the case where the 82495DX has requested more 
than one transfer (two or four dwords) and the host 
system returns MRDY # instead of MBRDY #, the cy- 
cle cannot be burst. Instead, the cycle is broken down 
into multiple single cycles to the host system. This will 
be the case for write-back cycles (since the host system 
probably does not support burst writes) and for code 
fetches from non-burstable BIOS EPROMs. 
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NonCacheable1 
= BGT.MCACHE"'.CLENU.CLENO", 
NonCacheable2 
= BGT.MCACHE'" 
.CLEN1 ", .CLENO 
Potentially 
Cacheable 
or NonCacheable4 
= BGT.(MCACHE+CLEN1) 
Cycles1 
= KWEND.MKEN 
# .CLEN1 ", .CLENO'" .1MBRDY # 
Cycles2 
= KWEND.MKEN 
# .CLEN1 # .CLENO.IMBRDY 
# 
MR01 
= MKEN.MCACHE.CLEN1", 
.CLENO# 
.1MBRDY 
MR02 
= MKEN.MCACHE.CLEN1", 
.CLENO.IMBRDY 


NOTE: 
State 
LEN21 
refers 
to the first cycle 
of a two cycle 
transfer. 
Similarly 
for other 
state names. 


The XMADS state machine, introduced in Figure 5-5 
must be augmented to produce multiple memory ad- 
dress strobes when MRDY # is returned before the cy- 
cle is over (see Figure 5-8). Also, the additional term 
that enables quicker transitions out of IDLE is added. 


Data is transfered into or out of the 82490DXs whenev- 
er the host system tells the MBC that it has or can 
accept data. Thus, the generation of MBRDY490# 
is 
straightforward: 


The output enable signal for the data caches MDOE# 
is simply equal to BGT.CWR #. 
The generation of 
MEOC# 
is also simple: 
The MBC subsection that controls data flow must pro- 
vide the following functionality: 


control the data into and out of the 82490DX data 
caches via MBRDY490 and MDOE# 
tell the 82490DXs when the data transfer is over via 
MEOC# 
tell the Intel486 DX when to transfer data to and 
from the CPU's data bus via BRDY # 
tell the 82495DX when the cycle is complete via 
CRDY#. 


where MBLAST is provided by the XMBLAST state 
machine. 


Under most conditions, the generation of BRDY # is 
also straightforward. BRDY # is asserted for all reads 
except allocation 
cycles (RDYSRC= 
I) and 
tracks 


MRDY # or MBRDY # from the host system by one 
MCLK period. This allows enough time for the data to 
propagate through the 82490DXs from the memory 


• 


bus. 
BRDY *' 
is 
not 
generated 
for 
write 
cycles 


(RDYSRC =0) since the write data will be already 
available in the data caches. Figure 5-9 illustrates the 
state machine XBRDY that transitions between the 
IDLE and SIGNAL states for the case described above. 


The state machine XDISBRDY 
is needed for MRO 
read cycles. MRO cache lines are cacheable in the 
82495DX/82490DX 
but not in the Intel486 DX CPU. 


As such, four ready's from the host system are needed 
to fill the second level cache whereas the Intel486 DX 
CPU must receive BRDY*"s 
equal to the number of 
dwords that it has requested----either one or two. 


The signal DISBRDY (DISable BRDY *') is asserted 
when the MBC has satisfied the number of BRDY *,'s 


required by the Intel486 DX CPU. The state machine 
does this by watching the IDSIB signal generated by 
the XMBLAST state machine. When the CPU requests 
one dword, IDISB is asserted for the second through to 
the fourth dword and this results in DISBRDY asser- 
tion which blocks BRDY *' generation. The same is 
true for two-dword reads where BRDY *' is blocked for 
the third and fourth dword. 


Finally, CRDY *' is asserted for a single cycle when 
MBLAST is active and the last ready from the host 
system is received. The XCRDY 
state machine 
is 


shown in Figure 5-10. A CNFIG state is also present to 
allow for a self-test on reset sequence; it is entered on 
reset when MAHOLD from the host system is asserted 
and the self-test option (SLFfST) is enabled. 


(MHOLDllO +MHLDAllO ).MCLKllO .MBReq 
+MHLDAllO.MHOLDllO.MCLKllO. 
(CADS.CWR+CDTS.CWRllO) 
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MEADS* 
is qualified by MCLK* 
to bring it into the 
CLK timing domain and generates SNPSTB (SNooP 
STroBe) into the 82495DX. Note that synchronous and 
strobed modes of snoop timing can be evaluated with 
this design. 


When the MBC does not have control of the memory 
bus, the host system generates snoops via MEADS * 
into the CPU-CacheIMBC. 
In this design example 
• 


The 82495DX samples the memory address, SNPNCA 
(SNooP Non-Caching Access) and SNPINV (SNooP 
INValidate) with SNPSTB and will generate state in- 
formation indicated by MTHIT 
(a Tag HIT, ie the 
snoop address 
is contained 
within 
the cache) and 
MHITM 
(a HIT to a Modified line in the cache). 
SNPNCA is used to prevent the cache state needlessly 
moving from Exclusive when the snooper is a non- 
caching device. SNPINV is used to indicate that a 
memory bus write is in progress and the cache should 
invalidate a cached line. 


The response from the cache controller will depend 
upon the memory 
update 
policy (write-through 
or 
write-back) that is being implemented. 


5.3.5.1 
Snoopee 
Response 
In Write-Through 
Mode 


In write-through 
mode no data is maintained in the 
cache that is different from main memory. This results 
in cache data only having two possible states, S and I 
(Shared and Invalid), within the MESI protocol model. 
In Write-Through 
mode SNPNCA will not be used 
since the cache state will never be Exclusive or Modi- 
fied; but SNPINV will be used to invalidate cache lines. 
In this design example MTHIT 
will be generated but 
the MBC may ignore it since there are no other caches 
in the system. MHITM will never be generated since 
the Modified state is not allowed in write-through. In 
short, there is nothing the MBC need do except gener- 
ate the invalidating snoops into the 82495DX. 


In write-back mode the cache controller reduces write 
traffic on the memory bus by maintaining data in the 
cache SRAMs. This results in cache data having up to 
four possible states, M, E, S and I (Modified, Exclusive, 
Shared and Invalid) within the MESI protocol model. 
In write-back mode some of the data in main memory 
will be stale and any attempt by another master (in- 
cludes DMA channel) to access stale data in main 
memory should be prevented by either software (via 
PWT) or by the absense of any other master apart from 
the CPU as discussed earlier. 


If secondary masters cannot be prevented from access- 
ing memory that has been write-back enabled, the 


82495DX will assert MHITM when the accessed data 
is found 
modified 
in 
the 
cache. 
The 
82495DX/ 
82490DX MUST be allowed to write-back this data to 
main memory and the other master should be told to 
retry the access. A 'typical' DMA controller will not be 
able to 'back-off' and retry an access - write-back as- 
sumes that the other masters or caches in a system are 
able to deal with this situation. 


Therefore, in order to incorporate full write-back sup- 
port, two changes would be required to this design ex- 
ample and to the host system. The first is that snoops 
must occur for secondary master reads in addition to 
writes. The second is that 
on the occurence 
of a 
MHITM, 
the secondary master (the snooper in this 
case) must allow an intervening snoop write-back cycle 
from the 82495DX/82490DX to occur before complet- 
ing its cycle. This allows main memory to be updated 
so that the read data retrieved from it will be consist- 
ent. On the case of secondary master writes, it allows 
the cache line - of which some or all bytes may be dirty 
- to be replaced into main memory before the secondary 
master completes its write. The snoop write-back cycle 
is explained in the following section. 


In this design example we will always assert SNPNCA 
since DMA is the only other master connected to the 
Intel486 DX CPU socket. There are no other caches to 
signal in our uni-processor design example so the MBC 
may ignore MTHIT. 


If the change described above can be implemented to 
enable full write-back support, then when a snoop to a 
modified line is detected the 82495DX will schedule a 
write-back of this modified line into main memory. The 
MBC should treat this as a priority event. To simplify 
the implementation of the MBC the 82495DX signals a 
write-back 
request 
using a separate 
output 
signal, 
SNPADS (SNooP ADdreSs) instead of CADS #. The 
82495DX will abort a memory request that has not be 
granted (ie BGT # not asserted) and will retry this 
memory request once the write-back is completed. This 
will require a modification to our MBReq state ma- 
chine ·asshown in Figure 5-11. Snoop write-back cycles 
are a subset of normal write cycles and no other action 
is required by the MBC. 


Figure 5-11. A Snoop Request 
Can 
Abort a Cycle Request 
(XMBReq) 


As a snooper, the MBC must generate snoops into oth- 
er caches in the system during cacheable memory reads 
and writes. The other caches will respond with both 
MTHIT and MHITM. The 82495DX uses this infor- 
mation to update the status bits of the accessed line 
according to the MESI model. Since the host system in 
this case is assumed to have no other cache apart from 
the 82495DX/82490DX, 
no snooper functions are re- 
quired. 


The 82495DX handles all memory accesses and passes 
non-cachable 
cycles 
through 
to 
the 
MBC. 
The 
82495DX will also pass I/O cycles, cache maintenance 
cycles (such as FLUSH 
and SYNC) and LOCK/ 
PLOCK cycles through to the MBC. 


I/O cycles are decoded by the 82495DX and not post- 
ed. These cycles wait until all buffers have been written, 
and all cycles have been completed, before they cause 
CADS# 
assertion. The CPU waits until the I/O ends 
with the MBC's BRDY # assertion before it continues. 


The on-chip cache of the Intel486 DX CPU can be 
flushed via the CPU's 
FLUSHCPU # 
pin. On the 
82495DX/82490DX, there is an equivalent input called 


PFLUSH # which causes all modified lines to be writ- 
ten back to main memory before the second level cache 
is invalidated. In this design these two FLUSH inputs 
are both connected to MFLUSH # coming from the 
host system. 


When the cache subsystem is performing a flush, many 
write back cycles are required. These cycles look like 
ordinary write back cycles, and are handled as such. 
FSIOUT # is active during these write back cycles, so 
when FSIOUT # goes inactive the cycle is complete. 


The Intel486 DX instruction set includes commands 
that control the on-chip cache. INVD and WBINY 
both invalidate the on-chip cache. When executed, 
these commands also appear as external I/O-like spe- 
cial cycles. If so desired, the MBC design may be modi- 
fied to decode these cycles and to provide the necessary 
control of the 82495DX/82490DX 
second-level cache. 


SYNC cycles apply only to the second level cache and 
are similar to FLUSH cycles except that the cache lines 
are not invalidated upon completion. SYNC cycles are 
initiated by asserting the SYNC input to the 82495DX. 
However, this feature is not implemented in this design. 


The 82495DX provides a lock signal for the memory 
bus called KLOCK #. KLOCK # is generated by the 
82495DX whenever the CPU generates the LOCK # 
signal. KLOCK"', 
like the other cycle attributes, is val- 
id with CADS # assertion. 


When the CPU generates a locked cycle, the 82495DX 
always generates a bus cycle even if the access was a 
cache hit. If the access was a cache miss, the locked 
cycle will be non-cacheable to both the 82495DX and 
CPU, 
so 
the 
information 
is 
passed 
through 
the 
82495DXs to the CPU with BRDYs generated by the 
MBC. If the locked read cycle is a modified hit in the 
82495DX, the 82495DX ignores the data that it is re- 
ceiving and supplies data from the cache array (in ac- 
cordance with BRDYs supplied by the MBC). Locked 
writes are posted like any other write. Locked cycles, 
both reads and writes, never change the 82495DX tag 
state. The state machine for MLOCK'" 
is shown in 
Figure 5-12. MLOCK # is driven active upon start of a 
locked cycle. 


Following the 82495DX's protocol, KLOCK # may go 
inactive in the CLK cycle following CADS# 
assertion. 


However, the MLOCK # signal must stay active for 
the whole cycle. Therefore, it is necessary to hold 
MLOCK'" 
until KLOCK'" 
is deasserted and it is the 


last transfer in the cycle, or KLOCK'" 
is deasserted 


and it is the beginning of a new unlocked cycle. 


• 


5.3.8.4 
PLOCK cycles 


The 
82495DX supports 
the 
Intel486 DX 
CPU's 
PLOCK"" protocol by providing a PLOCKEN config- 
uration input. If PLOCKEN is detected high at RE- 
SET, the 82495DX CPLOCK"" pin reflects the In- 
tel486 DX CPU PLOCK"" output for write cycles and 
the 82495DX follows certain PLOCK 
rules. The 
82495DX's read cycles never generate CPLOCK"". If 
PLOCKEN is detected low at RESET, CPLOCK"" is 
not generated and the PLOCK rules are not enforced. 


CPLOCK"" is driven like PLOCK"" of the Intel486 
DX CPU, but only for write cycles. It is active with 
CADS"" of the first PLOCK write and indicates that 
the next write will be PLOCKed to it. To emulate the 
behavior of the Intel486 DX CPU's PLOCK"" signal, 
MPLOCK"" is asserted for write cycles as well as for 
multiple-dword read cycles. CPLOCK"" goes inactive 
with CNA"" or CRDY"" of that first write cycle. The 
state machine for PLOCK is shown in Figure 5-12. 


The 82495DX allows the Intel486 DX CPU to be reset 
without resetting the cache subsystem, this is termed a 
warm-reset. This sequence is important for some PC 
software that uses the warm reset to return from pro- 
tected mode to real mode of the CPU. 
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The 82495DXprovides a specialpin, called WRMRST, 
to help with this function. The reset input to the CPU 
(RESETCPU) is connected to the 82495DX WRMRST 
input within the CPU-Cache module (WRMRST does 
not appear on the CPU-Cache module's interface) so 
that the 82495DX knows when CPU is being reset. 


To warm-reset the Intel486 DX CPU, the following 
must be done: 
, 


l. Assert HOLD to the CPU and wait for HLDA. 


This ensures that no CPU bus cycles are running 
that could corrupt the cache subsystem. 


2. Assert RESET to the CPU only via RESETCPU. 
3. Wait for CAHOLD to be inactive if it is not inactive 


already. This is so when RESETCPU is deasserted, 
the CPU does not go into self-test (note that al- 
though the Inte1486DX CPU may be resetting, the 
cache is allowed to be active. This is possible be- 
cause the CPU allows back-invalidations during re- 
set). 
4. Deassert 
RESETCPU 
in 
the 
next 
clock. The 


82495DX will not guarantee that CAHOLD will 
stay inactive beyond this. 


During a warm reset sequence, the resets to the 
82495DX and 82490DXs (RESETOand RESET!) are 
not asserted. This sequenceis implemented in state ma- 
chine XWRMRST as shown Figure 5-13. The MBC 
signal WARM is used to generate RESETCPU as 
shown in the PLD equations in Section 7. 
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Figure 5·13. Generating 
a 
Warm Reset (XWRMRST) 


WHOLD.WCLK"'. 
(WBREQ'" .WPLOCK'" .WLOCK'" 
+ 
WBREQ.WBLAST .IWBRDY. 
CPLOCK'" .KLOCK'" 
+ 
LOCKWR.KLOCK.CWR. 
CPLOCK"'.WPLOCK"'.CADS 
+ 
LOCKWR.KLOCK.CWR. 
CPLOCK"'.lotPLOCK"'. 
lotBREQ.BGT"') 


In this MBC design, the host system Intel486 DX sock- 
et only includes a cold reset pin, MRST (since a sepa- 
rate warm reset is not necessary for the CPU by itself). 
Two options are available if software that uses the 
warm reset must be tried or evaluated. The first is to 
allow the system to start with a cold reset (jumper J9, 
EWRMRST# 
= I) and then switch to warm resets 
before the software that uses them is started. The other 
option is to separately wire a warm reset signal directly 
from the host system. 


The bus arbitration state machine-XMHLDA-is 
ac- 
tually a combination of two simple state machines as 
shown in Figure 5-14. For the most part, bus exchanges 
occur as a result of transitions between states IDLE 
2 


and BHOLD of the first state machine shown. If a bus 
hold is being requested, the first state machine tran- 
sitions to BHOLD where MHLDA is asserted to give 
away bus ownership if the MBC is not running locked 
or pseudo-locked cycles (first two terms in the tran- 
sition equation). 


CADS.CWR.WCLK'" 
+ 
WBREQ.CWR.lotCLK'" 


A second state machine is used to generate the signal 
LOCKWR. It helps dealing with multiple back-to-back 
atomic (locked) operations. An example of this is a con- 
tinuous sequence of read-modify-write cycles. Under 
these conditions, the KLOCK # signal stays asserted 
for the entire sequence. This will increase the MHLDA 
latency since the transition from IDLE to BHOLD 
waits for the entire locked sequence to end. To mini- 
mize this latency, the boundary between one atomic 
operation and the next should be detected and the bus 
relinquished in between. This boundary is signaled by a 
locked write (end of atomic operation n) followed by a 
locked read (beginning of atomic operation n + 1). 


The LOCKWR state machine detects the occurrence of 
a locked write at which point LOCKWR is asserted. If 
the next cycle is a locked read and there is a bus 
MHOLD 
pending, then the first state machine tran- 
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sitions to state BHOLD where MHLDA 
is asserted 
(third and fourth terms of the transition equation). If 
the 
next 
cycle is also a locked 
write 
cycle, the 
LOCKWR 
state machine remains in state LKWR. 
Otherwise, it returns to IDLE. 


Note that the MHLDA 
signal is also used as the 
MAOE# 
signal into the 82495DX; 82495DX address 
outputs are tri-stated when MBC does not have bus 
ownership. 


Finally, the XMADS machine must be modified a final 
time to account for the locked cycle just described. This 
is shown in Figure 5-15 where the transition out of the 
IDLE state is changed to account for the case where 
the previous cycle was a locked write and there is a bus 
hold pending; the state machine stays in IDLE and the 
assertion of MADS # and BGT# 
is delayed until bus 
ownership is returned. 


(WHOlDlHWHlDA*). 
WClK*.WBReq.lOCKWR* 
+ 
WHlDA*.WHOlD*.WClK*. 
(CADS.CWR+CDTS.CWR*) 
+ 
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The state machines and other functions described in the previous text are partitioned into physical PLD devices as 
follows: 


Physical 
U# 
Logical State Machine 
PLDName 


XMEOC 
U7 
MKEN, MEOC, MBRDY490 
generation 


XRESET 
• U1 
Reset generation, 
XWRMRST 


XMBUFF 
U8 
Buffer for Intel486™DX 
signals 


XKEN 
U2 
Address 
decoder, 
CPLOCKEN, 
SNPSTB generation 


XMHLDA 
U3 
XMHLDA, 
XMADS, XMBReq, 
XMLOCK, 
XMPLOCK 


XRDY 
U14 
XBRDY, XCRDY 
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U11 
XDISBRDY 
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XWND 
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2 • 3 • 4 
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S1 
SOV 


0.1 ~~ TC34 


~ 
SOV 
1 + 
"~~ 
~- '" 


SOVI 


0.1 ~~IC19 


SOVI 


0.1~~ IC3S 


SOV 1+ 


10 ~~Ji- C40 


SOVI 


0.1 ~~IC20. 


SOVI 


0.1 ~~IC33 


SOV 1+ 


10~~ Ji- C39 


SOVI 


0.1 ~~IC21 


SOVI 


0.1 ~~IC32 


SOV 1+ 


10 J'F Ji- C38 
S% 
2 


SOVI 


0.1 ~~ IC22 


SOVI 


0.1 ~~ IC23 


SOVI 


0.1 ~~IC31 


SOVI 


0.1 ~~ IC30 


SOVI 


0.1 ~~IC2S 


SOVI 


0.1 ~~IC29 


SOVI 


0.1 ~~IC26 


SOVI 


0.1 ~~IC28 


SOVI 


0.1 ~~IC27 


SOV1 
"~~~ 


SOV 1 
"~~~ 


--€: 
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TITLE 
PATTERN 
REVISION 
AUTHOR 
COMPANY 
DATE 


XMEOC 
F7A.PDS 
A 
MODULES 
INTEL CORP. 
8/17/91 


RSTCPU IMRDY IMBRDY MCLK 
IMBLAST NCI IRFO IPALLC IENKEN ICWR ISKEN GND 
IENALLC IMCACHE IIMBRDY MHLDA IMBRDY490 IMFRZ IMEOCO IMEOCI IMKEN lMEOC2 
IIHLDA VCC 


• ENKEN 
CWR * SKEN 
+ ENKEN 
ICWR * ENALLC * PALLC * SKEN 
+ ICWR * MCACHE 


MKEN is qenerated in this PLD. 
ENKEN is a confiqurable option 
for enablinqldiablinq 
cache. 
ENALLC allows allocation cycles 
to take place in the case of write-miss cycles. These can be 
read-for-ownership 
cycles usinq the RFO jumper. This freezes 
the memory write data in 82490 durinq the write-miss cycle 
and is combined 
with the rest of the data fetched durinq 
the allocation. 
• 
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TITLE 
PATTERN 
REVISION 
AUTHOR 
COHPANY 
DATE 


XRESET 
F1A.PDS 
A 
HODULES 
INTEL 
CORP. 


8/16/91 


CHIP 
XRESET 
85C060 


CLK 
HRST 
IEWRHRST 
HLDA 
CAHOLD 
HCLK 
ACLK 
IIHBRDY 
RDYSRC 
IHBLAST 
ISLFTST 
GND 
IOE 
HAHOLD 
RSTCACHE 
ICRDY1 
IBRDYO 
ICRDYO 
IBRDY1 
HOLD 
IWARM 
RSTCPU 
IDISB 
VCC 


,. 
HRST 
* 
HOLD 
* 
HLDA 
* EWRHRST 
+ HRST 
* 
WARM 
* 
EWRHRST 
+ CAHOLD 
* WARH 
* 
EWRHRST 


RSTCPU 
• HRST 
* 
IHCLK 
* 
IEWRHRST 
+ RSTCPU 
* HRST 
* 
IEWRHRST 
+ RSTCPU 
* 
HCLK 
* 
IEWRHRST 
+ WARM 
* 
EWRHRST 


RSTCACHE 
= HRST 
* 
IHCLK 
* 
IEWRHRST 
+ RSTCACHE 
* HRST 
* 
IEWRHRST 
+ RSTCACHE 
* HCLK 
* 
IEWRHRST 


This 
state 
machine 
generates 
reset 
lines 
to 
CPU,82495,82490. 
Because 
the 
i486 
motherboard 
socket 
has 
only 
one 
RESET 
pin 
it 
can 
be 
either 
used 
to 
reset 
all 
of 
them 
or 
only 
the 
CPU. 
A 
configuration 
jumper 
IEWRHRST 
selects 
between 
WARMRESET 
and 
full-reset. 
If 
the 
system 
has 
to 
be 
reset 
the 
jumper 
has 
to 
be 
removed 
so 
that 
82495DX 
also 
gets 
reset 
and 
cache 
is 
flushed. 
Or 
IEWRHRST 
can 
be 
connected 
from 
a 
software 
configurable 
port 
pin 
to 
the 
i486 
socket. 


TITLE 
PATTERN 
REVISION 
AUTHOR 
COMPANY 
DATE 


XMBUFF 
F8A.PDS 
A 
MODULES 
INTEL CORP. 
6/3/91 


OPTIONS TURBO - ON 


CHIP 
XMBUFF 
85C224 


NC4 ICWR RDYSRC IMCACHE ICDC 
ICMIO IIHLDA IIBEO lIBEl IIBE2 IIBE3 GND 
NCl MBREQ IMBE3 IMBE2 IMBEl IMBEO IMMIO 1M DC IMWR IMBCACHE NC3 VCC 


EQUATIONS 


MWR 
-CWR 


MWR.TRST 
- /IHLDA 


MDC 
-CDC 


MDC.TRST 
- IIHLDA 


MMIO 
- CMIO * MBREQ 


MMIO.TRST 
- IIHLDA 


MBEO.TRST 
- IIHLDA 


MBEO 
- IBEO + (MCACHE * IRDYSRC) 


MBEl.TRST 
- IIHLDA 


MBEl 
- IBEl + (MCACHE * IRDYSRC) 


MBE2.TRST 
- IIHLDA 


MBE2 
- IBE2 + (MCACHE * IRDYSRC) 


MBE3.TRST 
- IIHLDA 


MBE3 
- IBE3 + (MCACHE * IRDYSRC) 


Generates the memory bus control signals and byte enables. 
Note that all byte enables are activated when RDYSRC is inactive 
and MCACHE is active. 
This is for write-back cycles. 
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TITLE 
PATTERN 
REVISION 
AUTHOR 
COMPANY 
DATE 


XKEN 
F2A.PDS 
A 
MODULES 
INTEL CORP. 
7/22/91 


CLK 
MA24 
MA23 IEPKEN MA22 MA21 MA20 MA19 MAl8 MA17 CPLOCKEN GND 
RSTCPU ISNPDIS MCLK 
IMRO ISNPSTB IRESETCPU ISKEN ICPLOCK IMEGl5 IMEGO IMEADS VCC 


EQUATIONS 


MEGO 
• IMA24 • 1MA23 • 1MA22 • IMA21 • IMA20 


SKEN 
• IMEGO • IMEG15 
+ MEGO • IMAl9 
+ MEGO • MAl9 • IMAl8 • IMA17 
+ MEGl5 • IMA19 
+ EPKEN • MEGO • MAl9 • MA18 • MA17 


MRO 
• EPKEN • MEGO • MA19 • MA18 • MA17 


CPLOCK.TRST 
• ICPLOCKEN • RSTCPU 


;c;Locxi-d;i~;~-b;-~;b~~-~;;d-;~-;;;;~-~i;;-;;-;LOCXE~-~i~. 
;After reset the pin is floated by Debug board since CPLOCKI 
;is output signal from 82495. 


;MRO signal is generated for the last 128KB of 1MB (system BIOS 
;area). It is a configurable option 
(EPKEN/). System KEN generated 
;based on the address decoding. Cache is disabled only from 
;640KB-IMB and between 15.5MB to 16MB. BIOS area can be enabled 
;by making EPKEN active. 
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TITLE 
PATTERN 
REVISION 
AUTHOR 
COMPANY 
DATE 


XHHLDA 
F3A.PDS 
A 
MODULES 
INTEL 
CORP. 
9/03/91 


CLK1 
RSTCPU 
ICADS 
ICPLOCK 
IKLOCK 
ICDTS 
ICWR 
ISNPADS 
MCLK 
IMRDY 
MHOLD 
GND 
CLK2 
IIMBRDY 
ILOCKWR 
IMPLOCK 
IMADS 
IBGT 
IIHLDA 
MBREQ 
IMLOCK 
ACLK 
IMBLAST 
VCC 


EQUATIONS 


IHLDA 
,. RSTCPU 
• MHOLD 
• 
IMCLK 
+ 
IRSTCPU 
• MHOLD 
• IHLDA 
+ 
IRSTCPU 
• MCLK 
• 
IHLDA 
+ 
IMLOCK 
• 
IMPLOCK 
• MHOLD 
• 
IMCLK 
• IMBREQ 
+ 
IKLOCK 
• 
ICPLOCK 
• MHOLD 
• IMBRDY 
• MBLAST 
• IMCLK 
• MBREQ 
+ 
IRSTCPU 
• LOCKWR 
• CADS 
• CWR 
• KLOCK 
• MHOLD 
• 
IMCLK 
• 
ICPLOCK 
• 
IMPLOCK 
• 
IMBREQ 
+ 
IRSTCPU 
• LOCKWR 
• MBREQ 
• CWR 
• KLOCK 
• MHOLD 
• 
IMCLK 
• 
ICPLOCK 
• 
IMPLOCK 
• 
IBGT 


MBREQ.CLKF 


MADS 


,. IRSTCPU 
• 
+ 
IRSTCPU 
• 
+ 
IRSTCPU 
+ 
IRSTCPU 


ISNPADS 
• IIMBRDY 
• MBREQ 
ISNPADS 
• IMBLAST 
• MBREQ 
COTS 
• 
ICWR 
• IMBREQ 
CADS 
• CWR 
• IMBREQ 


I· 
IRSTCPU 
• MCLK 
• MADS 
+ 
IRSTCPU 
IIHLDA 
• MBREQ 
• IMCLK 
• 
IBGT 
• ILOCKWR 
+ 
IRSTCPU 
IMHOLD 
• MBREQ 
• IMCLK 
• 
IBGT 
• ILOCKWR 
+ 
IRSTCPU 
HRDY 
* IMBLAST 
• IMCLK 
• 
IMADS 
* BGT 
+ 
IRSTCPU 
IIHLDA 
* IMHOLD 
• COTS 
• ICWR 
• IMCLK 
* IBGT 
+ 
IRSTCPU 
IIHLDA 
* IMHOLD 
• CADS 
* CWR 
• IMCLK 
• 
IBGT 
+ 
IRSTCPU 
MBREQ 
* IMCLK 
* IBGT 
* LOCKWR 
• ICWR 
* KLOCK 


• ACLK 


I· 
IRSTCPU 
* IIHLDA 
* MBREQ 
• IMCLK 
• IBGT 
• ILOCKWR 
+ 
IRSTCPU 
* IMHOLD 
* MBREQ 
• IMCLK 
• IBGT 
• ILOCKWR 
+ 
IRSTCPU . IIHLDA 
* IMHOLD 
• COTS 
• ICWR 
• 
IMCLK 
• IBGT 
+ 
IRSTCPU 
* IIHLDA 
* IMHOLD 
• CADS 
• CWR 
• 
IMCLK 
* IBGT 
+ 
IRSTCPU 
• MADS 
+ 
IRSTCPU 
* IIMBRDY 
• BGT 
+ 
IRSTCPU 
• IMBLAST 
* BGT 
+ 
IRSTCPU 
• MBREQ 
• IMCLK 
* IBGT 
* LOCKWR 
• ICWR 
• KLOCK 


,. IRSTCPU 
* CADS 
* KLOCK 
+ 
IRSTCPU 
* 
KLOCK 
* MLOCK 
+ 
IRSTCPU 
• 
IMBLAST 
* MLOCK 
+ 
IRSTCPU 
• 
IIMBRDY 
* MLOCK 


,. IRSTCPU 
+ 
IRSTCPU 
+ 
IRSTCPU 
* 
+ 
IRSTCPU 
+ 
IRSTCPU 


MPLOCK.CLKF 
• ACLK 


CADS 
* CPLOCK 
CPLOCK 
* MPLOCK 
IMBLAST 
• MPLOCK 
MCLK 
* MPLOCK 
BGT 
• 
1MB LAST 
• IMCLK 


I· 
IRSTCPU 
* 
+ 
IRSTCPU 
* 
+ 
IRSTCPU 
+ 
IRSTCPU 


BGT 
• KLOCK 
* 
ICWR * 
1MB ROY 
• MBLAST 
LOCKWR 
* KLOCK 
* 
ICADS 
• 
IMBREQ 
LOCKWR 
* KLOCK 
• ICWR 
LOCKWR 
* 
KLOCK 
* MCLK 


• 
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;memory system does not want to burst and for write-backs 
the state 
;machine qenerates MADSI 
after each MRDYI until the last transaction 
,on the memory 
bus. 


;MBREQ keeps track of the pendinq request 
from the Module 
and used 
,to qenerate MADSI 
after IHLDA becomes 
inactive. 


;IHLDA is qenerated when MHOLD is sampled active and the bus is not 
,locked by KLOCKI or PLOCKI.But 
two atomic cycles occur back-to-back 
;then IHLDA is qenerated 
before the start of the second atomic cycle. 
;LOCKWRI is used to keep track of the write-portion 
of the atomic 
;cycle and if LOCKWRI 
is active and the next cycle is a read then 
;IHLDA is qenerated 
if MHOLD 
is pendinq 
at that time. 


;Memory bus MLOCK and MPLOCK 
;Since the i486 LOCK remains 
;state machine 
extends 
82495 
:memory. 


are also qenerated 
in this state machine. 
active until after the last ready this 
KLOCK until after the last ready from 


TITLE 
PATTERN 
REVISION 
AUTHOR 
COMPANY 
OATE 


XRDY 
FHA.POS 
A 
MODULES 
INTEL CORP. 
7/30/91 


CHIP 
XRDY 
PAL16R4 


CLK IOISB RSTCPU 
ISLFTST MCLK 
IIMBRDY IMBLAST RDYSRC MANOLD GND 
10E 
ICRDY1 ICRDYO ICROY NC1 NC2 IBRDY IBRDY1 IBRDYO VCC 


EQUATIONS 


BRDYO 


BROYl 


CRDYO 


CRDYl 


This state machine 
qenerates 
BRDY to the CPU 
,82495 and 82490. 
BRDY is qenerated 
after samplinq an active IMBRDY 
(MRDY or MBRDY). 
CRDY is qenerated 
after samplinq active IMBRDY and MBLAST. 
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TITLE 
PATTERN 
REVISION 
AUTHOR 
COMPANY 
DATE 


XBRDYDIS 
FllA.PDS 
A 
MODULES 
INTEL CORP. 
8/15/91 


,. IRSTCPU 
+ IRSTCPU 
+ IRSTCPU 


MDOE 


A20M 


• BGT • ICWR 


• MA20M • IMCLX 
+ A20M • MCLX 
+ A20M • MA20M 


MA02 
• IMA02 


MA02.TRST 
• IIHLDA 


MA03 
• IMA03 


MA03.TRST 
• IIHLDA 


IDISB • MRO 
DISB • IIMBRDY 
DISB • IMBLAST 


This PLD provides tri-state control for burst address lines 
MA2,MA3 and also qenerates MDOE used to enable 82490 data 
buffers durinq write-cycles. 
DISB is used in the XRDY 
state machine to control the number 
of BRDYs to CPU durinq MRO cycles. 
• 
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TITLE 
PATTERN 
REVISION 
AUTHOR 
COMPANY 
DATE 


XHBLAST 
F6A.PDS 
A 
MODULES 
INTEL 
CORP. 
7/17/91 


OPTIONS 
TURBO 
• 
ON 


CHIP 
XMBLAST 
85C060 


CLK 
RSTCPU 
IBGT 
MCLK 
IIMBRDY 
CLEN1 
IMCACHE 
CA2 
CA3 
IMKEN 
CLENO 
GND 
NC1 
IIHLDA 
MA3 
ISV3 
ISV2 
ISV1 
IIDISB 
ISKWEND 
IHBLAST 
MA2 
ACLK 
VCC 


EQUATIONS 


SV3 
,. IRSTCPU 
* 
IIMBRDY 
* SV3 
+ IRSTCPU 
* 
IMBRDY 
* 
ISV3 
* SV2 
+ 
IRSTCPU 
* 
IMBRDY 
* 
ISV3 
* SV1 
+ 
IRSTCPU 
* 
ICLEN1 
* 
ICLENO 
* 
IIMBRDY 
* SKWEND 
* 
IHKEN 
* 
ISV2 
* 
SV1 
+ 
IRSTCPU 
* 
IMCACHE 
* 
ICLEN1 
* 
ICLENO 
* BGT 
* 
ISV3 
* 
ISV2 
* 
ISV1 


SV3.CLKF 
• ACLK 


SV2 
,. 
IRSTCPU 
* 
IIMBRDY 
* 
SV2 
+ 
IRSTCPU 
* 
ISV3 
* 
SV2 
+ 
IRSTCPU 
* 1MB ROY 
* SV3 
* ISV2 
* SV1 
+ 
IRSTCPU 
* 
IMCACHE 
* 
ICLEN1 
* CLENO 
* BGT 
* ISV3 
* 
ISV1 
+ 
IRSTCPU 
* 
ICLEN1 
* CLENO 
* 
IIMBRDY 
* SKWEND 
* 
IHKEN 
* ISV3 
* SV1 


SV2.CLKF 
• ACLK 


SV1 
,. IRSTCPU 
* CLEN1 
* IIMBRDY 
* SV1 
+ 
IRSTCPU 
IIMBRDY 
* 
ISKWEND 
* SV1 
+ 
IRSTCPU 
IIMBRDY 
* MKEN 
* SVl 
+ 
IRSTCPU 
* IIMBRDY 
* SV3 
* SV1 
+ 
IRSTCPU 
* 
ISV3 
* SV2 
* SVl 
+ 
IRSTCPU 
* IHBRDY 
* 
ISV2 
* SVl 
+ 
IRSTCPU 
* CLEN1 
* BGT 
* 
ISV3 
* 
ISV2 
+ 
IRSTCPU 
* MCACHE 
* BGT 
* 
ISV3 
* 
ISV2 


MBLAST 
,. IRSTCPU 
* 
IHBRDY 
* 
ISV3 
* SV2 
+ IRSTCPU 
* 
IMCACHE 
* 
ICLEN1 
* 
ICLENO 
* BGT 
* 
ISV3 
* ISV2 
* 
ISVl 
+ 
IRSTCPU 
* 
IIMBRDY 
* SV3 
* SV2 
+ 
IRSTCPU 
* 
IIMBRDY 
* SV3 
* 
ISV1 
+ IRSTCPU 
* 
ICLENl 
* 
ICLENO 
* 
IIMBRDY 
* SKWEND 
* 
IMKEN 
* 
ISV3 
* 
ISV2 
* SV1 


MBLAST.CLKF 
• ACLK 


IMA2 
,. RSTCPU 
* 
ICA2 
+ 
ICA2 
* 
IMBRDY 
* 
SV3 
+ 
ICA2 
* 
IIMBRDY 
* 
ISV3 
+ 
ICA2 
* 
ISV2 
* 
ISV1 
+ IRSTCPU 
* CA2 
11MB ROY * 
SV3 
SV2 
+ 
IRSTCPU 
* CA2 
IHBRDY 
* 
ISV3 
SV2 
+ 
IRSTCPU 
* CA2 
* 
IIHBRDY 
* SV3 
SV1 
+ 
IRSTCPU 
* CA2 
* IHBRDY 
* ISV3 
* SV1 


IMA3 
,. RSTCPU 
* 
ICA3 
+ 
ICAJ 
* 
ISV1 
+ 
ICAJ 
* 
IIMBRDY 
* 
ISV2 
+ 
ICAJ 
* 
ISV3 
* 
ISV2 
+ 
ICAJ 
* 1MB ROY 
* SV3 
* SV2 
+ 
IRSTCPU 
* CAJ 
* IIMBRDY 
* SV2 
* SV1 
+ 
IRSTCPU 
* CAJ 
* 
ISV3 
* SV2 
* SV1 
+ 
IRSTCPU 
* CAJ 
* 
1MB ROY 
* SV3 
* 
ISV2 
* SVl 


SKWEND 
,. RSTCPU 
+ 
IRSTCPU 
+ 
IRSTCPU 


SKWEND.CLKF 
• ACLK 


IDISB 
,. IRSTCPU * IIDISB * MKEN 
* MCACHE 
* ICLEN1 * ICLENO * 
ISV3 * ISV2 * SV1 * IMBRDY * BGT 
+ IRSTCPU * IIDISB * MKEN * MCACHE 
* ICLEN1 * CLENO * 
ISV3 * SV2 * ISV1 * IMBRDY * BGT 
+ IRSTCPU * IDISB * ( SV3 + SV2 + SV1 
) 


IDISB.CLKF 
• ACLK 


;;hi;-;~;~;-;;~hi~;-~;~;-f~;-~h;-;;;~;;-b~;:-~;~~~;-i;-;~~i~;-~~- 
;the last transaction 
on the memory bus.This state machine also 
;generates the burst address lines MA2 and MAJ. 


;It also generates 
BRDY disable 
signal IBRDY used when caching is 
;done with MRO attribute. In this case BRDY is generated only for 
;the no. of transfers 
required by the CPU. 


;The state machine 
also generates 
KWEND and SWEND to 82495. MKEN is 


Igenerated by address decoding on the debug board. 
It is valid one 


lclock after MADS. 
KWEND is generated at that time. 
• 


Input 
82495DX 
Source 
Input 
Value 
Explanation 
Input 
(state 
During 
Reset 
machine, 
etc.) 


CFG3 
Tied low 
0 
82495DX 
initialization 


KWEND#< [CFG2] 
XWND 
0 
82495DX 
initialization 


SWEND#<[CFG1] 
XWND 
0 
82495DX 
initialization 


CNA#<[CFGO] 
Tied high 
1 
82495DX 
initialization 


CPLOCK#< 
XKEN PLD 
Depends 
on MBC signal 
Enables/ disables 
PLOCK 
[CPLOCKEN] 
CPLOCKEN 
(J13) 
functionality 


BGT#<[CLDRV] 
XMADS 
1 
Select low capacitance 
drivers for 82495DX 


CRDY #<[SLFTST] 
XCRDY 
Depends 
on MBC signal 
If MAHOLD 
is asserted, 
SLFTST #<(J2) and MAHOLD 
CPU-Cache 
self-test 
is 
from host system. 
executed 
following 
reset. 


MALE [WWOR #<] 
XMBReq 
0 
Select weak write ordering 


MBALE[HIGHU] 
Tied high 
1 
Required 
for selftest 


MFRZ#< [MDLDRV] 
XMEOCPLD 
1 
Normal capacitance 
data bus 
output drivers 


SYNC#< [MALDRV] 
Tied high 
1 
Normal address 
bus output 
drivers 


MSEL#<[MTR4/MTR8#<] 
XMADS 
1 
Four transfers 
per Mbus 
transaction 


MZBT #<[MX4/MX8 
#<] 
Tied high 
1 
Four I/O pins per 82490DX. 


SNPCLK[SNPMD] 
Jumper 
14 
high or low 
Select either synchronous 
or 
strobed 
snoops 


• 
The clock skew from PCLK 
to CLK[I:4] 
is calculat- 
ed from the GAl210 
data sheet. It works out to be 
SkewA = ± I ns. Similarly 
for PCLK 
to CLK[S:6]. 
Therefore, 
the worse case skew from 
the Module's 
SO MHz 
clocks 
to the MBC's 
SO MHz 
clocks is 
±2 ns. 


• 
DMCLK 
is nominally 
delayed 2 ns from 
MCLK. 
The worse case clock skew can be calculated 
from 
the GAlllO 
data sheet. It works out to be SkewB = 
I.S ns min/2.S 
ns max. 


• 
BRDY #<, CRDY #<: Required 
Setup = 8 ns, Hold 
= 3 ns. 


To satisfy the CPU-Cache module's input hold time, 
the 
BRDY #< and 
CRDY #< signals 
are wrapped 
around inside the PALI6R4 
in order to increase the 


minimum 
valid delay time. 


Setup = Clock 
Period 
- 
SkewA 
- 
PAL16R4_5 
tea 
max - 
PAL16R4_5 
tpD max 
20-2-4-5=9ns 


Hold 
PAL16R4_5 
tea min + PAL16R4_5 
tpD min 
- 
SkewA 


=3 + 3 - 2 = 4 ns 
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• 
KWEND#, 
SWEND#, 
BGT#: 
Required 
Setup = 
6 ns, Hold = 3 ns. 


In this case the hold time is more easily met by con- 
tinuing 
to assert 
these signals past the CLK 
cycle in 


which 
they are sampled 
by the CPU-Cache 
module. 


Setup 
= Clock Period 
- 
SkewA 
- 
B5C060 teo 
max 
(asynchronous 
mode) 


20 - 
2 - 
12 = 6 ns 


• 
MRO#, 
MKEN# 


From 
the schematics 
and PLD codes, MKEN 
# gen- 
eration 
experiences 
three 
propagation 
delays 
from 


the 
cache 
address-twice 
through 
a 
85C224 
and 


once 
through 
a 20L8_5. 
The 
maximum 
delay 
al- 
lowed 
was calculated 
in Section 
5.3.4.1 to be 63 ns. 
After 
accounting 
for 
clock 
skew, 
this 
reduces 
to 
61 ns. 


Tdecode 
= 2 
• 
B5C224 
tpD 
max 
+ 
PAL20IB_5 
tpD max 


2 • 10 ns + 5 ns = 25 ns 


• 
MEOC#: 
Required 
Setup = 4 ns, Hold = 3 ns. 


Setup 
= Clock Period 
- 
SkewS 
max - 
PAL20LB_5 
tpD max 


20 - 
2.5 - 
5 = 12.5 ns 


Hold 
PAL20LB_5 tpD min + SkewS min 


3 + 1.5 = 4.5 ns 


• 
MBRDY490#: 
Required 
Setup 
= 
5 ns, Hold 
= 


2 ns 


Setup 
= Clock Period 
- 
SkewS 
max - 
PAL20LB_5 


tpD max 


20 - 
2.5 - 
5 = 12.5 ns 


Hold 
PAL20LB_5 tpD min + SkewS min 


3 + 1.5 = 4.5 ns 


The 
following 
signals 
are 
used 
as 
inputs 
by 
the 


85C060's 
in asynchronous 
clocked 
mode 
in which 
the 


setup 
and hold times ae 2 ns and 2 ns respectively. 


• 
CADS#, 
SNPADS#, 
CDTS#: 


Setup = Clock Period - 
SkewA - 
Module Valid Delay 
max (t32) 


20 - 
2 - 
12 = 6 ns 


Hold 
Module Valid Delay min (t32) - 
SkewA 


4 - 
2 = 2 ns 


• 
CLENI, 
CLENO, 
MCACHE#, 
RDYSRC, 
PALLC: 


Easily 
satisfies 
setup 
since these signals 
are sampled 


after the CLK 
period 
in which 
they are valid. 
Hold 
requirement 
is met since these 
signals 
remain 
valid 


throughout 
the bus cycle. 


• 
HLDA#: 


Setup 
= Clock Period - 
SkewA - 
Module Valid Delay 
max (t17) 


20 - 
2 - 
14 = 4 ns 


Hold time is not important since the active edge 
is not 
used in the MSC. 


Documents 
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to this Applications 
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Clock 
Design 
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Intel486TM 
Systems, 
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Number 
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An 
Application 
Note 
on 50 MHz 
Intel486TM 
DX 
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Module 
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Number 
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Designing 
a Memory 
Bus Controller 
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Write-Through 
EISA 
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Architectural 
Overview 
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(Order 
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This section discusses CPU performance optimization 
techniques for the Intel486TM DX2 microprocessor. 
The reader should be familiar with the Inte1486™ OX 
microprocessor as well as knowledgeable about memo- 
ry systems and cache architectures. For further refer- 
ence, the reader is directed to the following documents 
and application notes (corresponding Intel order num- 
ber is shown in parentheses): 


• Inte1486™ 
DX2 
Microprocessor 
Data 
Book 


(241245-00I) 


• Intel486™ 
OX 
Microprocessor 
Data 
Book 


(240440-004) 
• Inte1486™ OX Microprocessor 
Hardware 
Refer- 
ence Manual (240552-001) 


• Cache Tutorial 1991 (296543-002) 
• AP447: A Memory Subsystem for the Intel486TM 
Family of Microprocessors including Second Level 
Cache (240799-002) 
• 485TurboCache 
Module 
Intel486TM DX 
Micro- 


processor Cache Upgrade (240722-002) 


2.0 
The High-Performance 
Intel486™ DX2 Microprocessor 


The 
Inte1486 DX2 
Microprocessor 
is functionally 
equivalent to the now-familiar Intel486TM OX Micro- 
processor. However, the Inte1486 DX2 CPU's internal 
core runs at twice the frequency of its external bus. 
This architecture enables a very high level of perform- 
ance while, at the same time, maintaining straightfor- 
ward system design. 


The Inte1486 DX2 CPU is partitioned such that the 
cache and write buffers operate at the full core speed as 
illustrated in Figure 2.1. As such, the processor is only 
slowed to the external bus speed on cache misses and 
when the write-buffers are full. The Inte1486 DX2 
CPU's external bus interface is identical to its predeces- 2 
sor, i.e. all system cycles emanating from the CPU look 
exactly as if they would from the Inte1486 OX CPU. 
The Intel486 DX2 microprocessor includes additional 
features such as ITAG boundary scan and power-down 
capability that are not covered in this section. 


6611Hz 
Inte1486'" OX 
Wicroprocessor 
Core 


} 


3311Hz Inle1486'" OX 
Microprocessor 
External Bus 


MS-DOS.Windows.Word.Exceland FlightSimulatorare trademarks of MicrosoftCorporation. 
UNIXis a trademark of UNIXSystem Laboratories. 
Lotus. 123 are trademarks of Lotus DevelopmentCorporation. 
Turbo C is a trademark of BorlandInternational. 
AutoCADis a trademark of AutoDesk.Inc. 


Performance optimization for the Intel486 DX2 CPU is 
subtly different than for the Intel486 DX CPU due to 
the difference in the internal architecture. This is evi- 
dent if you consider that external memory latencies 
now affect the full speed core by twice as many CPU 
clocks (refer to Figure 2.2). In other words, the memo- 
ry system should really be designed to satisfy the data 
throughput 
demands of a 50/66MHz CPU. The next 
few sections examine the situation in detail so that edu- 
cated trade-offs can be made during system design. The 
discussions will focus on CPU-cache memory perform- 
ance; I/O performance and other architectural issues 
are not addressed in this applications note. 


In an ideal system, all CPU cycles operate at zero-wait 
states and the theoretical maximum performance of the 
CPU is achieved. However, short of spending a lot of 
money on SRAMs, a real system always falls short of 
the ideal zero wait-states. There are many cache-memo- 
ry designs that differ in both architecture and imple- 
mentation. However, it may be impossible to design a 
system that performs better than all others across all 
applications; different applications 
generate different 
types of CPU bus activity and the cache-memory sys- 
tem will perform differently in each case. A range of 
statistical parameters 
may be used to illustrate this 
point: 
• Internal Cache (Ll) Hit Rate 
• Number of prefetches 
• Number of operand reads 
• Number of operand writes 
• Bus Utilization 
(amount of time spent using the 
CPU bus). 


These parameters are examined next and will be useful 
information for cache-memory design trade-offs. 


The Intel486 DX2 CPU maintains the same unified 
code/data, 
four-way 
interleaved, 
8K-byte 
internal 
cache as the Inte1486 DX CPU. The internal cache 
(Ll) hit rate is shown in Figure 2.3 for some different 
operating systems and applications. 
These hit rates 
were obtained from instruction traces captured from 
specific applications. Note that the Ll hit rate for the 
Intel486 DX2 CPU will be almost identical to that for 
the Intel486 DX CPU; the 2X-internal frequency does 
not significantly alter the cache miss statistics. 


The DOS applications included Auto Cad, Lotus123, 
Excel, Turbo C, and Flight Simulator. Lotus123 had 
the highest hit rate at 99% while Auto Cad was the 
lowest at 89.6%. The Windows 3.0 results included in- 
structions executed while clipping an image, drawing a 
dialog box, executing Excel and while executing Page- 
maker. The category UNIX-iSPEC 
refers to applica- 
tions within the UNIX SPEC benchmark suite that are 
mostly integer intensive, whereas UNIX-fSPEC refers 
to those which are floating point intensive. The last 
category UNIX-TPI 
refers to the hit rates typical while 
running the TPI transaction processing benchmark. 


These results illustrate the different nature of different 
software on the bus characteristics. 
Single-threaded 
DOS applications have a typically higher hit rate com- 
pared to the multi-tasking nature of UNIX and Win- 
dows benchmarks. The UNIX 
floating point bench- 
marks show the lowest hit rates; this is partly due to the 
large data structures typical of floating point applica- 
tions that do not fit well into the Ll cache. It is also 
due to the nature of the operations performed on those 
data structures; i.e. the application does not exhibit 
very much temporal or spatial locality. 


• 
External 
Bus Speed 


• 
Sensitivity 
to loIemory 
Latency (Internal 
Core 
Frequency) 


Figure 2.2. The Intel486™ 
DX2 Microprocessor 
Is More 
Sensitive 
to Memory 
Latency 
than Its Predecessor 
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TP 1 is a' UNIX multiuser multithreaded 
application 
with heavy amounts of disk and I/O as well as compu- 
tation. The LI cache hit rate is low since it has many 
active contexts due to multiple requests per user. 


High Ll hit rates are typical of many prevalent and 
commonly used DOS benchmarks - some have hit rates 
approaching 100%. This is unfortunate since they fail 
to properly 
account for the more realistic external 


cache and memory demands of most DOS and Win- 
dows applications. These demands will continue to be 
true with the advent of newer graphical-user-interface- 
based multi-tasking operating systems and applications. 
With these benchmarks, there is a danger of misrepre- 
senting system performance for the Intel486 DX micro- 
processor. For the Intel486 DX2 microprocessor, this 
misrepresentation can be even more damaging since LI 
cache misses incur a penalty that is twice the number of 
external 
clock 
cycles 
- 
since 
the 
CPU 
core 


now runs twice as fast internally. In other words, the 2 
Intel486 DX2 CPU is twice as sensitive to wait states 
compared to the Intel486 DX CPU. To properly gauge 
the external cache and memory performance, more de- 
manding benchmarks (e.g. UNIX SPECmarks) or real 
application benchmarks should be used. 


Different applications cause the CPU to generate a dif- 
ferent number of reads, writes and instruction prefetch- 
es. The reads and prefetches are filtered by the internal 
LI cache before reaching the external CPU bus. All 
writes propagate through to the external bus since the 
internal cache follows a write-through protocol. This is 
shown in Figure 2.4 for an instruction trace captured 
from an integer SPEC benchmark. 


__ 
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Internal 
Bus: 


Number 
of Cycle 
Types 


External 
Bus: 
Number 
of Cycle 
Types 


As with the Intel486 DX CPU, the bus cycle mix for 
the Intel486 DX2 CPU consists of mostly write cycles. 
However, the exact ratio of reads to writes is again 
application dependent. For example, for Lotusl23 that 
has a Ll hit rate of 99%, writes make up 99.5% of 
external bus cycles. 


Bus utilization refers to the amount of time that the 
CPU spends executing bus cycles for a given applica- 
tion. It is a measure of the amount of bus traffic gener- 
ated by a particular application on the CPU's external 
bus. This metric is illustrated in Figure 2.5 where the 
CPU bus is busy for 75% of the twelve external clock 
cycles shown. 


Bus utilization is dependent on the application and the 
external cache/memory 
system. Different applications 
generate different amounts of bus traffic. For example, 
some applications may have small data structures that 
fit easily within the internal cache and therefore smaller 
amounts of external bus cycles are generated. 


The Intel486 DX2 CPU will have a larger percentage 
of bus utilization for the same application as compared 
to the Intel486 DX CPU. This is due to the faster CPU 
core that can now operate twice as fast and that will try 
to generate twice as many bus cycles in the same 
amount of time. However, since the external CPU bus 
remains 
at 
a 
IX-frequency, 
it 
experiences 
heavy 
amounts of bus traffic as it trys to keep up with the 2X- 
internal core. In other words, with faster internal core 
execution, less time is spent idling on the external bus. 


Different external cache/memory 
systems also affect 
the bus utilization; faster cache/memory systems allow 
CPU cycles to complete faster and therefore free up the 
bus more. 


traces were subsequently used in a CPU-cache-memory 
simulator to extract the desired information. The re- 
sults shown here assume an ideal zero wait-state memo- 
ry system; i.e. all bus cycles complete in zero wait- 
states. 


The Intel486 DX2 CPU bus characteristics of some dif- 
ferent applications and operating systems are shown in 
Tables 2.1 through 2.3. These results are derived from 
traces captured 
from the actual applications. These 


UNIX Applications 
SPEC1 
GCC 
UNIXMIX1 


Total Number of CPU clocks simulated 
12.14M 
12.54M 
12.44M 


Overall L1 hit rate 
91.4% 
90.5% 
94.3% 
Prefetch 
hit rate 
93.6% 
91.7% 
94.8 
Read hit rate 
93.0% 
90.3% 
94.2% 
Write hit rate 
82.0% 
85.7% 
93.5% 


Number of external 
bus cycles 
1.12M 
0.882M 
1.18M 
% bus code prefetches 
14.5% 
22.6% 
10.2% 
% bus data reads 
·10.0% 
20.0% 
8% 
% bus data writes 
75.5% 
57.4% 
81.8% 


Bus Utilization 
51.6% 
46.4% 
51.4% 


Windows 
Applications 
Word 
Excel-Calc 
Pagemaker 


Total Number of CPU clocks simulated 
27.02M 
7.39M 
30.06M 


Overall L1 hit rate 
95.2% 
78.4% 
88.1% 
Prefetch 
hit rate 
96.9% 
76.0% 
81.8% 
Read hit rate 
98.0% 
85.7% 
95.2% 
Write hit rate 
87.7% 
69.7% 
83.5% 


Number of external 
bus cycles 
2.43M 
0.654M 
3.04M 
% bus code prefetches 
4.2% 
27.9% 
19.3% 
% bus data reads 
3.4% 
15.1% 
6.7% 
% bus data writes 
92.4% 
57.0% 
74% 


Bus Utilization 
40.9% 
60.1% 
56.0% 


DOS Applications 
Excel 
TurboC 
Auto 
Cad 


Total Number of CPU clocks 
11.1M 
13.9M 
16.1M 
simulated 


Overall L1 hit rate 
98.2% 
95.6% 
89.3% 
Prefetch 
hit rate 
98.8% 
94.2% 
87.7% 
Read hit rate 
97.9% 
98.1% 
96.5% 
Write hit rate 
97.4% 
93.8% 
81.3% 


Number of external 
bus cycles 
1.06M 
1.18M 
1.77M 
% bus code prefetches 
2.1% 
11.2% 
11.8% 
% bus data reads 
3.1% 
3.0% 
4.0% 
% bus data writes 
94.8% 
85.8% 
84.2% 


Bus Utilization 
41.0% 
40.6% 
55.5% 


• 


The UNIX applications are described as: 


• SPECI: A mixture of integer SPEC benchmark suite 
programs running concurrently. 
• GCC: SPEC benchmark 
suite GNU C compiler, 
compiling itself. 


• UNIXMIXI: 
A mixture of UNIX utility programs 
like awk and grep, running concurrently. 


The Microsoft Windows 3.0 applications are described 
as: 
• Word: Microsoft Word for Windows converting a 
document for import (no VGA activity, includes 
kernel calls). 


• Excel-Calc: 
Microsoft Excel for Windows running 
a calculation. 


• Pagemaker: Pagemaker for Windows formatting a 
document (no VGA activity, includes kernel calls) 


The three DOS applications shown are described as: 
• Excel: Microsoft Excel (DOS version) recalculating 
a spreadsheet. 


• Turbo C: Borland's Turbo C compiler compiling a 
large C program. 
• Auto Cad: Auto Desk's Auto Cad program comput- 
ing and displaying a drawing (reason for low hit 
rate) 


Note that most DOS applications will typically use the 
external bus much less than UNIX or Windows appli- 
cations. On the other hand, heavy duty DOS applica- 
tions such as Auto Cad will actually exhibit a larger 
demand for the external bus similar to the demands of 
UNIX and Windows. Also, recall that the bus utiliza- 
tion numbers shown assume a zero wait-state memory 
system; more realistic external cache/memory 
systems 
with a finite number of wait-states will actually experi- 
ence a larger percentage of bus utilization. Finally, note 
that since the L1 hit rate for the DOS applications is 
high, the external bus mix consists of mostly writes. 


intel~ 


Now that we have considered the characteristics of LI 
hit rates, bus cycle mix and utilization, we can examine 
the impact that the cache-memory system has on over- 
all Intel486 DX2 CPU performance. To examine these 
effects, traces were captured from three applications 
(one from each operating environment) and were used 
in a CPU-cache-memory 
simulator 
to measure the 
CPU 
performance 
under 
different 
conditions. 
The 
traces used were SPECI 
(UNIX), 
Pagemaker (Win- 
dows) and Turbo C (DOS). The simulator used is an 
accurate and convenient method of comparing per- 
formance by varying different parameters independent- 
ly. This allows us to develop some heuristic rules to 
guide system design. Before continuing, the following 
convention is defined to denote memory performance: 


Notation Convention: 
A memorysystem's performance is abbreviated as: 
Lead-offclocks - Burst 2 - Burst 3 - Burst 4, Write 
clocks 
e.g. 3-1-2-1, 3 wouldlooklike: 


External 
Clack 
I I I I I I 


ADS* 


BRDY* 


BURST 
READ 
WRITE 
241261-6 
A zero-wait-state case corresponds to a 2-1-1-1, 2 
memorysystem. 


2.5.1 
THE IDEAL 
ZERO 
WAIT 
STATE 
MEMORY 
SYSTEM 


As a starting point, the ideal zero-wait-state memory 
system was characterized 
for three applications. For 
I/O cycles, it was assumed that a constant 8 wait-states 
were required. Since the I/O instructions were a small 
portion of the instruction traces used, any inaccuracy 
due to this assumption will be insignificant. 


intel~ 


Table 2.4. Number 
of Internal 
Clocks (millions) 
Needed 
to Complete 
Application 
Trace 


SPEC1 
Pagemaker 
TurboC 


Intel486 OX CPU 
10.76 
26.68 
13.34 


Intel486 OX2 CPU 
12.14 
30.06 
13.90 


As an example, we can now compare the actual time 
required 
to 
complete 
the 
applications 
between 
a 
66MHz Intel486 OX2 CPU and a 33MHz Intel486 OX 
CPU. The number of clock cycles is multiplied by l5ns 
for the Intel486 OX2 CPU and by 30ns for the Intel486 
OXCPU. 


SPEC1 
Pagemaker 
TurboC 


Intel486 OX CPU 
323 ms 
800 ms 
400 ms 


Intel486 OX2 CPU 
182 ms 
451 ms 
209 ms 


Performance Increase 
+77% 
+77% 
+91.4% 


Note that although the Intel486 OX2 CPU's internal 
clock rate is twice that of the corresponding Intel486 
OX CPU, the relative improvement is less than 100%. 
This is due to cache miss reads and write cycles that 
run at the external bus speed. Note that the improve- 
ment is much greater for Turbo C which has a lower 
cache miss rate and low bus utilization. 


As wait states are added to the ideal zero wait state 
memory system, performance degrades. Three memory 
parameters are of interest in characterizing the memory 
performance. They are: 


• Number of wait states added to the first ready of a 
read (a.k.a. the lead-off cycle). e.g. One wait-state 
with zero wait-state burst = 3-1-1-1 


• Number of wait states during the remainder of the 
burst cycle. e.g. a zero wait-state lead-off with a one 
wait-state burst = 2-2-2-2, 


• Number of wait states on a write cycle. e.g. a one 
wait-state write takes three clocks. 


To examine the impact of adding wait-states to each of 
the parameters above, three series of simulations are 
done where wait states are added to each memory pa- 
rameter separately while the other memory parameters 
are held constant: 


Zero 


walt· 
1 walt· 
2 walt· 
3 walt· • 


states 
state 
states 
states 


• Lead-off 
2-1-1·1,2 
3-1-1·1,2 
4·1·1·1,2 
5·1·1·1,2 ... 


Series: 


• Burst Series: 
2·1·1·1,2 
2·2·2·2,2 
2-3-3·3,2 
2·4·4·4,2 


• Write Series: 
2-1·1·1,2 
2·1-1·1,3 
2·1·1·1,4 
2·1·1-1,5 ... 


As the number of wait states increases, the number of 
clocks needed to complete the application increases 
(and the CPU performance decreases). This series of 
measurements is used to separate out the dependency of 
the CPU performance on the different memory parame- 
ters. This information is useful for subsequent external 
cache/memory design trade-offs. 


The number of clocks needed to complete the. applica- 
tion relative to the zero wait-state case is referred to as 
the relative total execution time. This metric will be 
used in the following graphs instead of the reciprocal of 
total execution time which would be CPU performance; 
this is so that any inherent linear relationships between 
wait-states and execution time can be more easily rec- 
ognized. To translate the total execution time back to 
CPU performance, the following table is provided for 
convenience (100% refers to the zero wait-state case): 


Total Execution 
Time 


CPU Performance 


Figures 2.7 and 2.8 show the total execution time as 
wait states are added for the SPEC 1 trace described 
earlier. 


As would be expected, as wait states are added, the 
relative total execution time for the Intel486 DX2 CPU 
increases faster than the Intel486 DX CPU. (however, 
note that the absolute total execution time for a In- 
tel486 DX2 CPU will never be greater than a Intel486 
DX CPU of the same external bus speed). Also note 
that the order of importance of the memory parameters 
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changes between the Intel486 DX and the Intel486 
DX2 CPU. For the Intel486 DX CPU, burst perform- 
ance is the most important, with read-lead-off perform- 
ance being slightly more important than writes. This 
conclusion was presented in Chapter 4 of the original 
Intel486 
DX 
Microprocessor 
Hardware 
Reference 


Manual (Order Number 240552-0(1). However for the 
Intel486 DX2 CPU, while burst performance is still the 
most important, write performance becomes more im- 
portant than read-lead-off performance. 


Figure 2.7.lnteI486 
DX2 CPU Performance 
Degradation 
as Walt States 
are Added - for the SPEC1 Application 
Trace (UNIX) 
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Figure 2.8. Intel486 
OX CPU Performance 
Degradation 
as Walt States 
are Added - for the SPEC1 Application 
Trace (UNIX) 
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show the burst cycles as being the most sensitive to wait 
states. However, note that the lead-off cycle is now 
more important than writ~ cycles. This is due to the 
small percentage of bus writes (57.4%) while running 
the GCC application as compared with the SPECI 
trace. 


Figures 2.9 and 2.10 show the total execution time as 
wait states are added for the GCC trace described earli- 
er. 


Compared to the results for the SPECl 
application 


trace, the GCC results with the Intel486 DX2 CPU still 
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Figure 2.9. Intel486 
DX2 CPU Performance 
Degradation 
as Walt States 
are Added· 
for the GCC Application 
Trace (UNIX) 
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Figure 2.10. Intel486 
OX CPU Performance 
Degradation 
as Walt States 
are Added· 
for the GCC Application 
Trace (UNIX) 
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The results for the Pagemaker Application under Win- 
dows are shown in Figures 2.11 and 2.12. 


earlier. Basically, the degradation of the Intel486 DX2 
CPU's relative execution time increases faster as wait- 
states are added as compared to the Intel486 DX CPU. 


The same conclusions can be made for the Pagemaker 
application under Windows as for the SPECI results 
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Figure 2.11. Intel486 
DX2 CPU Performance 
Degradation 
as Wait States 
are Added· 
for the Pagemaker 
Application 
Trace (Windows) 
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Figure 2.12. Intel486 
DX CPU Performance 
Degradation 
as Wait States 
are Added· 
for the Pagemaker 
Application 
Trace (Windows) 


external bus utilization of the application. However, the 
degradation is still about twice what it is for the In- 
tel486 DX CPU. Note that the write importance is 
about equal to the burst importance in this case. This 
can be attributed to the greater percentage of writes in 
the bus cycle mix for this application. 


Finally, the results for the Turbo C application under 
DOS are shown in Figures 2.13 and 2.14. 


The rate of performance degradation for the Intel486 
DX2 CPU with the Turbo C application is less than the 
UNIX and Windows examples. This is due to the lower 
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Figure 2.13. Intel486 
DX2 CPU Performance 
Degradation 
as Wait States 
are Added· 
for the Turbo C Application 
Trace (DOS) 
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Figure 2.14. Intel486 
OX CPU Performance 
Degradation 
as Wait States 


I 
are Added. 
for the Turbo C Application 
Trace (DOS) 
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The main reason why the relative performance of the 
Intel486 DX2 CPU degrades faster than the Intel486 
DX CPU is that for every external wait state, two inter- 
nal clock delays are caused. In fact, a zero wait state 
cycle on the external bus of the Intel486 DX2 CPU is 
already a two-wait state cycle as experienced by the 2X- 
clock internal core as shown in Fig. 2.15. The imagi- 
nary 
2X-clock versions of the signals ADS# 
and 
BRDY # illustrate what the cycle might have looked 
like if the internal bus frequen cy was equal to the ex- 
ternal. 


Extending this fact, this means that a one wait-state 
cycle for the Intel486 DX2 CPU is actually equivalent 
to a four wait-state cycle for the 2X-clock internal CPU 
core. 


The internal cycle start indication conditions may also 
have a one internal clock cycle synchronization penalty 
if it is active in the wrong phase of the external clock 
(also shown in Figure 2.15). 


As the effective number of wait states increases, the 
CPU will stall program execution differently for each of 
the three memory parameters 
described above. The 
stall conditions for each memory parameter are elabo- 
rated below. 
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Wait states incurred on the first ready of a external 
read (the lead-off cycle) affect both data reads and code 
prefetches. For data reads, the CPU's 
execution is 


stalled under most conditions; no other operation can 
happen in parallel until the first ready of the line fill is 
received. For code prefetches, execution is stalled if the 
processor is fetching code as a result of a code branch 
(therefore flushing the prefetch buffers). 


For most applications, the read lead-off delay increases 
the execution time the least compared to the other pa- 
rameters. This is because writes cycles usually make up 
the dominant share of the bus cycles. However, there 
are exceptions to this case; for example, with the GCC 
trace, only 57.4% of the bus cycles were writes. 


Adding wait states to the burst cycle increases the exe- 
cution time the most. The burst is usually the result of a 
cache line fill or a code prefetch. Adding wait states to 
this parameter ties up the bus for the longest periods of 
time compared to adding the same number of wait 
states to the other parameters. As a result, all subse- 
quent external bus requests are stalled as the CPU waits 
for the burst cycle to complete. These include stalls 
while the CPU waits to do a read cycle. A longer burst 
cycle also delays the rate at which the internal write 
buffers can be emptied since the write buffers must also 
wait for the external bus to free up. This causes stalls as 
described below for write cycles. 
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Finally, longer code prefetch bursts will slow down 
CPU execution if the prefetch was a result of the pre- 
fetch queue being flushed. This is especially so if the 
instruction required extends beyond the first dword of 
the burst and therefore the CPU must wait for subse- 
quent dwords before execution can start. 


There are three conditions under which a longer write 
cycle will stall CPU execution as additional wait states 
are added. These conditions are: 
I. The write buffers are full and cannot accept any 
more writes. 
2. A read cannot bypass the write buffers and must 
wait for them to be flushed. 


3. A read bypasses the write buffers but must wait for 
an existing write cycle to complete. 


Before these effects are elaborated, it is worthwhile to 
reexamine the operation of the internal write buffers. 


The Intel486 OX2 CPU uses the same four-deep write 
buffers as the Intel486 OX CPU. The write buffers can 
accept data writes from the execution core as fast as 
one per clock. Once a write request is buffered, the 
internal unit that generated the request is free to contin- 
ue processing. When all write buffers are full, any sub- 
sequent write transfer will stall inside the processor un- 
til a write buffer becomes available. 


The bus interface unit can re-order pending reads in 
front of buffered writes. This is done because pending 
reads can prevent an internal unit from continuing, 
whereas buffered writes need not have a detrimental 
effect on processing speed. Writes are propagated to the 
external bus in the same first-in-first-out order in which 
they are received from the internal unit. However, a 
subsequently generated read request (data or instruc- 
tion) may be reordered in front of buffered writes. As a 
protection against reading invalid data (reading stale 
data from a location in main memory when the location 
has been modified in the write buffers), this reordering 
of reads will only occur if all buffered writes are inter- 
nal cache hits. Because an external read will only be 
generated for a cache miss, and will only be reordered 
in from of buffered writes if all such writes are internal 
cache hits, any read generated on the external bus will 
never read a location that is about to be written by a 
buffered write. 


This reordering can only happen once for a given set of 
buffered writes, because the data returned by the read 
cycle could otherwise replace data about to be written 
from the write buffers. 


The first condition that causes CPU stalls is when the 
write buffer is full. Write wait states decrease the rate 


at which the write buffer can be emptied. Since the 
Intel486 OX2 runs at a 2X-internal frequency, the like- 
lihood of filling up the write buffers increases when 
compared to the Intel486 OX CPU as shown in Figure 
2.16. 


postedL 


Writes 
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Figure 2.16. The Intel486 
OX2 CPU's 
Write Buffers 
are More Heavily 
Used than the Intel486 
OX CPU's 


The second situation that degrades performance is dur- 
ing reads which cannot bypass the write buffers - either 
because the buffered writes were cache misses or be- 
cause a read reordering had already occurred. These 
reads will be stalled until the write buffers are emptied. 
The more wait states required for writes on the external 
bus, the longer these stalls will last. 


Finally, reads which can bypass the write buffers may 
be stalled by a write already in progress on the external 
bus. This condition is illustrated in Fig 2.17 for both 
the Intel486 OX and Intel486 OX2 CPUs. Note that 
for this example, although both the Intel486 OX2 and 
Intel486 OX CPUs take the same amount of time to 
complete the instruction 
stream, the Intel486 OX2 
CPU is stalled longer (waiting for the write to com- 
plete) relative to its own internal 2X clock. 


Out of the three conditions described above, stalls on 
write buffers full and stalls because of reads on busy 
writes dominate the increase in execution time as wait 
states are added to write cycles as shown in Table 2.7 
for the SPECI trace. (The results shown in Figure 2.7 
assume that the read and burst cycles complete in zero 
wait-states). These effects will be discussed again later 
when the addition of external write buffers is consid- 
ered. 
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Figure 2.17. Reads Stalled as a Result of a Write Already 
In Progress 


Table 2.7 Percentage 
of Total Execution 
Time Stalled under the Three Different 
Write Stall Conditions 


Intel486 OX 
Intel486 OX2 
CPU 
CPU 


Write walt states 
0 
1 
2 
0 
1 
2 


Stalls on write 
0.0% 0.1% 0.6% 1.0% 3.1% 6.5% 
buffers full 


Reads cannot 
0.1% 0.1% 0.3% 0.3% 0.5% 0.5% 
overtake writes 


Stalls because 
0.6% 1.3% 2.2% 2.4% 4.8% 7.5% 
of reads on 
busy writes 


3.0 
Memory Design Optimization 


Some Intel486 DX2 CPU-based designs will include a 
memory system without an external cache. This section 
covers the design of such a cacheless memory system. 
Different memory architectures are discussed and the 
benefits of improving write performance through the 
addition of external write buffes will also be considered 
(see Figure 3.1). 


Main memory performance will be important for exter- 
nal cache-based designs also, especially for applications 
with low external cache hit rates. It is recommended 
that the performance impacts of design choices in a 
cacheless memory design are understood even if you 
have already specified an external cache in your design. 


Figure 3.1 The Two Cacheless 
Intel486 
DX2 CPU·Based 
Systems 
Considered 


As discussed in the previous sections, the Intel486 DX2 
Microprocessor requires a fast memory system for opti- 
mum performance. Memory systems that may have 
been adequate for Intel486 DX CPU designs running 
DOS applications may be suboptimal for the Intel486 
DX2 CPU, especially running today's more demanding 
operating systems and applications. 


Main memory page-mode operation and interleaving 
techniques are important for Intel486 DX2 CPU per- 
formance. These are commonly used in existing, well- 
designed Intel486 DX CPU memory systems. Howev- 
er, some systems still use non-interleaved memory de- 
signs borrowed from Inte1386 DX systems. These will 
be less than optimal for a high performance Intel486 
DX2 CPU workstations design. 
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Page-mode main memory controllers can be imple- 
mented in several fashions. Typical memory systems 
utilize paging for all accesses - during the beginning of a 
read, during read bursts and during write cycles; i.e. the 
RAS 11 line is held active after all accesses and only 
returned 
inactive during a page miss. Alternatively, 
paging may be used only for the burst portion of a read 
cycle; the RASlI line always returns inactive after the 
read or write cycle has been completed. This method is 
more commonly used in conjunction with a write-back 
external cache as discussed in Section 4.4. 


For read burst cycles, the 16-byte linefill of data or 
code will always lie within a DRAM page, thereby al- 
lowing the data or code to be strobed out of memory 
with a series of back-to-back CAS 11 pulses. Paging al- 
lows for a much faster burst cycle compared to the case 
where a full RASlI-CASlI 
cycle is required for each 
dword. A page mode burst read access to a single bank 
of memory is shown in Fig. 3.2. 


A paged memory system also allows for faster back-to- 
back write cycles. As was true for the Intel486 DX 
CPU, 
the Intel486 
DX2 
CPU 
generates writes in 
strings of two, about 60%-70% of the time, and writes 
in strings of three about 40%-50% of the time. This bus 
characteristic 
accounts for a large page hit rate for 
writes; therefore, it is faster to perform the back-to- 
back write cycles in a fast page mode rather than per- 
forming a full RAS 11-CAS 11 cycle for each write. 


At this point, it is worthwhile to examine the page hit/ 
miss ratio for the three applications considered in the 
previous section. These results are shown in Table 3.1 
and assume no external cache and a page size of 8192 
bytes. 


SPEC1 
Pagemaker 
TurboC 


Read Page Hit 
31.2% 
26.4% 
25.4% 
Read Page Miss 
68.8% 
73.2% 
74.6% 


Write Page Hit 
65.5% 
68.8% 
68.9% 
Write Page Miss 
34.5% 
31.2% 
31.1% 


Note the low page hit ratio for CPU reads. This is due 
to the internal cache of the Intel486 DX2 CPU that 
filters read requests and tends to make the cache miss 
reads more randomly 
distributed 
throughout 
main 
memory. 


Interleaving involves the use of more than one bank of 
memory; different banks are controlled separately. As 
an access is occurring, the other banks are being read- 
ied for the next access. Interleaving can be implement- 
ed in several ways. Horizontally interleaved banks gen- 
erate accesses for consecutive locations in memory. For 
example, one bank can be designated as an odd dword 
and another for the even dword. Vertically interleaved 
banks separate large contiguous regions of memory be- 
tween banks; i.e. multiple DRAM 
pages are open. 


Memory controllers often combine both methods. 


Horizontal interleaving can be combined with paging to 
generate very quick burst reads. Two 32-bit banks can 
generate a zero wait-state burst as detailed in the Intel 
Applications Note AP447 "A Memory Subsystem for 
the Intel486™ DX Family of Microprocessors includ- 
ing Second Level Cache." Fig. 3.3 illustrates a burst 
read cycle from a paged-interleaved memory system. 
The signals CASO11 and CAS I 11 drive each of the two 
32-bit banks of memory in this example. 
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Some implementations of a two-bank interleaved mem- 
ory may be limited to a 1-2-1 burst cycle for the last 
three dwords. This is mainly limited by the amount of 
time it takes to invert the A3 address line between the 
second and third dwords. 


3.3 
Memory Read Performance 
Considerations 


Seven memory systems with different read performance 
parameters 
are examined; write performance is kept 
constant during these simulations. The memory param- 
eters are as follows: 


Systems A through D represent the performance of 
some typical page mode memory controllers while the 
performance of systems E through G would require a 
paged-interleaved memory controller. 
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Table 3.2. Memory 
Systems 
used 
for the No-Cache 
System Test 


Read 
Read 
Write 
Write 
Page 
Page 
Page 
Page 
Hit 
Miss 
Hit 
Miss 


System A 
4-3-3-3 
8-3-3-3 
3 
6 
System B 
4-2-2-2 
8-2-2-2 
3 
6 
SystemC 
4-2-2-2 
7-2-2-2 
3 
6 
System D 
3-2-2-2 
8-2-2-2 
3 
6 
System E 
3-1-2-1 
7-1-2-1 
3 
6 
System F 
3-1-2-1 
6-1-2-1 
3 
6 
System G 
3-1-1-1 
6-1-1-1 
3 
6 


The results for the Intei486 DX2 CPU with these mem- 
ory systems is shown in Figure 3.4 through Figure 3.6 
for the SPECI. Pagemaker and Turbo C traces used 
previously. The graphs show the total execution time 
relative to the ideal zero-wait state memory system. 


Figure 3.4. Totallntel486 
DX2 CPU Execution 
Time 
versus Memory 
Read Performance 
- for SPEC1 (UNIX) 
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Figure 3.5. Totallntel486 
DX2 CPU Execution 
Time versus 
Memory 
Read Performance· 
for Pagemaker 
(Windows) 
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Figure 3.6. Totallntel486 
DX2 CPU Execution 
Time versus 
Memory 
Read Performance· 
for Turbo C (DOS) 


The interesting points to note here are: 


• As expected, the DOS application suffers the least 
from slow memory performance. 


• Burst performance is very important. Note the im- 


provement from system A to B, system D to E and 
even from system F to G (where one clock was re- 
moved from the third burst). 


• Since the read page hit ratio is lower than 50%, 
improving the read page miss lead-off cycle is more 
important than the read page hit lead-off cycle. Note 
the improvement from system B to C versus the im- 
provement from system B to D. 


3.4 
Memory Write Performance 
Considerations 


There are several methods of improving the write per- 
formance of the memory system. These methods are 
first described; the benefits of the different methods are 
discussed later. 


The most common method for improving write per- 
formance is to employ page mode accesses to DRAM. 
As shown in Table 3.1, the page hit ratios for write 
cycles favor the use of page mode accesses. An example 
of a DRAM write cycle is shown in Fig. 3.7. On page 
hits, back-to-back three clock write cycles can be main- 
tained. A page miss write would of course take an addi- 
tional number of clocks to allow for the RAS# 
pre- 


charge time. 
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Memory write performance can be improved further by 
two related methods: write butTering and pipelining. If 
one external write butTer is added to the memory sys- 
tem shown in Fig. 3.7, the write cycles in Fig. 3.8 may 
be observed: 


In the example shown, the one level of write butTering 
allows the first ready signal to be returned one clock 
earlier. The first write cycle finishes in two clocks (zero 
wait states); however, if the CPU puts out many back- 
to-back writes (as is typical), the memory system will 
still be limited to a throughput 
of three clock writes 
subsequent to the first write cycle. If the CPU write is 
not followed immediately by any bus traffic, the one 
write butTer does relieve the CPU quickly, especially if 
the write was a page miss. 


More than one level of write butTering is sometimes 
employed. This would allow multiple writes to be ac- 
cepted at zero wait states before wait states of the main 


memory system atTect the CPU bus. To get the maxi- 
mum benefit from multiple write butTering, CPU reads 
that occur when there are more than one writes pend- 
ing in the external write butTers should be allowed to 
bypass the writes and be executed as soon as the exist- 
ing memory write is complete. This is similar or course 
to the internal write butTersof the Intel486 CPU. If the 
read cycle's address corresponds to an address already 
in the write butTers, the read must wait until the corre- 
sponding write completes so that the read does not 
fetch stale data from memory. In other words, care 
must be taken'to 
ensure data consistency when using 
external write butTers. 


Write pipelining extends the use of butTering by over- 
lapping the memory controller operations during the 
write cycle. An example is shown in Fig. 3.9 below. The 
data phase of the last write cycle (CAS # pulse) is over- 
lapped in time with the address phase of the next mem- 
ory write cycle (Page hit/miss decoding, etc.). 
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With pipelining, it is possible to achieve a throughput 
of many two-clock back-to-back page hit writes. (An 
example of write pipeliningcan be found in Intel Appli- 
cations Note AP447) Note that pipelining may affect a 
subsequent read cycle; if the CPU read occurs immedi- 
ately after the write, the beginning of the read will be 
delayed until the DRAM write cycle has completed. 
This is especially true for page miss writes where the 
write may take several clocks to complete. Note also 
that pipelined memory write systems can be combined 
with write buffering;this helps for the case where many 
back-to-back page miss writes occur. 


Note that both write buffering and/or pipelining re- 
quire a data path device between the CPU and main 
memory (see Fig. 3.10). This is needed to capture the 
data from the CPU before RDY # or BRDY # is re- 
turned, after which point the data will become invalid. 


Figure 3.10. A Data Latch Is Required 
for Write Buffering 
or Plpellnlng 


To understand the performance benefits of the various 
methods described, systems Band 
F from the earlier 
simulations for read performance are repeated with dif- 
ferent write performance parameters (refer to Table 
3.3). Systems Band F were chosen as typical represen- 
tations of paged and paged-interleaved memory con- 
trollers respectively. 


• 


Read 
Read 
Write 
Write 
Write Method 
Pg Hit 
Pg Miss 
PgHlt 
Pg Miss 


System 
81 
4-2-2-2 
8-2-2-2 
3 
6 
Normal 
System 82 
4-2-2-2 
8-2-2-2 
3 
6 
One buffer 
System 83 
4-2-2-2 
8-2-2-2 
2 
6 
Pipelined 
System 84 
4-2-2-2 
8-2-2-2 
2 
5 
Pipelined 
System 85 
4-2-2-2 
8-2-2-2 
2 
5 
Pipelined with two buffers 
System 
86 
4-2-2-2 
8-2-2-2 
2 
5 
Pipelined with four buffers 


System F1 
3-1-2-1 
6-1-2-1 
3 
6 
Normal 
System F2 
3-1-2-1 
6-1-2-1 
3 
6 
One buffer 
System 
F3 
3-1-2-1 
6-1-2-1 
2 
6 
Pipe lined 
System 
F4 
3-1-2-1 
6-1-2-1 
2 
5 
Pipelined 
System 
F5 
3-1-2-1 
6-1-2-1 
2 
5 
Pipelined with two buffers 
System F6 
3-1-2-1 
6-1-2-1 
2 
5 
Pipelined with four buffers 


The memory systems above were simulated again for the Intel486 DX2 CPU with the three applications. The results 
are shown in Fig. 3.11 through Fig. 3.13 below: 
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From the results shown, the following significantly im- 
proved CPU performance: 


• Reducing the number of clocks for page hit writes 


(system B2 to B3 and F2 to F3) 
• Reducing the number of clocks for page miss writes 


(system B3 to B4 and F3 to F4) 


The following caused marginal improvement in the exe- 
cution time: 


• Adding one level of buffering (from memory sys- 


tems BI to B2 and FI to F2) 


• Adding more than two write buffers (from B4 to B5 


to B6 and F4 to F5 to F6) resulting in only a 1-2% 
improvement. 


These results may be somewhat surprising considering 
the earlier graph showing performance degradation as 


intel~ 


the number of write wait states increases (Fig. 3.7). One 
would expect that adding write buffers would compen- 
sate for the slower memory write system more than 
they do. In order to understand the results, consider the 
statistics in Table 3.4 for processor execution stalls as a 
result of write activity as described in Section 2.5.3. The 
statistics are shown for the SPEC I trace. 


Note that the Stalls Because of Reads On Busy Writes 
dominates the increase in execution time for memory 
system FI compared to the zero wait state case. Adding 
one write buffer (from system FI to F2) - in an attempt 
to improve performance - decreases the percentage of 
stalls on a full write buffer from 5.3% to 4.5%. Howev- 
er, while the number of Stalls Because of Reads on 
Busy Writes did decrease, the wait states were simply 
transferred to stalls while waiting for the first ready of a 
read and no net imp rovement is observed. One exam- 
ple of this situation is illustrated in Fig. 3.14. 


Percentage 
of Total Execution 
Time Stalled: 


Because 
a 
On Write 
Because 
of reads 
Read Cannot 
Waiting 
for 
Buffers 
Full 
on Busy Writes 
Overtake 
a 
First Ready 
Write 


Zero Wait State Case 
1.0% 
2.4% 
0.3% 
8.8% 


Memory System F1 
5.3% 
8.0% 
0.5% 
17.1% 


System 
F1 plus one write buffer 
4.5% 
4.9% 
0.4% 
20.9% 
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In the example shown, without external memory write 
buffers, the cache miss read shown stalls for four clock 
cycles while waiting for the write cycle to complete. 
With memory write buffers, the CPU need not wait to 
start the read cycle since the write completed in zero 
wait states. However, since main memory is still occu- 
pied with the original write cycle, the read is still de- 
layed externally while the write completes. The net ef- 
fect is that the read-stalls because of write traffic does 
not decrease; th e write traffic has simply been trans- 
ferred from the CPU bus to the memory bus where it 
has to contend with the next read cycle. 


Note that there will be instances where the addition of 
external write buffers to a cacheless memory system 
does benefit a sequence of bus cycles. This would be the 
case for applications with very low external read traffic 
and large amounts of write traffic. In this case, the 
write buffers do benefit the heavy write traffic while the 
reads on busy writes will be ,alower percentage of total 
stalls. 


3.5 
Viability of Intel486 DX2 System 
without an External Cache 


As shown in this section, the CPU performance of a 
cacheless, 
main-memory-only 
Intel486 
DX2 
CPU 
based system will range from good to fair depending on 
the application. The correct cost-performance 
point 
will dictate the viability of such a product. For the 
Windows and UNIX applications tested, with a good 
memory design, the Intel486 DX2 CPU will get to 


about 120% of the execution time of the ideal zero wait 
state case with the examples shown. The reciprocal of 
total execution time is CPU performance; wh ich works 
out to 83% of maximum in this case. Of course, other 
system design factors will come into play, such as re- 
fresh requirements, other bus master memory require- 
ments, etc. 


More exotic memory architectures 
may improve the 
performance of the cache1ess Intel486 DX2 CPU sys- 
tem design over what has been discussed here. Howev- 
er, the next section will address the more straightfor- 
ward method of increasing CPU performance further: 
adding an external cache. 


An external cache will supplement the on-chip 8K 
cache of the Intel486 DX2 CPU. The requirement for 
an external cache is more important for the Intel486 
DX2 CPU than it was for the Intel486 DX CPU for all 
the reasons discussed in the previous sections. 


Many cache architectures have been implemented with 
the Intel486 DX CPU. Caches differ depending on 
their size, associativity, serial vs. parallel implementa- 
tions, write-through 
vs. write-back policies, etc. This 
section will focus on optimizing the performance of a 
uniprocessing system, i.e. the CPU is the major con- 
sumer of main memory bandwidth. The architectures 
discussed are shown in Figure 4.1. 


4.1 
Overall Effect of an External 
Cache on CPU Performance 


shown in Figures 4.2 through 4.4 for the three applica- 
tions tested earlier. 


The same memory systems tested in the previous sec- 
tion are tested again with a 128K 2-way associative 
write-through 
parallel cache. This will yield the im- 
provement achieved by the decrease in the effective 
number of read and burst wait states. The results are 


The addition of an external write-through cache reduc- 
es the performance degradation caused by main-memo- 
ry wait states for the lead-off cycle of a read and for 
wait states during the remainder of a burst. The impact 
of these wait states was discussed in Section 2.5.3. 
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(Windows) 
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For the UNIX and Windows applications, the addition 
of the external cache improved the execution time by 
15%-35% depending on the memory design. The cache 
used in this case - a 128K two-way set associative cache 
- does an excellent job of buffering the CPU perform- 
ance from the memory system performance. However, 
note that even with the external cache, the execution 
time is sti1112%-18% above the zero wait state case for 
these two applications. This is due to the write perform- 
ance of the memory system since the cache policy in 
this example is write-through. Further improvement on 
the write performance is investigated later in this sec- 
tion. 


For the DOS application, the addition of the cache 
brings the performance of the Intel486 DX2 within 5% 
of the ideal zero wait state case. This is of course due to 
the lower miss rate of the CPU's internal cache and the 
application's low bus utilization. 


4.2 
Effect of Cache Size and 
Associativity 


A 128K two-way-associative, write-through, 
parallel, 


external cache was used in the previous section. As the 
size and associativity of the cache are varied, the CPU 
performance varies. This is shown in Fig 4.5 for the 
memory systems Band F as used earlier (see Table 4.1). 
Both one-way (direct mapped) and two-way set associa- 
tive caches are tested with the SPECI application trace. 


Table 4.1. Memory 
Systems 
for the 
Write-Through 
Cache Test 


Read 
Read 
Write 
Write 


Page 
Page 
Page 
Page 


Hit 
Miss 
Hit 
Miss 


I System B 
4-2-2-2 
8-2-2-2 
3 
6 


I System F 
3-1-2-1 
6-1-2-1 
3 
6 
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Fig 4.6 illustrates the external cache hit rates for CPU read cycles. The hit rates are directly related to the total 
execution time; higher hit rates result in shorter execution times. 


Figure 4.5. Total Execution 
Time for the Intel486 
DX2 CPU 
as a Function 
of Cache Size and Associativity· 
for SPEC1 


4.3 
Improving 
the Performance 
of a 
Write-Through 
Cache 


With a write-through cache, good memory write per- 
formance is necessary to achieve the best possible per- 
formance with the Intel486 DX2 microprocessor. All of 
the methods for improving the write performance for a 
cacheless system, discussed in section 3.4, also apply for 
the write-through cache-based system. 


The previous results for a write-through cache assumed 
a non-pipelined memory system with a page-hit write 
performance of three clocks and a page-miss perform- 
ance of six. The most effective method of increasing the 
memory write performance further is the use of memo- 
ry write pipelining. The write performance can be im- 
proved so that continuous back-to-back page-hit write 
cycles can complete in zero wait-states. Pipelining can 
also reduce the number clocks required for a page-miss 
write cycle. As the write performance improves using 
this technique, the write-through 
cache system can 
come close to that of the ideal zero wait state system. 
These results are shown in Section 4.3.3 to follow. 
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Adding one or more write buffers to a external write- 
through cache-based system improves performance by 
a larger amount compared to the cacheless case. Figure 
4.7 illustrates why. 


The addition of external write buffers allows the memo- 
ry write cycle to be "hidden" from the CPU bus if the 
next CPU cycle happens to be a external cache hit read. 
And since the external cache read hit ratio is high, most 
of the delays which were present in a cacheless system 
under these circumstances are removed. In essence, the 
on-chip cache/write-buffers have been duplicated exter- 
nally to provide a multiple level architecture (see Fig. 
4.8). 
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Figure 4.7. Adding 
External 
Write Buffers 
to an External 
Cache Reduces 
Execution 
Time 


write 
posted 
to memory 
write 
buffer. 
-----. 
MADS# 


writej 
(WAIT) 


(WAIT) 


MBRDY# 


• 


intel~ 


External 
Write 
Buffers 


4.3.3 
PERFORMANCE 
WITH 
AN EXTERNAL 
WRITE·THROUGH 
CACHE 
Fig. 4.9 shows the results of the memory systems tested 
with a l28K, two-way associative, write-through, paral- 
lel cache and the Intel486 DX2 CPU using the SPECI 
application trace. 
To quantify the benefits of improving the write per- 
formance, the systems in Table 4.2 were tested. 


Table 4.2. Memory 
Systems 
Used 
for Write-Through 
Cache Test 


Read 
Read 
Write 
Write 
Write 
Pg 
Pg 
Pg 
Pg 
Method 
Hit 
Miss 
Hit 
Miss 


SystemB1 
4-2-2-2 
8-2-2-2 
3 
6 
Normal 
System 82 
4-2-2-2 
8-2-2-2 
3 
6 
One buffer 
System 83 
4-2-2-2 
8-2-2-2 
2 
6 
Pipelined 
System 84 
4-2-2-2 
8-2-2-2 
2 
5 
Pipelined 


System F1 
3-1-2-1 
6-1-2-1 
3 
6 
Normal 
System F2 
3-1-2-1 
6-1-2-1 
3 
6 
One buffer 
System F3 
3-1-2-1 
6-1-2-1 
2 
6 
Pipelined 
System F4 
3-1-2-1 
6-1-2-1 
2 
5 
Pipelined 


As the write performance increases, the CPU perform- 
ance approaches that of the zero wait state case. The 
improvement from systems Bl to B2 and from Fl to F2 
illustrate the benefit of write buffering with an external 
cache. The improvement from systems B2 to B3 and 
from F2 to F3 reflect the benefit of memory write pipe- 
lining. Finally, the improvement from systems B3 to B4 
and from F3 to F4 show how reducing the page-miss 
write performance also increases performance. 


If correctly implemented, a write-back external cache 
can provide good performance for a uniprocessing In- 
tel486 DX2 
CPU 
based system. 
Serial write-back 
caches have typically been used to reduce bus utiliza- 
tion for multiprocessing systems. The design complexi- 
ty of a write-back cache controller is typically an order 
of magnitude higher than for a write-through 
cache 
controller. However, correct implementation 
is abso- 
lutely necessary if significant performance gains are to 
be realized with the Intel486 DX2 CPU. 


A write-back cache is different from a write-through 
cache in that it allows cache write hits to modify the 
cache line without updating main memory. The cache 
has tags that include a bit called the modified (dirty) 
bit. This bit is set if the cache location has been written 
with new information and therefore contains informa- 
tion that is more recent than the corresponding infor- 
mation in main memory. If a subsequent read miss oc- 
curs and the line being fetched needs to fill the cache 
location that is currently being occupied by the modi- 
fied line, the cache controller must then write the modi- 
fied cache line back to main memory; hence coherency 
is maintained. 


If a CPU write is not a cache hit, the cache controller 
has the option of allowing the write to propagate 
through to memory or to fetch the cache line from 
memory to be merged with the new write data. The 
cache line fill in the second option is called a write-allo- 
cation. In the following discussions, it is assumed that 
no write-allocations are being performed. 


4.4.1 
MAIN 
MEMORY 
CONTROLLER 
CONSIDERATIONS 


The addition of an external write-back cache changes 
the characteristics 
of the main memory bus traffic. 
Since the cache effectively filters all CPU requests, the 
cycles that do propagate to main memory tend to be 
more distributed 
in their locations. This decrease in 
temporal and spatial locality will reduce the DRAM 
page hit rate as shown in Table 4.3 for a l28K, two-way 
associative, write-back cache with the SPEC 1 applica- 
tion trace. Compare these results to the prior results in 
Table 2.1 for a cacheless system. 


Table 4.3. Page Hit Ratios for a 
Write-Back 
Cache - for SPEC1 


MEMORY 
SPEC1 
PGMK 
TURBOC 
CYCLES 
(100%) 


Reads: 
Page Hits 
17.1% 
25.6% 
24.7% 
Page Misses 
13.8% 
13.9% 
12.9% 


Writes: 
Page Hits 
58.9% 
55.8% 
40.8% 
Page Misses 
10.2% 
4.7% 
21.6% 


Therefore, it is less beneficial with a write-back cache to 
implement a page-mode main memory controller. 


Of course, page mode DRAM accesses within the burst 
cycle are still important to retrieve the four words of a 
cache line quickly. This is also true for the Write-back. 
cycle where four dwords of the cache line must be writ- 
ten to memory. Memory controllers should be designed 
to support a burst write cycle instead of having to write 
each dword separately. 


The write-back cycle is the sequence where a cache line 
fill from main memory has to displace a modified line 
that was already in the cache. The method in which the 
modified line is written back to main memory has an 
impact on overall CPU performance. Before analyzing 
the write-back cycle, consider first the architectures 
shown in Fig. 4.10. 


In the simplest implementation, a write-back cache will 
share the data bus with the CPU and main memory as 
shown in configuration X. If this is the case, then dur- 
ing a write-back cycle, the modified line must be writ- 
ten back to main memory before the cache linefill can 
commence. This has a detrimental effect on perform- 
ance since the CPU must wait while the write-back oc- 
curs. This sequence is shown in Figure 4.11. 


With a data path device between the CPU-Cache bus 
and main memory as shown in configuration Y, the 
cache controller is able to defer the write-back of the 
modified data till after the linefill has completed. The 
CPU can continue execution after the linefill as long as 
subsequent cycles are all cache hits. 


In configuration Z, a wider cache bus exists between 
the SRAM and the data path devices. This allows the 
modified data to be transferred more quickly from the 
SRAM to the data path device, thereby allowing the 
cache linefill to commence even sooner. 
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The following systems are used to demonstrate Intel486 
DX2 microprocessor performance with different cache 
sizes and associativities. 


The results are shown in Fig. 4.12 for the Intel486 DX2 
CPU running the SPECI trace. 


Reads 
Writes 
Write-Back 
Method 
(described 
above) 


System A 
5-1-1-1 
4-1-1-1 (burst) 
Concurrent 
Write Back 
System B 
6-3-3-3 
4-4-4-4 (non-burst) 
Concurrent 
Write Back 
System C 
6-3-3-3 
4-4-4-4 (non-burst) 
Delayed Line Fill 


, 
1~:1 
., . 
..... 
--t-c 
---.-.-. , 


114.00% 
.... 
112.00% 
" 
0~ 
110.00% 
oS 
Vl 


'0 
108.00% 
~ 
e 
106.00% 
.. 
N 
2.. 
104.00% 
.~ 
"0 
102.00% 
.. 
It: 


100.00% 
64/1 


.....,.,.------ 
., 
....:. ...• 


. ~.-----...- .. ~ 
............ 
-.~-:--.I"'.-_- .. ----. 


128/1 
128/2 
Cache Size/Associativity 


Figure 4.12. Intel486™ 
DX2 CPU Total Execution 
Time with Different 
Cache 
Size, Associativity, 
Memory 
Speed and Write-Back 
Method - for SPEC1 


The addition of a write-back cache does an excellent job 
of decoupling the CPU performance from the main 
memory performance as shown with memory systems 
A and B. However, note that memory system B (with 
the delayed line fill) performs poorly - even worse than 
a good write-through 
cache - unless a significant 
amount of cache memory is added to reduce the miss 
rate. 


This document has shown that good memory perform- 
ance is especially important for the Intel486 DX2 mi- 


croprocessor. Business workstation designs will require 
excellent CPU performance and will consequently have 
to incorporate well-designed, high-performance cache 
and memory systems. 


In optimizing memory performance, an external cache 
is essential for hiding slow main memory access times. 
Write-through external caches offer good'performance 
if coupled 
with 
good memory 
write 
performance. 


Write-back external caches can also offer excellent per- 
formance 
if designed correctly. 
Parallel 
write-back 
caches that cannot defer the write-back cycle till after a 
cache line fill will perform worse than a good write- 
through cache design. 
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Since the introduction of the Inte1386TMSL microproc- 
essor 
and 
the 
subsequent 
introduction 
of 
the 
Intel486™ 
SL microprocessor, 
the SL Architecture 
has become the de facto standard for mobile computers. 
Given the industry acceptance of SL Architecture, Intel 
is extending the SL Architecture to the SL Enhanced 
Intel486 microprocessor family. This application note 
describes how the same features of the SL Architecture 
can be implemented 
on the SL Enhanced 
Inte1486 
CPUs. Although this application note is written for 
people with experience designing Inte14'86 SL CPU- 
based mobile computers, the information provided in 
this document will also be useful for anyone interested 
in learning more about the SL Enhanced 
Intel486 
CPUs. 


The first section of this document highlights the differ- 
ences between the Intel486 SL CPU and the SL En- 
hanced Inte1486 CPU. Section two describes the archi- 
tectural 
differences 
in 
System Management 
Mode 
(SMM). Section three discusses power management fea- 
tures of the Intel486 SL CPU and the SL Enhanced 


Intel486 CPU. Section four explains reset implementa- 
tion of the Intel486 SL CPU and the SL Enhanced 
Inte1486 CPU. 


1.0 
COMPARISON OF THE SL 
ENHANCED Intel486™ CPU AND 
Intel486 SL CPU 


The SL Enhanced Intel486 CPU supports many of the 
features available in the SL Architecture. 
The major 
difference between the SL Enhanced Intel486 CPU and 
the Intel486 SL CPU 
is level of integration. 
The 
Inte1486 SL CPV is a highly integrated 
CPU with 
memory controller, ISAlPI-bus 
controller, and power 
management built into it. The SL Enhanced Intel486 
CPU has retained all the SMM and power management 
features from the SL Architecture. 
Features not sup- 
ported by the SL Enhanced Inte1486 CPU can easily be 
implemented by external hardware. Table I highlights 
the differences between the SL Enhanced Intel486 CPU 
and the Inte1486 SL CPU. 


Features 
SLEnhanced 
Intel486 
SL 
Intel486 
CPU 
CPU 


System Management 
Mode 
Yes 
Yes 


5MBASE 
Relocation 
Yes 
No 


Stop Clock 
Yes 
Yes 


Upgrade 
Power- 
Yes 
No 
OownMode 


Package Options 
168 lead PGA, 196 lead PQFP, 
196 lead PQFP, 208 lead SQFP 
208 lead SQFP 


3.3V Operation 
Yes 
Yes 


Clocking 
Options 
1X clock input or 2X clock input 
2X clock input 


CPU Frequency 
Intel486 SX CPU: 25 MHz, 33 MHz 
25 MHz, 33 MHz 
Intel486 
OX CPU: 33 MHz, 50 MHz 
Intel486 
OX2 CPU: 40 MHz, 50 MHz, 66 MHz 


2.0 
SYSTEM 
MANAGEMENT 
MODE 
IMPLEMENTATION 


System Management Mode (SMM), first introduced in 
the SL Architecture for notebook computers, provides 
a unique environment for software to perform power 
management 
functions much more efficiently. Since 
then, SMM has found its way into many new applica- 
tions. The SMM hardware interface on the SL En- 
hanced Intel486 CPU is similar to that of the Intel486 
SL CPU except for the handshaking protocol. The SL 
Enhanced 
CPUs 
handshake 
through 
the SMI and 
SMIACT# 
signals (see Figure I), and the Intel486 SL 
CPUs handshake through the SMI and SMRAMCS# 
signals. 


SMIACTII" 


CPU 
SMII 
,- 
SYSTEM 


LOGIC 


The system interrupts the normal program execution 
and invokes SMM by generating a System Management 
Interrupt 
(SMI#) 
to the CPU. On the Intel486 SL 
CPU, the SMI# input is held low as long as the CPU is 
in SMM. With the SL Enhanced Intel486 CPU, SMI # 
input only needs to remain active for a single clock 
provided the SMI setup and hold times, t20 and t21, 
are met. SMI # will also work correctly if it is held 
active for an arbitrary number of clocks. 


NOTE: 
A: Setup time for recognitionon I/O instructionboundary 


For Intel486 SL processor-based systems, the 82360SL 
I/O generates the SMI request. For any SMI to be rec- 
ognized by the SL Enhanced Intel486 CPU, the system 
logic must ensure all of the required timings are met. 
The following sections discuss the timing requirements 
that must be observed by the SMI generation logic in- 
terfacing to the SL Enhanced Intel486 CPU. 


I/O trapping has proven to be very useful in the SL 
Architecture for device power management. Trapping 
the last I/O access prior to entering SMM prevents the 
CPU from accessing a powered-down device. With the 
exception 
of 
the 
SMFILO 
(System 
Management 
• 
FILO), the SL Enhanced CPU supports the same I/O 
Instruction Restart option under SMM featured in the 
Intel486 SL CPU. 


The I/O 
Instruction 
Restart feature of the SL En- 
hanced Intel486 CPU is used the same way as the I/O 
Instruction 
Restart feature of the Intel486 SL CPU. 


When the I/O Instruction Restart option is enabled (by 
setting offset OFFOOHin the SMRAM to OFFH), the 
RSM instruction microcode modifies the restored EIP 
to point to the instruction immediately preceding the 
SMI # request, so that the I/O instruction can be re-ex- 
ecuted. For the CPU to trap the last I/O accesscorrectly, 
the external hardware must ensure the SMI # signal is 
asserted at least three CPU clock periods prior to assert- 
ing the RDY# 
signal (seeFigure 2). 


For back-to-back SMIs, the SMI # input must be held 
inactive for at least four clocks after it is de-asserted to 
reset the edge-triggered SMI detection logic. Otherwise, 
the second SMI'IF request may not be recognized (see 
Figure 3). 


A new pin called SMIACT# 
(SMI ACTive) which in- 
dicates that the CPU is operating in SMM has been 
added to the SL Enhanced Intel486 CPU. The CPU 
asserts SMIACT# 
in response to an SMI interrupt re- 
quest on the SMI # pin. SMIACT # is driven active by 
the CPU before accessing the SMRAM. SMIACT # 
remains active until the last access to SMRAM when 
the CPU restores (reads) its state from SMRAM. After 
the RSM instruction is executed, the CPU de-asserts 
the SMIACT# 
signal. 


The 
SMIACT # 
signal 
is 
equivalent 
to 
the 
SMRAMCS# 
signal on the Intel486 SL microproces- 
sor except that SMIACT # on the Intel486 SL CPU is 
active all the time and cannot be used as a chip select 
signal for external SMRAM. On the Intel486 SL micro- 
processor, the SMRAM is enabled automatically when- 
ever the CPU is switched into SMM. A similar mecha- 
nism can also be implemented by using the SMIACT# 
signal. Whenever the SMIACT# 
signal is active, the 
SMRAM will be enabled by the system logic. 


As previously mentioned, one of the many uses of the 
SMIACT# 
output 
is to enable SMRAM when the 
CPU is operating 
in SMM. Most importantly, 
the 
SMIACT# 
output is used by the system logic to main- 
tain system integrity while the CPU is in SMM. 


If part 
of the system memory 
is overlaid by the 
SMRAM while the CPU is in SMM, the system logic 
should ensure that only the CPU and SMI handler have 
access to the SMRAM 
area. Accesses to addresses 
overlaid by a bus master or DMA controller when 
SMIACT # is active should be re-directed to the system 
memory 
underneath 
the 
SMRAM 
and 
not 
the 
SMRAM itself. 


While inside SMM, the CPU should be protected from 
system activities such as CPU RESET, interrupt 
re- 
quests, and NMI, etc. The SMIACT# 
can be used by 
the system logic to block off these system activities 
while the CPU is in SMM. 


SMRAM resides in a unique address space so that the 
software operating under SMM is transparent 
to the 
normal address space. On the Intel486 SL microproces- 
sor, the size of SMRAM can be either 32 Kbytes or 
64 Kbytes. Depending on the size of the SMRAM, the 
SMRAM 
area can be located 
in either 
38000H- 
3FFFFH or 30000H-3FFFFH. 


On the SL Enhanced Inte1486 CPU, the size of the 
SMRAM can be between 32 Kbytes and 4 Gbytes. The 
location of the SMRAM is determined by the 5MBASE 
(SMRAM BASE ADDRESS) register, and defaults to 
5MBASE + 8000H, which is 38000H after CPU RE- 
SET. The first SMI after a CPU RESET always begins 
executing instructions at 38000H. 


8MI# 
Sampl 
¥' 


Second 
8MI# 
\.....L.-.r 


The SL Enhanced 
Intel486 
CPU family provides 
a new 
control 
register, 
5MBASE 
for changing 
the SMRAM 
base address 
(see Figure 
4). The SMRAM 
base address 
can be changed 
after CPU RESET 
by invoking 
a dum- 
my SMI call to change 
the 5MBASE 
register. 


o 


====================1 
Register offset 7EF8H 
_____ 
~ 
~I 


SMRAM Base Address 


241810-4 


In the SL Enhanced 
Intel486 
CPU, 
a new slot is added 
to the 
CPU 
dump 
area 
inside 
the 
SMRAM 
at offset 
7EF8H 
for changing 
the SMRAM 
base address. 
Dur- 
ing the execution 
of the RSM instruction, 
if the reloca- 
tion bit is set, the CPU 
will read this slot and initialize 
the CPU to use the new 5MBASE 
during 
the next SMI. 
From 
then on, the CPU 
will do its context 
save to the 
new SMRAM 
area 
pointed 
to by the 5MBASE, 
store 
the 
current 
5MBASE 
in the 
SMM 
Base 
slot 
(offset 
7EF8H), 
and then 
execute 
the new jump 
vector 
based 
on the current 
5MBASE. 


5MBASE 
must 
start 
at a 32K aligned 
boundary. 
Pro- 
gramming 
the 5MBASE 
register 
to values 
that are not 
32K aligned 
will cause the CPU 
to enter 
the shutdown 
state 
when 
executing 
the RSM 
instruction. 
After 
the 
SMRAM 
base address 
is changed, 
the new starting 
ad- 
dress 
for the SMI jump 
vector 
is calculated 
by adding 
8000H 
to the new SMRAM 
base address. 
The starting 
address 
for CPU 
state dump 
area will be remapped 
to 
the new SMRAM 
base address 
plus OFFFFH. 


A new bit (bit 17) has been added 
to the SMM Revision 
Identifier 
on the SL Enhanced 
Intel486 
CPU 
to indi- 
cate 
whether 
the processor 
supports 
relocation 
of the 
SMRAM 
base address. 
With the SL Enhanced 
Intel486 
CPU, 
the 5MBASE 
relocation 
bit is always 
set to one 
to indicate 
the processor 
supports 
5MBASE 
relocation. 


Since 
the 
memory 
controller 
is embedded 
inside 
the 
Intel486 
SL CPU, 
many 
design 
issues 
with 
SMRAM 


interface 
are handled 
internally 
by the CPU. For the SL 
Enhanced 
Intel486 
CPU, 
these issues must be handled 
by the external 
system 
logic interfacing 
to the CPU. 


Before 
the 
CPU 
can 
execute 
code 
inside 
SMM, 
the 
SMRAM 
must be loaded 
with valid SMM 
code. If the 
SMRAM 
is not initialized 
with 
code prior 
to entering 
SMM, 
executing 
invalid 
code out of the SMRAM 
can 
place the CPU in an unknown 
state. Thus, 
the external 
memory 
controller 
must provide 
a mechanism 
to bring 
the SMRAM 
into system 
address 
space without 
invok- 
ing SMM. 
This 
will allow 
software 
such 
as BIOS 
to 
load the SMM code into SMRAM. 


The Intel486 
SL CPU provides 
a hardware 
mechanism 
• 
to access memory 
overlaid 
by SMRAM. 
Although 
sys- 
tem logic 
is not 
required 
to provide 
a mechanism 
to 
access memory 
located 
underneath 
SMRAM, 
it may be 
much 
easier 
to implement 
a suspend 
state 
(O-volt sus- 
pend or 5-volt suspend) 
if such a mechanism 
is provid- 
ed. 


2.3.2.2 
Cache 
Coherency 


Since the Intel486 
SL CPU 
does not support 
a second 
level cache, 
cache coherency 
with SMRAM 
is handled 
completely 
inside the CPU. 
The CPU's 
internal 
cache 
is automatically 
emptied 
before 
entering 
SMM 
and af- 
ter exiting 
SMM. 


The 
SL 
Enhanced 
Intel486 
CPU 
does 
not 
flush 
its 
cache 
before 
entering 
SMM 
or 
after 
leaving 
SMM. 


Cache flushing 
is not required 
if the SMRAM 
is located 
in a non-cacheable 
area in the memory 
address 
space or 
in an external 
address 
space which 
is not visible to the 
system. 
If the SMRAM 
is located 
in a cacheable 
area 
that 
overlays 
system 
memory, 
both 
the CPU 
internal 
cache and any second 
level caches 
must be flushed 
be- 
fore entering 
SMM. 
If SMRAM 
is cacheable, 
the CPU 
internal 
cache and any second level caches must also be 
flushed 
when 
exiting 
SMM. 
The following 
steps 
must 
be taken by the system logic to maintain 
cache coheren- 
cy when SMM 
overlays 
normal 
system 
memory: 


I. 
Before entering 
SMM, 
the FLUSH 
# pin should 
be 
asserted 
when SMIACT# 
is driven 
active to empty 
the CPU 
cache. 


2. The KEN # pin must be driven 
inactive 
to stop ac- 
cesses to the SMRAM 
area from filling in the cache 
line if SMRAM 
is not cacheable. 


3. Upon leaving SMM, if SMRAM is cacheable, the 
CPU cache is emptied by asserting the FLUSH # 
pin within one CPU CLK after the SMIACT # pin 
is de-asserted. 


It is the responsibility of the system logic to ensure that 
the setup and hold times for FLUSH # and SMIACT # 
signals are met. 


Like the Intel486 
SL processor, 
the SL Enhanced 
Intel486 CPU empties its internal write buffers before 
entering SMM to prevent data in the write buffers from 
being written to SMRAM space. If a system supports a 
second level cache, the second level write buffers must 
also be emptied before the CPU enters SMM. It is pos- 
sible that the CPU is in SMM before the second level 
write buffers are completely emptied by the memory 
controller. In case the second level write buffer is not 
completely emptied, the SMIACT # signal can be used 
to direct the memory write cycles to either SMM space 
or memory space. 


The A20M # pin on the Intel486 CPU is provided to 
emulate the address wraparound at the I Mbyte bound- 
ary which occurs on the 8086 microprocessor (see Fig- 
ure 5). The SMRAM space on the Intel486 SL CPU is 
always below I Mbyte memory address space. Memory 
above I Mbyte can either be accessed through 
the 
ISAWINDOW register or the MCWINDOW register. 
The A20M # signal is automatically driven low when- 
ever the CPU is in SMM. When A20M # is active, all 
external bus cycles will drive A20 low, and all internal 
cache accesses will be performed with A20 low. 


The SL Enhanced Intel486 CPU does not provide any 
memory mapping mechanism to access memory above 
I Mbyte. To access memory above I Mbyte inside 
SMM, the software has to disable the A20M # manual- 
ly through 
the 
keyboard 
controller. 
Also, 
if the 
SMRAM is located above I Mbyte and A20M # is not 
enabled before entering SMM, the system will crash. In 
this case, the CPU will attempt to access SMRAM at 
the relocated address with A20 low, and will thus fetch 
invalid code. 


For these reasons, the A20M # should be driven inac- 
tive prior to entering SMM and remain inactive as long 
as the CPU is in SMM. This can be accomplished by 
blocking 
the 
assertion 
of 
A20M # 
whenever 
SMIACT # is active. The state of the A20M # should 
be saved upon entry to SMM and restored to its origi- 
nal state after leaving SMM. 


When the CPU is running in SMM, the processor is in 
a pseudo "real mode" environment, but without the 
64 Kbyte limit. After the SMRAM base address regis- 
ter has been relocated, the CPU segment registers will 
have values shown in Table 2 when an SMI # occurs. 
The CS selector register still contains the value 3000H, 
not the value corresponding to the new 5MBASE 
The 
rest of the registers are still initialized to zero. 


If the SMRAM base address has not been relocated, the 
segment registers can be initialized in the same way as 
with the Intel486 SL processor, i.e., using the CS regis- 
ter which defaults to 3000H. Otherwise, the segment 
registers must be initialized correctly to point to the 
new SMRAM memory space. 


Normally, the data segment registers are initialized to 
point to the SMRAM base address. Upon entering 
SMM, the CS BASE segment register is initialized to 
point to the SMRAM base address. The location of the 
SMRAM base address can be determined by reading 
the 
5MBASE 
register 
in 
the 
SMRAM 
at 
offset 
OFEF8H (the location of the SMRAM base address can 
also be stored in another memory location such as 
CMOS RAM by the BIOS which can be retrieved by the 
SMM program). 


The 5MBASE contains a 32-bit address and has to be 
shifted to the right by four bits to generate a l6-bit 
segment address before it can be placed in the data 
segment selector registers. The CS selector register can- 
not be initialized by writing directly to it. It has to be 
initialized by executing a far jump instruction to an 
address within the SMRAM to force the CS selector 
register to point to the SMRAM base address. 


When the CPU is in SMM, the operand size and the 
address size are still 16 bits but there are no limits to 
segment size. The physical address of an instruction is 
obtained by adding the value in OS segment base regis- 
ter to the value in EIP register, rather than the IP regis- 
ter. To access data anywhere within the four Gbyte 
logical address space, operand-size override (opcode 
66H) and address-size override' (opcode 67H) prefixes 
can be used as needed. Alternatively, SMRAM data 
located above I Mbyte can also be accessed by using 
32-bit displacement registers. 


Segment 
Selector 
Base 
Limit 
Register 


CS 
3000H 
5MBASE 
4 Gbytes 
OS 
OH 
OH 
4 Gbytes 
ES 
OH 
OH 
4 Gbytes 
FS 
OH 
OH 
4 Gbytes 
GS 
OH 
OH 
4 Gbytes 
SS 
OH 
OH 
4 Gbytes 


One of the most important power management features 
on the Intel486 SL processor is CPU clock control. The 
clock control scheme on the SL Enhanced Intel486 
CPU is similar to the Intel486 SL CPU, with the 
Intel486 SL CPU being driven by a 2X clock, and the 
SL Enhanced Inte1486 CPU available with both the IX 
and 2X clocking options. 


The 2X clock input is twice the internal frequency of 
the CPU, whereas the IX clock frequency is the same 
internal frequency of the CPU. With the IX clock, the 
two internal clock phases, "phase one" and "phase 
two", are generated by an internal Phase Lock Loop 
(PLL). The CPU clock input for a IX clock cannot be 
changed dynamically because the PLL requires a con- 
stant frequency CLK input (to within 0.1 %). 


As with the Intel486 SL CPU, 
the SL Enhanced 
Intel486 
CPU 
provides 
an 
interrupt 
mechanism, 
STPCLK #, which allows system hardware to control 
the power consumption of the CPU by stopping the 
internal clock to the CPU. Unlike the normal inter- 
rupts, INTR and NMI, the STPCLK # interrupt does • 
not initiate interrupt acknowledge cycles or interrupt 
table reads. 


The Stop Clock feature on the SL Enhanced Intel486 
CPU has been improved, allowing the input to the 
STPCLK # to be driven asynchronously as well as syn- 
chronously. The major difference between asynchro- 
nous and synchronous control is that the STPCLK # 
interrupt latency is much shorter with asynchronous 
control. 


With the Intel486 SL CPU, the STPCLK# 
input is 
controlled 
asynchronously 
through 
software. 
The 
STPCLK # is asserted after doing a dummy 110 read 
to the STPCLK register in the 82360SL or executing an 
HLT instruction. The STPCLK # signal will remain 
asserted until a system event wakes up the CPU. If the 
STPCLK # input is driven asynchronously, both setup 
and hold times t20 and t21 must be met for the 
STPCLK # interrupt request to be recognized. 


After a STPCLK # interrupt request is recognized by 
the CPU, the processor will stop execution on the next 
instruction boundary, stop the pre-fetch unit, and then 
empty all internal pipelines and the write buffers. Final- 
ly, a special Stop Grant bus cycle is generated. The pin 
state during a Stop Grant cycle is shown in Table 3. 


The interrupt acknowledge cycle is terminated when 
the system logic returns RDY # or BRDY #. At this 
point the CPU is in the Stop Grant state and the inter- 
nal clock is stopped. The Stop Grant cycle is similar to 
the HALT cycle except that the address bus has the 
value 04H instead of OOH. 


Signals 
State 


M/IO# 
0 


D/C# 
0 


W/R# 
1 


Address Bus 
0000 0010H (A4 = 1) 


BE3#-BEO# 
1011 (same as HALT) 


Data bus 
Floated 


Using the STPCLK # input, the SL Enhanced Intel486 
CPU can be put into low power states similar to the 
Global Standby and Suspend states as with the Intel486 
SL microprocessor. 


In an Intel486 SL processor-based system, the 82360SL 
puts the CPU in a low power standby state (CPU lee 
- 20 mA-50 mA) when the system is in Global Stand- 
by. A similar state called Stop Grant State is provided 
by the SL Enhanced Intel486 CPU. The Stop Grant 
state can be entered by simply asserting the external 
STPCLK # interrupt pin. Once the STPCLK # inter- 
rupt is acknowledged by the CPU (i.e., after the Stop 
Grant cycle is placed on the bus), the CPU is in the 
Stop Grant state. 


While in the Stop Grant state, the CPU still responds to 
RESET or SRESET and requests a cache invalidation 
(i.e., HOLD, AHOLD, BOFF# 
and EADS#). 
How- 
ever, the CPU does not recognize any other inputs 
while in the Stop Grant state. Input signals to the CPU 
will not be recognized until one CPU clock cycle after 
STPCLK # is deasserted. 


To emulate Global Standby, the stop clock control log- 
ic must be able to de-assert the STPCLK # 
signal 
whenever there is system activity (i.e., INTR, 
IRQ, 
NMI, and SMI#). 
Typically, the stop clock de-asser- 
tion logic is implemented by logic which latches the 
incoming interrupt requests from the system. The CPU 
returns to its normal state within 10-20 CPU clock 
cycles after exiting the Stop Grant state. 


As mentioned before, the CPU does not recognize any 
interrupt request while the STPCLK # input is active. 
To prevent the interrupt request from getting lost, the 
interrupt request logic must ensure the interrupt signal 
is held active for at least one CPU clock after the 
STPCLK # input is de-asserted. 


From the Stop Grant state, the CPU can go into a 
lower power state similar to the suspend state offered in 
the Intel486 SL processor. After the CPU is in the Stop 
Grant state, the CPU can enter the lowest power Stop 
Clock state (-100 J.1A-2oo J.1A)by stopping the CPU 
clock input. The CPU clock input can be driven to 
either logic high or logic low during Stop Clock state. 
The CPU will not generate any acknowledge cycle 
when entering stop clock state. 


For a 2X clock input, the clock input to CLK2 can be 
stopped on either a logic high or logic low independent 
of the clock phase. The CPU will go back to Stop Grant 
state as soon as the CPU clock is re-started. Upon exit 
from Stop Clock State, the CPU clock input must be re- 
started in the same state when it was stopped (see Fig- 
ure 6). 
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For a CPU with a IX clock input, the CPU clock can 
be stopped in the same manner as a CPU with a 2X 
clock input. Because of the phase lock loop, the CPU 
will not return to the Stop Grant state right after the 
CPU clock input has been re-started. To allow time for 
the PLL to stabilize, the CPU clock input must be held 
at a constant frequency for a period of time equal to the 
PLL startup latency (as specified in the data book) be- 
fore the CPU will return to the Stop Grant state. 


As long as the CPU clock input is stopped, the system 
logic must keep all the CPU input signals in the same 
state before the clock was stopped. Any change in state 
on an input signal (except for INTR) before the CPU 
has returned to the Stop Grant state will result in un- 
predictable behavior. The CPU will not be able to rec- 
ognize any interrupt request while the CPU clock is 
stopped. 


3.2.1.1 
Dynamic 
Clock Switching 


For a CPU clock with a IX clock input, the CPU clock 
cannot be changed "on-the-fly". 
For power manage- 
ment as well as implementation of features such as de- 
turbo mode, it is advantageous to run the CPU clock at 
a lower frequency. This can be accomplished by putting 
the CPU into Stop Clock state and change the CPU 
clock to a lower speed. After the CPU clock input fre- 
quency is changed, the clock control logic must ensure 
that the clock input has been running at a constant 
frequency for the time period necessary for the PLL to 
stabilize before de-asserting the STPCLK # signal. The 
lowest CPU clock rate for a IX part is 8 MHz (see 
Figure 7). 


3.2.1.2 
Power 
Consumption 


The Stop Grant and Stop Clock states are designed to 
save power. While the processor is in Stop Grant state, 
the input/output 
signals on the CPU remain at the 
same state when entering the Stop Grant state, floated 
(data and parity signals), or driven to a different state. 
Ifsome of the signals are driven improperly, the system 
can end up consuming more power. 


To achieve the lowest power consumption, all the possi- 
ble current leakage must be eliminated. The system log- 
ic should never drive the input signals with pull-up re- 
sistors LOW and input signals with pull-down resistors 
HIGH. While in the Stop Grant state, the pull-up resis- 
tors on STPCLK# 
and UP# 
are disabled internally. 
The system must continue to drive these inputs to the 
state they were in immediately before the CPU entered 
the Stop Grant state. For minimum CPU power con- 
sumption, all other input pins should be driven to their 
inactive level while the CPU is in the Stop Grant state. 


The STPCLK # input is an asynchronous signal. The 
system can crash if the interface to the STPCLK # is 
not designed properly. Special care must be taken to 
ensure that all the timing requirements are met and the 
proper protocol is used. Listed below are some design 
considerations that should be considered when design- 
ing the STPCLK # interface. 


• The CPU cannot empty the write buffer during an 
HLDA cycle. Therefore, the CPU will not acknowl- 
edge any STPCLK # request during an HLDA cy- 
cle. 
• 


• After the STPCLK IF is asserted, the CPU does not 


generate a Stop Grant cycle until it completes the 
current 
instruction. 
The 
latency 
between 
a 
STPCLK IF request and the Stop Grant bus cycle 
depends on the current instruction, the amount of 
data in the CPU write buffers, and the system mem- 
ory performance. 


• The CPU will not enter the Stop Grant state until 


either RDY IF or BRDY IF has been returned. 


• In response to HOLD being driven active during the 
Stop Grant state (when the CLK input is running), 
the CPU will generate HLDA and three-state all 
output and input/output 
signals that are three-stat- 


ed during the HOLD/HLDA 
state. After HOLD is 


de-asserted all signals will return to the state they 
were in prior to the HOLD/HLDA 
sequence. 


• When the CPU enters the Stop Grant state, the in- 


ternal pull-up resistor is disabled so that the CPU 
power consumption is reduced. The STPCLK IF in- 
put must be driven high (not floated) in order to exit 
the Stop Grant state. 
• It is the responsibility of the system designer to en- 


sure that the CPU is in the correct state prior to 
asserting cache invalidation or interrupt signals to 
the CPU. 


On a standard PC, the CPU can be reset by either hard- 
ware or software. On the SL Enhanced Intel486 CPU, 
asserting the RESET input to the CPU will also set the 
5MBASE register to the default value of 30000H. In 
other words, the SMRAM base address will reset to 
30000H whenever the operating 
system asserts the 


CPU RESET signal. For some older software, a CPU 
RESET is generated by the software to return the CPU 
to real mode from protected mode. 


Normally, this is not a problem if 5MBASE relocation 
is not used. If the SMRAM base address has been relo- 
cated, the CPU could be executing invalid SMM code 
from the address. The SRESET pin has the same func- 
tions as RESET 
except that 
it does not reset the 


5MBASE register. 


For a system which uses 5MBASE relocation, the logic 
which generates the software CPU RESET must be tied 
to the SRESET input and not the RESET input on the 
CPU. 
All hardware 
resets should 
be implemented 


through the RESET pin (see Figure 8), and all software 
resets should be implemented through 
the SRESET 


pin. 


While inside SMM, the CPU should be protected from 
being reset by a software CPU 
RESET. 
SRESET 


should be blocked whenever the SMIACT IF is active. 
Any request for a CPU RESET when the CPU is in 
SMM should be latched so it can be serviced after the 
CPU exits SMM. To ensure the execution of the RSM 
instruction does not get interrupted by the SRESET, 
the SRESET must be blocked until at least 20 CPU 
clock cycles after SMIACTIF has been driven inactive. 


0 


Q 
RESET 


KBD 
OR 
SORWARE 
CPU 
SRESET 
RESET 
LOGIC 


The system designer should consider the following re- 
strictions while implementing the CPU Reset logic: 
• For SRESET to be recognized by the CPU, the 
SRESET input must be driven active (high) for at 
least 15 CPU clock cycles. 


• The SRESET is not intended to' be used for flushing 
the on-chip cache. For compatibility with future 
generation 
Intel 
CPUs, 
the SRESET 
input 
pin 
should not be used for flushing the on-chip cache. 
• Hardware resets should not be blocked when the 
CPU is in SMM so that the system can recover from 
a fatal system failure. 


While taking advantage of the benefits of the SL Archi- 
tecture, the SL Enhanced Intel486 CPU family pro- 
vides a whole new world of opportunity for system de- 
signers to develop innovative, energy-efficient mobile 
and desktop designs. The SL Enhanced Intel486 micro- 
processor family combines power management, com- 
patibility and performance, allowing system designers 
to build a wide variety of machines to meet the diverse 
needs of a broad range of users. 


Intel's SL Architecture: Designing Portable Applications, 
1993, McGraw-Hill. 


Intel486 MicroprocessorFamily Data Book Addendum: 
SL Enhanced Intel486 Microprocessor Family, 
1993, 
Intel Corporation. 
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Intel's System Management Mode (SMM), introduced 
as part of the Intel SL technology, has become an in- 
dustry standard for portable computing. Through the 
utilization of SMM, system designers have a new meth- 
od of adding software controlled features that operate 
transparently to the operating system and applications 
software. In portable computer systems, SMM is often 
used to implement the power control on various system 
components to conserve power consumption. Flexible 
clock control has also become essential to the design 
of power-saving computers. The new SL Enhanced 
Intel486TM microprocessor family incorporates all of 
the best power management features which first ap- 
peared in Intel SL technology, bringing Intel486 CPU 
performance to portable computers and energy-efficient 
desktop systems. The purpose of this application note is 
to provide system designers with a better understanding 
of the theory and implementation of power manage- 
ment with the new SL Enhanced Intel486 CPUs. 


2.0 
POWER MANAGEMENT 
FEATURES 


• System Management 
Mode-This 
mode is com- 
posed of a special purpose interrupt that serves as 
the hardware interface and a secure memory address 
space that stores processor status and special soft- 
ware routines. It can be used to implement power 
management for portable systems. 
• Flexible Clocking Options-The 
clock input to the 
CPU can either be a IX clock or a 2X clock. For IX 
clock option, the internal clock of the CPU runs at 
the same speed as the input clock (CLK input). In 
the 2X clock case, the clock input (CLK2 input) 
needs to be twice the frequency of the internal clock. 


• Different Low Power States-Different 
low power 
processor states are available for various operating 
conditions. This feature enables the conservation of 
processor power consumption 
without 
sacrificing 
performance. 


• Low Voltage Power Supply Option-The 
SL En- 
hanced Intel486 CPUs can be powered by either a 
5V or a 3.3V supply, with the 3.3V supply providing 
a 50% power savings. 


The System Management Interrupt (SMI II) input pin 
on the processor provides the hardware interface for 
the computer system to invoke SMM. An exclusive 
memory address space, SMRAM, is only available for 
the CPU to access while in SMM. The size of SMRAM 
can be between 32 Kbytes and 4 Gbytes. It is used to 
store processor state and SMI handlers. SMI handlers 
are special software routines that can be designed to 
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control the power states of system components. Intel's 
SMM has a special instruction, RSM, that is responsi- 
ble for exiting SMM. When executed, RSM instruction 
restores the processor state from SMRAM and returns 
control 
to the application 
that 
was interrupted 
by 
SMIII. 


The servicing of SMI II is different from that of a regu- 
lar interrupt. The system invokes SMM by generating 
an SMI II to the SL Enhanced Intel486 CPU. Normal 
program execution will be interrupted as a result and 
the CPU will respond to the interrupt 
by asserting 
SMIACT II. This signal is used by the system to enable 
SMRAM space. The CPU will then save its state into 
the 
SMRAM 
area, 
starting 
at 
address 
location 
3FFFFH 
and proceeding downward 
in a stack-like 
fashion. After completing a state save, the CPU will be 
in a pseudo real-mode processor environment. The mi- 
crocode will then direct the CPU to begin executing 
instructions 
at 
the 
absolute 
address 
of 
38000H 
(SMRAM), at which point it will begin executing SMI 
handlers. SMI handlers can perform various system 
management functions, including system power con- 
trol. The last instruction in an SMI handler is always 
the RSM instruction. This instruction will restore the 
CPU state from SMRAM, de-assert SMIACT II and 
return control of the system to the interrupted 
pro- 
gram. 


The generation of an SMI II to the CPU can be initiated 
by hardware or software for the purpose of power man- 
agement. The actual implementation 
depends on the 
specific chipset used and how the system is designed. 
Hardware can generate SMI II by pulling the SMI II 
pin low directly or through other chipset pins, i.e., bat- 
tery level control. When the charging level of the main 
battery falls below a certain limit, the chipset monitor- 
ing the battery level can interrupt the normal program 
operation by pulling the SMIII 
pin low. While in 
SMM, the CPU can execute certain power-down SMI 
handlers to put the entire computer system in a low- 
power mode that can operate out of a different battery 
source. This will enable the system to maintain its cur- 
rent status 
while allowing the main battery 
to be 
changed. SMI II can also be initiated through software. 
Different chipsets have various ways of interfacing to 
the CPU in this aspect. Most of them have dedicated 
timers to detect an idled device. Once these timers are 
enabled, the timeout will automatically 
generate an 
SMIII. 


Exiting SMM is accomplished by the execution of the 
RSM instruction. Besides restoring the CPU state, the 
RSM instruction can also perform three other func- 
tions. The first is called "Auto HALT Restart". The 
System Management Interrupt 
request can interrupt 
the HALT instruction. By setting the appropriate con- 
trol slot in the SMRAM space, the RSM instruction 
can return control to the HALT instruction or the in- 


struction immediately following the HALT instruction. 
The second special feature is "I/O 
Trap Restart". 
If 
SMI"" interrupt 
is generated on an I/O 
access, the 
RSM instruction will re-execute that I/O instruction if 
its I/O Trap Restart slot in the SMRAM is set. The 
third function is the relocation of 5MBASE, the start- 
ing address of SMRAM. This provides system design- 
ers with the flexibility of placing SMRAM space into 
an area in which its integrity is best ensured. The start- 
ing address is controlled by the 5MBASE register, 
which has a power-on reset value of 30000H. The de- 
fault SMRAM 
area starts at 38000H and ends at 
3FFFFH. 
The SMRAM can be relocated to any 32K 


aligned area, either overlaid on top of the normal sys- 
tem address space or placed in a distinct address space. 


The standard Intel486 SX and Intel486 DX CPUs are 
driven by IX clock as opposed to the Inte1386 CPUs 
which use a 2X clock input. The SL Enhanced Intel486 
CPUs are available with either the IX or the 2X clock- 
ing options. 


The IX clock allows simpler system design by cutting 
in half the clock speed required in the external system. 
The IX clock relies on an internal Phase Lock Loop to 
generate the two internal clock phases, "phase one" 
(PhI) and "phase two" (ph2). The rising edge of the 
CLK input corresponds to the start of ph 1.All external 
timings are specified with respect to the rising edge of 
the CLK input. The PLL requires a constant frequency 
CLK input (to within 0.1%), and therefore the CLK 
input cannot be changed dynamically. 


The IX CLK input option is essential for those proces- 
sors with an on-chip clock doubler. The IX CLK input 
provides the fundamental timing references for the bus 
interface unit. The CLK input is doubled internally so 
that the CPU core will operate at twice the CLK input 
frequency, and hence twice the bus frequency. The in- 
ternal clock doubler enhances all operations using the 
internal cache and/or not blocked by external bus ac- 
cesses. This mode also uses PLL and therefore the CPU 
CLK input must be maintained at a constant frequen- 
cy. 


The SL Enhanced Intel486 CPUs also offer a 2X clock 
option for systems that rely on dynamic frequency scal- 
ing for CPU power management. The frequency of the 
CLK2 input is twice the internal frequency of the CPU. 
The internal clock is comprised of two phases, "PH I" 
and "PH2". Each CLK2 period is a phase of the inter- 
nal clock. All timings are referenced to the rising edge 
of the PH 1of the CLK2 input. It is therefore important 
to synchronize the external circuitry with the PHI of 
the CLK2 input. Because the 2X clock option does not 
rely on the PLL to generate the internal phase clocks, 
the frequency of the CLK2 input can be changed dy- 
namically or "on-the-fly". 


Several low power modes are available on the IX mi- 
croprocessors. These modes make it possible for a pow- 
er-sensitive computer system to optimize power conser- 
vation. Some of the CPU power controls are realized 
through 
a specially provided 
interrupt 
mechanism. 


Each of the following low power states is described in 
detail. 


• Auto Idle Power Down State 
This low power state is available in normal opera- 
tion for the DX2 CPUs. DX2 CPUs have an inter- 
nal clock doubler which doubles the IX clock input 
and therefore enables the CPU core to operate at • 
twice the speed of the input clock. When the SL 
Enhanced Intel486 CPU is known to be truly idle 
and waiting for a ready from a memory or an I/O 
bus cycle read or write, the DX2 CPU will reduce 
its core clock rate to IX from the doubled DX2 
clock. In this state, the CPU only consumes half of 
the normal power. More importantly, this function 
is transparent 
to software and external hardware 
and therefore does not cause any performance deg- 
radation. 


• Stop Grant State 
The Stop Grant state is initiated by simply asserting 
the external STPCLK"" 
interrupt 
pin. The Stop 
Grant state is used to transition to the Stop Clock 
state. 
The CPU enters the Stop Grant state through the 
following steps: When the CPU 
recognizes the 
STPCLK"" request, it will stop the execution of the 
normal program on the next instruction boundary, 
stop the pre-fetch unit, empty all internal pipelines 
and write buffers, generate a Stop Grant bus cycle 
and then stop the internal clock. 
This state is exited when STPCLK"" pin is pulled 
high. The rising edge of the STPCLK"" will tell the 
CPU to return control to the interrupted program 
and .start to execute the instruction following the 
last executed instruction of the interrupted program. 


• Stop Clock State 
The Stop Clock state is entered from the Stop Grant 
state by completely stopping the clock input to the 
PLL. 
In 
this 
state, 
the 
CPU 
consumes 
only 


- 100 /LA-200 /LA of current. 


• Auto HALT Power Down State 
When the HALT instruction is executed, the CPU 
will issue a normal HALT bus cycle. The SL En- 
hanced Intel486 CPU will automatically stop the in- 
ternal CPU clock, therefore causing the CPU to en- 
ter a low power state with a current of - 20 mA- 
55 mA. 


• Stop Clock Snoop State 
Cache snooping is necessary during Stop Grant and 
Auto HALT Power Down states in order to main- 
tain memory coherency. When the system issues a 
request for cache snooping, the CPU will transpar- 
ently enter the Stop Clock Snoop state and will pow- 
er up for I full core clock to complete the cache 
snoop cycle. It will then re-freeze the clock to the 
CPU core and either return to the Stop Grant or the 
Auto HALT Power Down state. 


2.3.2 
Transition 
of Power 
States 
and Latency 
Associated 
with 
1X Mode 


It is important to understand how the different power 
states are interrelated and how one transitions to anoth- 
er. The latency is different for different state tran- 
sitions. Figure I depicts which transitions are allowed. 
We shall describe how the transitions are made and 
how much latency is associated with each transition. 
I. The Auto Idle Power Down state is entered whenev- 
er the CPU is detected idle and waiting for a RDY II 
from either a memory or I/O read/write. This state 
only applies to SL Enhanced Intel486 DX2 CPUs. 
Both the internal CPU core clock and the current 
drop to half of the doubled frequency. There is no 
latency associated with this transition. The CPU 
will go back to normal operation when a RDY II is 
detected. 


2. The 
Stop 
Grant 
state 
is 
entered 
when 
the 
STPCLK II interrupt is asserted to the CPU by the 
system. In this state, the clock output ofPLL (or the 
clock input to the internal core) is stopped. The 
speed of the clock input to the PLL can be main- 
tained 
or 
changed. 
If the 
clock frequency is 
changed, the CPU requires the clock to be held at a 
constant frequency for a minimum of I ms before 
de-asserting STPCLK II. The I ms time period is 
necessary so that the PLL can stabilize the input 
clock period. De-asserting STPCLK II returns the 
CPU to normal operation. The CPU will also return 
to its normal state when RESET or SRESET is as- 
serted. 
3. The Stop Clock state can only be entered from Stop 
Grant state when the clock input to the PLL is 
stopped. The CPU must go through Stop Grant 
state to return to normal operation. Beforethe CPU 
can return to Stop Grant state, the clock input has 
to stabilize for the I ms required by the PLL. 


4. The Auto HALT Power Down state is entered when 
HALT instruction is executed. The clock input to 
the internal CPU core is automatically stopped 
upon the execution of HALT instruction. The clock 
input to the PLL cannot be changed during this 
state. This state is exited back to normal operation 
whenever any of the following events happen: 
INTR, NMI, SMIII, RESET or SRESET. There is 
no latency associated with this state transition. 
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5. When the CPU is in Auto HALT Power Down 
state, the system can generate STPCLK /I to the 
CPU to bring the CPU into a Stop Grant State. 
When the system de-asserts the STPCLK 11 request, 
the CPU will return to the Auto HALT 
Power 
Down state. There is no latency associated with this 
transition. HALT bus cycles will be launched when- 
ever this transition occurs. 
6,7. 
Cache snooping can be performed when the CPU 
is either in Stop Grant state or in Auto HALT 
Power Down state. Cache snoop cycles begin 
when the CPU receives an EADS1I from the sys- 
tem. The CPU will only wake up for I complete 
core clock to perform cache invalidation and then 
re-freeze the clock, i.e., either return to the Auto 
HALT Power Down state or to the Stop Grant 
state. 


There are five operating modes for the 2X SL En- 
hanced Intel486 CPU. The CPU power controls are 
realized through a specially provided interrupt mecha- 
nism, STPCLK 11. In the following, we shall describe 
each of the power states in detail. 
• Normal State 


2X CPUs do not have a PLL, and therefore do not 
require a stabilized clock period. This means the fre- 
quency of the input clock (CLK2) can be changed 
dynamically. 
Depending on the level of activity, 
CPUs do not always have to operate at full speed. 
Reducing the CPU speed saves power. 
• Stop Grant State 
The Stop Grant state is initiated by simply asserting 
the external STPCLK 11 interrupt 
pin. The Stop 
Grant state is used to transition to the Stop Clock 
state. 


The CPU enters the Stop Grant state through the 
following steps: when 
the 
CPU 
recognizes 
the 
STPCLK 11 request, it will stop the execution of the 
normal program on the next instruction boundary, 
stop the pre-fetch unit, empty all internal pipelines 
and write buffers, generate a Stop Grant bus cycle 
and then stop the internal clock. 
This state is exited when STPCLK /I pin is pulled 
high. The rising edge of the STPCLK /I will tell the 
CPU to return control to the interrupted program 
and start to execute the instruction following the 
interrupted instruction. 
• Stop Clock State 
The Stop Clock state is entered from the Stop Grant 
state 
by 
completely 
stopping 
the 
clock 
input 
(CLK2). 
In this state, the CPU consumes only 
- 100 /-LA-2OO/-LAof current. 


• HALT State 
When the HALT instruction is executed, the CPU 
will issue a normal HALT bus cycle. For 2X CPUs, 
there is no power savings in this state. 


• Stop Clock Snoop State 
Cache snooping is necessary during Stop Grant state 
in order to maintain memory coherency. When the 
system issues a request for cache snooping, the CPU 
will transparently enter the Stop Clock Snoop state . 
and will power up for I full core clock to complete 
the cache snoop cycle. 


2.3.4 
TRANSITION 
OF POWER 
STATES 
AND 
LATENCY 
ASSOCIATED 
WITH 
2X MODE 


Figure 2 shows the state transitions of a 2X microproc- 
essor. We shall describe how the transitions are made 
and how much latency is associated with each tran- 
sition. 


1. The 
Stop 
Grant 
state 
is 
entered 
when 
the 
STPCLK # interrupt is asserted to the CPU by the 
system. In this state, the speed of the external clock 
input (CLK2) can be maintained or changed. There 
is no latency associated with changing the CLK2 
frequencies. De-asserting 
STPCLK # returns 
the 
CPU to normal operation. The CPU will also return 
to its normal state when RESET or SRESET is as- 
serted. 
2. The Stop Clock state is entered from Stop Grant 


state when the clock input (CLK2) is stopped. The 
CPU must go through Stop Grant state to return to 
normal operation. There is no additional delay asso- 
ciated with returning to normal operation. 
3. This state is entered when the HALT instruction is 
executed. The HALT state consumes the same pow- 
er as the normal state. The HALT state is exited 
back to normal operation whenever any of the fol- 
lowing events happen: INTR, NMI, SMI #, RESET 
or SRESET. There is no latency associated with this 
state transition. 


4. When the CPU is in HALT state, the system can 
generate STPCLK # to the CPU to bring the CPU 
into Stop Grant state. When the system de-asserts 
the STPCLK # request, the CPU will return to the 
HALT state. There is no latency associated with this 
transition. HALT bus cycles will be launched when- 
ever this transition occurs. 


3.0 
IMPLEMENTATION OF POWER 
MANAGEMENT FEATURES 


The means of implementing power management fea- 
tures depend on the specific chipset used. Most of the 
chipsets have both hardware and software power man- 
agement. There are always a number of dedicated or 
user-definable timers that monitor the activity of cer- 
tain device(s), such as CPU or peripheral components. 
In a software approach, the timeout of these timers can 
trigger a System Management Interrupt. Upon the de- 
tection of an SMI, the CPU will execute the power 
management SMI handlers in the BIOS which exercise 
device power control through detailed programming. 
For a hardware-based approach, the timers can auto- 
matically be enabled to perform power controls. If the 
status of the specific device or the entire system needs 
to be saved before changing its power state, the soft- 
ware approach must be used. In other words, if the 
original status of a device or the entire system is re- 
quired to return the system to its operational state be- 
fore the power state change, SMM must be invoked and 
power control will be accomplished by the SMI han- 
dlers. This section summarizes several power control 
schemes that are common conceptually to all major 
chipsets and explains how they interact with the power 
management 
features offered by the SL Enhanced 
Intel486 CPUs. 


Controlling 
SL Enhanced 
Intel486 
CPU 
power 
is 
achieved by reducing the CPU clock speed, stopping 
the CPU clock or shutting off the CPU power. 


All chipsets have the option of pre-programming 
the 
CPU speed regardless of the level of system activity 
there is and the CPU clock speed can be divided down 
by 2 to 64. Once the CPU is selected to run in a re- 
duced speed mode and the CPU clock division is select- 
ed, the CPU will always run at a divided speed until the 
CPU is switched into some other mode of operation. 
Speed reduction is done by BIOS through program- 
ming certain register bits immediately after booting. 


The speed of the CPU can also be changed dynamically 
depending on the level of system activity. Most of the 
chipsets provide mechanisms of monitoring the level of 
system activity involving the CPU by detecting the tog- 
gling of certain CPU signal lines. The timers associated 
with these monitoring devices are responsible for deter- 
mining when to reduce the CPU speed by timing out a 
programmable 
time period. The chipset reduces the 
speed of the CPU clock by dividing its clock input to 
the CPU after STPCLK # is asserted. For 2X SL En- 
hanced Intel486 CPUs, the speed can be changed on- 
the-fly. For IX SL Enhanced Intel486 CPUs. clock in- 
put has to be stabilized for I ms before de-asserting 
STPCLK #. Stopping the CPU clock is accomplished 
in the same fashion. 


The CPU clock control can be achieved either through 
hardware or software approach. If the status of the 
CPU needs to be saved in order to return the system to 
the original state, CPU clock control should be done 
through the SMI handlers in SMM mode. 


Shutting off the power to CPU can only be done in 
suspend mode. 


3.2 
Controlling Power of Peripheral 
Components 


The power control of peripheral components is accom- 
plished through idle detectors and SMI generators. 


Idle detectors monitor the access to the following devic- 
es: Keyboard, Video RAM, Floppy Disk Drive, Hard 
Disk Drive, Serial Port and Parallel Port. Idle detectors 
can also monitor access to a programmable 
address 


range. Some chipsets even provide additional pins to 
monitor other user-definable miscellaneous activities. 
There are timers associated with each of the idle detec- 
tors with programmable idle time period and the timers 
activate power control pins that are directly tied to the 
controlled devices. When a timer )limes out the pro- 
grammed period, it will activate the power control pin 
to shut down the power to the specific device. For ad- 
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dress range idle detectors, when there is no access to 
the monitored address range for a programmed timeout 
period, the timers will either reduce the clock speed or 
shut off the power of the device that has the same ad- 
dress bits. The idle detectors can operate outside of 
SMM mode and can be independent of CPU state. 


SMI event generators generate SMI requests through a 
number of dedicated or user-defined events. Depending 
on the specific chipset used, these events can be the 
activity timeout of individual devices or a collection of 
devices. Upon the timeout, an SMI request will be gen- 
erated to the CPU, which in turn invokes SMM. The 
SMI handlers in the SMRAM contain the software rou- 
tine that controls the power state of the device(s) initi- 
ating the SMI request. By executing this routine, the 
CPU will access the power management control regis- 
ters associated with the devices. After proper program- 
ming, these registers will activate the proper power 
control pins to shut down the power to the proper de- 
vice(s). Controlling 
the power of peripheral 
devices 


through SMI handlers offers complete flexibility to ei- 
ther manage the power individually or collectively. 
This is very important for optimum power conserva- 
tion. 


Suspend state is the deepest level of power conserva- 
tion. There are two types of suspend: normal suspend 
and O-voltsuspend. In normal suspend, only the CPU, 
chipset and memory sub-system are powered. System 
status is saved into SMRAM. The rest of the system is 
shut down. DRAM is refreshed with a very slow clock 
(64 KHz or 32 KHz). In OVsuspend, the entire system, 
including the CPU, is shut down except the part of the 
system logic that is responsible for resume. The resume 
logic is always powered by an RTC battery. The system 
status is saved onto hard disk. Suspend is normally trig- 
gered by the suspend timer, and the timeout of the sus- 
pend timer is also programmable. 


Because of the amount of BIOS support required by 
Suspend, SMM must be invoked. Hardware alone can- • 
not accomplish the task. 


SL Enhanced Intel486 CPUs provided the best power 
management features that Intel SL technology offers. 
Intel's System Management Mode has become an in- 
dustry standard for power-saving computing. Through 
Intel's SMM, the implementation 
of power manage- 


ment is very flexible, enabling the optimization of pow- 
er conservation for different system designs. The vari- 
ous CPU clock control options available on SL En- 
hanced Intel486 processors provide the basis for run- 
time power management with no performance impact. 
All major chipsets support the Intel power manage- 
ment scheme with easy-to-design software and hard- 
ware interfaces. 
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Today's market for notebook computers demands desk- 
top performance in smaller and smaller form factors. 
Along with the higher performance comes greater pow- 
er consumption, which adds unique challenges for the 
mobile operating environment (battery operating time, 
thermal management, physical dimensions, etc.). In- 
cluded in this application note is a basic description of 
the thermal forces at work in mobile applications, with 
mathematical models that can be used to project system 
thermal parameters and aid the designer in worst-case 
design. 


The fIrst section begins with a review of the basic ther- 
mal defInitions which apply to notebook designs. Ther- 
mal data from a notebook experiment is presented to 
show a relationship between the temperature outside 
the notebook and the CPU case temperature. A model 
is given that helps the designer ensure the CPU thermal 
operating specifIcations are met. 


Next, several power consumption profIles are provided, 
starting with the assumed worst-case model, along with 
more conservative power profIles based on the degree of 
power management implementation. 
Based on these 
models, designers may forecast the amount of addition- 
al thermal m::'i6in their applications need. 


With these thermal models, power consumption pro- 
fIles, and test measurements, the designers will have the 
necessary tools and techniques to design for the worst 
case, and understand 
that by applying simple design 
enhancements, they can improve the quality of their 
designs. The designer should ensure the measured CPU 
case temperature 
(TCASE> complies with the TCASE 


specifIcations published in the SL Enhanced Intel486 
Microprocessor Data Sheet Addendum. 


Designing high performance CPU notebook systems re- 
quires some knowledge of the three processes by which 
heat is transferred from one point to another, namely: 
conduction, convection, and radiation, which are de- 
scribed in the following three sections. This knowledge 
will help the designer understand the subsequent meth- 
ods of heat transfer and their value in maintaining the 
CPU within its specifIed TCASE limits. The formulas 


NOTES: 


1. Physics, Second 
Edition, 
Paul A. Tiper. Worth Pub- 
lishers, Inc., 1982, p. 531. 


2. Physics. Second 
Edition, 
Paul A. Tiper. Worth Pub- 
lishers, Inc., 1982, p. 531. 


describing heat transfer by conduction, convection and 
radiation can be shown analogous to Ohm's Law: 


V 
1=- 
R. 


The thermal current, temperature difference and ther- 
mal resistance are analogous to electrical current (I), 
voltage (V), and electrical resistance (R), respectively. 


Conduction is a process by which heat flows from a 
region of higher temperature to one of lower tempera- 
ture within a medium (solid, liquid, or gas) or between 
mediums in direct physical contact(l). In a one-dimen- 
sional system, conductive heat transfer is governed by 
the following relation: 


Conductive 
Heat Transfer 
~T 
q = L/kA 
V 
1=- 


R 


where: 


q = Heat flow rate (W) 
k = Material thermal conductivity (WIm°C) 
A = Cross-sectional area (m2) 


ATL = Temperature gradient ("C/m) 


L = Distance of heat transfer 


In the preceding equations, the thermal current, q, can 
be viewed analogous to electrical current; AT analo- 
gous to voltage; and LIkA analogous to thermal resist- 
ance. To improve thermal conduction in a notebook 
environment, copper and other highly-thermal conduc- 
tive metals can be used in the package design. 


Convection is a process of energy transport by the com- 
bined action of heat conduction, energy storage, and 
mixing 
motion(2). Convection 
is 
the 
predominant 


mechanism for transferring energy between a solid sur- 
face and a fluid. In the notebook environment, this is 
equivalent to the heat transfer between the case surface 
and the ambient environment (air). The basic relation 
that describes heat transfer by convection from a sur- 
face to a fluid presumes a linear dependence on the 
difference between the temperature at the surface and 
deep in the fluid, and is referred to as Newtonian cool- 
ing: 


Convective 
Heat Transfer 


Ts - 
TA 
qc = 
1/hcA 


V 
1=- 
R 


where: 
qc 
Convective heat flow rate from a surface to 
ambient (W) 
A 
= Surface area (m2) 


Ts 
Surface temperature ("C) 
TA 
= Ambient temperature ("C) 
hc 
= Average convective heat transfer coefficient 
(W/m2°C) 


In the preceding equations, the thermal current, qc, 
can be viewed analogous to electrical current; Ts - TA 
analogous to voltage; and IIbC A analogous to thermal 
resistance. 


In forced convection, fluid flow is caused by an external 
factor such as a fan, while in free or natural convection, 
fluid motion is induced by density differences resulting 
from temperature gradients in the fluid (liquid or gas). 
Under the influence of gravity or other body forces, 
these density differences give rise to buoyancy forces 
that 
circulate 
the affected fluid and 
convect heat 
toward or away from surfaces wetted by the fluid. Al- 
though fans are often used to increase air convection 
inside desktop computers, they may not be a practical 
solution for notebook systems. 


Thermal radiation is defined as radiant energy emitted 
by a medium by virtue of its temperature, without the 
aid of any intervening medium(3). The amount of heat 
transferred by radiation between two bodies at temper- 
atures T 1 and T2 is governed by the following expres- 
sion: 


Radiative 


Heat Transfer 


T~ - T~ 
q = 
1/EO" 
V 
1=- 
R 


where: 
q = Amount of heat transferred by radiation (W) 


E = Emissivity 0 < E < I 


CT =Stefan-Boltzmann constant, 
5.67 X 10-8 (W/m2 °K4) 


NOTE: 
3. Physics, Second Edition, Paul A. Tiper. Worth Pub- 
lishers, Inc., 1982, p. 535. 


A = Area (m2) 
Tl> T2 = Surface temperature ("K) 


For radiation to be an effective method of heat transfer, 
compared to natural or forced convection mechanisms, 
a relatively large temperature difference must exist be- 
tween Tl and T2. For most low-power electronic appli- 
cations, these temperature 
differences are relatively 
small, and therefore, radiative effects are normally ne- 
glected. However, for high-power applications, 
heat 
transfer by radiation factors should be considered. Al- 
though heat radiation is a secondary thermal effect in a 
notebook 
system, some manufacturers 
are selecting 
coatings (i.e., black paint) for their notebook designs 
that are absorptive in the infrared to improve upon 
these thermal radiation effects. 


Thermal management of an electronic system encom- 
passes all the thermal processes and technologies which 
must be used to remove and transport heat from indi- 
vidual components to the system thermal sink in a con- 
trolled manner. The primary heat transfer processes 
(conduction, convection and radiation) can be com- 
bined into a single linear model (see Section 5.1). 


The junction-to-case 
(OJe) and junction-to-ambient 
(OJM thermal resistance values are used as measures of 
IC package thermal performance. These parameters are 
defined by the following relations: 


where: 


0JA 
= Junction-to-ambient thermal resistance ("CIW) 


0JC 
= Junction-to-case thermal resistance ("CIW) 


OCA = Case-to-ambient thermal resistance ("CIW) 
TJ 
= Average die temperature ("C) 
TC 
= Case temperature at a predefmed location ("C) 
P 
= Device power dissipation (W) 
TA = Ambient temperature ("C) 


0JC is a measure of package internal thermal resistance 
from silicon die to package exterior. This value is high- 
ly dependent upon packaging material, thermal con- 
ductivity, and package geometry. 
0JA 
measures the 
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conductivity and convective thermal resistance from 
package exterior to the ambient, as well as package in- 
ternal thermal resistance. 0JA values depend on materi- 
al, thermal conductivity, package geometry, and ambi- 
ent conditions such as flow rates and coolant physical 
properties. 


To improve CPU thermal characteristics in a notebook 
system, heat sinks are sometimes mounted on the top of 
the CPU using highly conductive adhesive materials. 
Adding a heat sink will not change 0Je; however, it will 
improve heat conduction and convection due to the in- 
crease in surface area, resulting in a significant reduc- 
tion in the case-to-ambient and, therefore, junction-to- 
ambient thermal resistance. Depending on the product 
and materials used, 0JA can be reduced by 10 to 30 
percent. 


To guarantee component functionality and reliability, 
the maximum device operating temperature is defined 
and constrained by the package exterior temperature at 
a predefined location. The guidelines for ambient tem- 
perature specify that measurements should be taken at 
an undisturbed location at a certain distance away from 
the package---traditionally 
12 inches horizontally from 
the center of the CPU. Measuring TA in the traditional 
manner is not possible in a notebook system. The case 
temperature, however, is measured at the center surface 
of the package. Depending on the ambient temperature 
and board power in the system's environment, thermal 
enhancements such as heat fins or forced air cooling 
may be necessary to meet the case temperature require- 
ments. 


In a linear model, power consumption is governed by 
the following equation: 


where: 


P 


Vee 
Ice 


Power consumed by the component 
Supply voltage 
Current through the component 


The preceding equation indicates that power consump- 
tion is linearly proportional with both supply voltage 
and current flowing through the component. For exam- 
ple, a 3.3V microprocessor will consume less power 
than a 5V microprocessor running the same application 
under equivalent operating conditions. 


A system's total power consumption is defined as either 
the sum of the power consumed by each individual 
module within the system, or the sum of the products 
of each module's voltage supply and its current. It is 
equivalent to: 


Several CPU Ice measurements were taken using an 
SL Enhanced Intel486 CPU evaluation board with vari- 
ous CPU modules running under two different operat- 
ing environments: Windows 3.1 and Indeo™ 
Video 


software (see Table 3-1). The modules used were SL 
Enhanced Intel486 DX-33 CPUs (IX SQFP, IX PGA, 
and 2X SQFP), and SL Enhanced Intel486 DX2-50 
CPUs (IX PGA). All of the CPU modules use a chipset 
with Power Management software. 


When comparing the Ice measurements between sever- 
al application environments, the CPUs running Indeo 
Video software consume the most Ice current by ap- 
proximately 10%. Since the Ice value represents the 
number of gates switching inside the CPU, and hence, 
the intensity of the CPU working condition, it is there- 
fore concluded that running Indeo Video software will 
be close to the worst case for thermal measurement 
purposes. 


Windows 3.1 
Indeo™ Video Software 


Icc 
Icc 


ClK 
CPU 
Vcc 
Freq 
Package 
°JC 
°JA 
Active 
Stop 
STPClK 
Active 
Stop 
STPClK 
Grant 
Grant 


1X 
OX 
3.3 
33 
SOFP 
3.5 
25.0 
0.279 
0.008 
0 
0.290 
0.008 
0 


OX 
5.0 
33 
PGA 
1.5 
17.0 
0.491 
0.041 
0 
0.512 
0.041 
0 


OX2 
5.0 
50 
PGA 
1.5 
17.0 
0.656 
0.046 
0 
0.685 
0.046 
0 


2X 
OX 
3.3 
33 
SOFP 
3.5 
25.0 
0.283 
N/A 
0 
0.294 
N/A 
0 


NOTES: 
1. Allthermal values are measured at zero airflow. 
2. Measurements taken on an SL Enhanced Intel486CPU EvaluationBoard (no heat spreader, no heat sink). 
3. Allmeasurements were taken using one module of each microprocessor. 
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3.3 
Typical System Power 
Consumption 
Profiles 


Table A-2 in the Appendix examines the power dissi- 
pated for four typical "system" profiles. These are sys- 
tems in the sense that the power used is assumed con- 
trolled (except in the first case) by either the operating 
system or the system hardware, aside from the CPU. 
These cases, however, do not attempt to add the effects 
of other power dissipating components that would exist 
in a complete PC notebook system. The first case gives 
the most conservative power calculation: the maximum 
power that can be generated by the CPU. The second 
case gives the typical average power. The last two sug- 
gest power consumption possibilities that could occur 
in a given system, or even be guaranteed by power man- 
agement. Many similar combinations would also be rea- 
sonable. Vcc in each case assumed as the standard val- 
ue (5.0V or 3.3V). 


In Case I, the average power is calculated as the stan- 
dard Vcc (3.3V or 5V) times the maximum current, 
Icdmax), 
that can be drawn by a particular CPU. This 
calculation gives the maximum power that can be dissi- 
pated while continuously executing the most power 
consuming instruction sequence. This value should be 
used in a system design if no thermal power manage- 
ment is imposed, and the designer wants to minimize 
potential 
problems 
even under 
worst-case 
cirfum- 
stances: a conservative design. 


In Case 2, the average power is calculated as the stan- 
dard Vcc (3.3V or 5V) times the TYPICAL current, as 
specified in Intel486 microprocessor data books. This 
calculates the average heat from the CPU that would be 
dissipated over time while executing a typical mix of 
software. The designer could use this power value in a 
less conservative design. However, if the CPU case tem- 
perature approaches its maximum specified value and 
no thermal power management is applied, the Tdmax) 
specification could be exceeded. (Section 5.5 discusses 
the time dependency issues in averaging the thermal 
power generated while executing a mix of software.) 


In Case 3, the average power is calculated as the stan- 
dard VCC (3.3V or 5V) times the Icdmax) 
for 10% of 
the time, and Icdtypical) 
for 90% of the time. This is a 
more conservative assumption than Case 2, allowing for 
some intervals in which the CPU runs at full power, 
and an overall thermal guardband over Case 2. 


Case 4 assumes that the current is distributed at the 
maximum for 10%, typical for 80%, and Stop Clock 
for 10% of the time. A mix of this sort is appropriate in 
a design where power management is applied to assert 
Stop Clock for at least 10% of the time. This mix could 
reduce the performance of the CPU. Suppose the sys- 
tem designer devises a theoretical mix of IcdMax, 
Typical and Stop Clock) which is exceeded by the sys- 
tem only I% of the time when running all standard 


benchmarks and applications. The designer builds the 
system to that specification, with thermal power man- 
agement responding only when the limit is exceeded. 
Then one can safely design the system assuming a sig- 
nificantly lower power than the absolute maximum, 
and experience performance degradation only I% of 
the time. 


The cases above illustrate that many less than worst- 
case power profiles are possible, depending on the soft- 
ware being run, and power management options being 
used. Intel recommends that systems be tested for ther- 
mal problems under the worst case power usage that 
the customer could contrive, and in an ambient temper- 
ature equal to the maximum specified for the design. 


This section provides some experimental thermal data 
which will help the designer better understand some of 
the system level issues affecting the thermal perform- 
ance of a notebook. Section 4.1 describes in one experi- 
ment how test chamber 8]A can be used as an approxi- 
mate thermal performance criteria in the early stages of 
a notebook design. Section 4.2 shows how in another 
experiment 
8]A is affected by CPU location on the 
motherboard inside the notebook. Section 4.3 presents 
a model showing the relationship between power gener- 
ated by other components on the motherboard and the 
CPU's 8]A requirement. 


4.1 
Measurements 
from Commercial 
Notebooks 


Since component location differs among notebook de- 
signs causing internal power densities to vary between 
notebooks there is no one place inside a notebook 
where TA can be defined in order to obtain an accurate 
system thermal profile. This section presents experi- 
mental 
thermal 
data 
collected 
for 
four 
different 
Inte1386 SL CPU 
notebooks 
running 
Indeo 
Video 
software and shows-as 
a rough estimate-that 
the 
ambient temperature (TM outside a notebook can be 
used with thermal resistance (8]M to project CPU tem- 
perture (T], Tc). 


In this notebook experiment, the CPU case temperature 
of four different Inte1386 SL CPU notebooks (PQFP 
packages) were measured using K-type thermocouples 
a digital multimeter, and an Inte1386 DX CPU-based 
system with data acquisition software. To simulate 
maximum ICC consumption, each notebook continu- 
ously executed Indeo Video software for the duration of 
the experiment. Table 4-1 shows the junction tempera- 
ture and 8]A calculated by using the thermal imped- 
ance equations from Section 2.2. The CPU case temper- 
ature, 
the maximum 
power 
consumption 
of 2.5W 


running Indeo Video software, and the test chamber 
8JC of 6°C/W as specified in the Inte1386TMSL Micro- 
processor Data Book for the 196L PQFP were used to 
calculate the junction temperature of each CPU. With 
this calculated CPU TJ and measured ambient room 
temperature of 25°C, the corresponding notebook's 8JA 
was obtained. 8JA can be calculated by combining the 
first two equations in Section 2.2 as follows: 


Although the test chamber 8JA for the 196L PQFP of 
23°C/W was collected with only the CPU present on a 
test board, the value obtained approximates 
8JA of a 
notebook operating in an environment of 25°C. One 
possible explanation is that the conductive and radia- 
tive effects the other components have on the CPU in- 
side the operating notebook, such as the PC board, con- 
nectors, floppy disk, shielding and plastic enclosure, ac- 
tually cause the temperature inside the notebook to be 
lower than expected. The end result is a lower 8JA for 
the CPU inside the operating notebook than the 8JA of 
a lone CPU inside a test chamber where the only con- 
ductive and radiative path is to the surrounding air, as 
shown in Notebooks # 1 and # 3. Table 4-1 shows the 
large variation in notebook 8JA caused by different sys- 
tem designs. Thus, the test chamber 8JA can be used as 
a rough guideline in the early stages of a notebook de- 
sign. For a more in-depth analysis of the conditions 
inside an operating notebook, see Section 4.3. For the 
final design, the thermal performance of the system 
should always be verified by measuring the CPU case 
temperature. 
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4.2 
()JA and ()JC Measurements 
in an 
Actual Environment 


A test motherboard with the same form factor as that 
of the original Test Notebook #4 was fabricated in or- 
der to take experimental measurements of 8JA and 8JC 
(see Figure 4-1). The only differences between the two 
boards are the following: 
1. The test board had two slots instead of the 70/80 
pin connectors on the motherboard. 


2. The test board only had six thermal test packages 


mounted on it (three on the component side and 
three on the solder side). 


Measurements from the 6 thermal test units yielded a 
8JA range of 20°C/W-25°C/W 
in the test chamber 


and 22°C/W -28°C/W 
in the Test Notebook and a 8JC 


range of 3°C/W -5°C/W 
in both the test chamber and 


the Test Notebook (see Table 4-2). 


Table 4-1. 8JA Calculations 
for Intel386 
SL CPU 
Notebooks 
Running Indeo™ 
Video Software 


Notebook 
# 
TeASE 
TJ 
8JA 
(calculated) 
(calculated) 


1 
48.9 
63.9 
15.5 


2 
69.0 
84.0 
23.6 


3 
52.1 
67.1 
16.8 


4 
71.5 
86.5 
24.6 
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Table 4·2. Thermal Resistance 8JA and 8JC 
for the SQFP Package 


Test Motherboard 
Test Motherboard 
in Test Chamber 
In Test Notebook 
eC/W) 
eC/W) 


8JA 
20-25 
22-28 


8JC 
3-5 
3-5 


NOTES: 
1. Test Notebook #4 was used to collect the experimental 
data. 
2. 208L SQFP test package with heat spreader containing 
thermal test die was used in all experiments to vary and 
measure the power going into package as wellas to mea- 
sure the temperature of the die. 
3. Allmeasurements were made withzero airflow,simulat- 
ing a typical notebook environment. Ambienttemperature 
is defined as ambient temperature outside the notebook. 


The worst CPU thermal location was on the center of 
the solder side with 8JA = 2SoC/W. The measured 
thermal resistance at this location was unfavorable be- 


cause of the reduced CPU board area surrounding the 
CPU (due to the two slots) and the reduced convection 
cooling on the bottom side of the board. The best CPU 
thermal location was on the component side at one end 
of the test board with the most PCB area surrounding 
the package with 8JA = 22°C/W. This shows how 
CPU location and system layout can impact the overall 
thermal performance of a notebook. 


The 8JA numbers should only be used as a first order 
estimate in preliminary notebook des':gns. Since the lo- 
cation of a CPU inside a notebook impacts thermal 
performance, TJ should always be verified in the fmal 
design by 8JC and the CPU case temperature. 


As notebooks evolve into smaller form factors with 
higher component density and smaller PCB sizes, the 
increasing power density inside the notebook has a 
large effect on CPU temperature. The power dissipated 
by components other than the CPU, and the layout of 


• 


the components, 
as well as the thermal-mechanical 
characteristics of the enclosure must be taken into ac- 
count in order to ensure a CPU junction temperature 
within specifications. In one model, such effects are ap- 
proximated by the introduction of the factors R and Ph 
to the thermal impedence equations in Section 2.2: 


R is defined as the thermal coupling factor between the 
CPU and the other components on the PCB. This fac- 
tor takes into account the effects of the power dissipa- 
tion of the other components on the CPU case and 
junction temperatures. Ph is defined as PCB power dis- 
sipation. This Ph value is the power dissipated by all 
components (except the CPU) inside the notebook. 


A detailed discussion of how R and Ph are used to 
calculate Thermal Headroom (thermal margin) for a 
Test Notebook is given in Section 5.3. Figure 4-2 gives 
a graphical representation of the effects of PCB power 
on the (J,A requirements. The horizontal line represents 
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the current 
method of a fixed 8JA requirement 
of 
23°C/W for this package, over the entire range of PCB 
power dissipated, Ph. The second line represents the 
model used with the factors Ph and R which takes into 
account the temperature rise inside the notebook and is 
obtained by substituting 
the example values TJ = 
100°C, TA = 30°C, PcPu = 2W, and R = 3.9 into 
the preceding equation. This line shows as the board 
power (Ph) increases, the thermal margin inside the 
notebook becomes smaller due to the higher tempera- 
ture environment. In both models, 8JA must fall below 
the line to ensure a junction temperature below specifi- 
cation for the given conditions. For the Test Notebook 
#4 the cross-over point between the two lines is 6W. 
Beyond this point, the package thermal resistance of 
23°C/W is too high and thermal enhancements are nec- 
essary to reduce that resistance. In summary, this ex- 
ample shows that for the current method, a 8JA value 
obtained from the test chamber leaves margin for Ph 
less than 6W. However, for Ph greater than 6W, the 
junction temperature will be exceeded. Again, it is em- 
phasized that the designer should always perform a 
thorough system thermal analysis to ensure the speci- 
fied TCASE(max) is not exceeded. 
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5.0 
CALCULATING 
THERMAL 
HEADROOM 


Thermal Headroom is the temperature margin between 
the calculated TA(max) and the TA measured outside a 
given system, with a particular CPU and power. This 
section shows how to calculate Thermal Headroom and 
use it as simple model for a system's thermal properties. 
Then a term accounting for board power is added to the 
model, and the experimental measurements needed to 
implement 
this more sophisticated 
version are de- 
scribed. Finally, the significance of two secondary ef- 
fects is analyzed. 


Figures A-I and A-2 (in Appendix A) plot the calculat- 
ed TA(max) vs the power being used by the CPU and 
are intended to facilitate quick determination of ther- 
mal headroom. The lines on the graphs indicate the 
(estimated) 
maximum 
allowed ambient 
temperature 
(TA in degrees Celsius) as a function of power dissipat- 
ed (P in Watts). Maintaining TA below or equal to 
TA(max) indicates that the required TC<max) is proba- 
bly not exceeded. The graph lines are generated from 
the formula: 


9CA is the thermal resistance to heat flow between the 
CPU case and the ambient environment, as specified in 
the Intel Packaging Handbook. As discussed in Section 
4, experiments show that these parameters are approxi- 
mately the same for a notebook PC with TA measured 
in open air outside the notebook case. The tendency of 
the notebook case to increase 9CA by adding extra lay- 
ers of insulation is approximately offset by its action as 
a heat spreader, since it is thermally connected to the 
CPU board. Different types of CPU packages have dif- 
ferent 9CA values, and thus generate different lines on 
the graphs. For example, Figure A-I shows the SQFP, 
PGA and PQFP packages for the SL Enhanced In- 
tel486 SX CPU and Figure A-2 shows the SQFP and 
PGA packages for the SL Enhanced Intel486 DX2 
CPU. 


To determine thermal headroom for a given CPU, first 
determine the correct line for the CPU type. (Some of 
the CPUs are marked on the graphs. For a CPU type 
that is not, find its 9CA in Table A-I, and match it to a 
CPU type that is marked on a graph line. Or use 9CA 
as the slope, and TC = 85·C as the temperature axis 
intercept to match directly to a graph line.) 


Second, determine the power at which the CPU will 
operate. Various average power use scenarios could be 
appropriate for a given design; four of them are detailed 
in Table A-2 for each different CPU (discussed in detail 


in Section 3.3). One can also calculate a custom power 
usage profile for one's system using P = Icc 
X Vco 
and Table A-I, which gives ICC under 3 different con- 
ditions (Active, Stop Grant and Stop Clock). 


Third, draw an ordinate (vertical line) at the power 
value (determined above) to intersect the appropriate 
line for the CPU package type chosen. Draw an abscis- 
sa (horizontal line) from the intercept to the TA axis, 
obtaining the maximum recommended ambient temper- 
ature for this system. If this TA(max)is greater than 
what is measured in the air outside the actual system, 
the design has a positive thermal headroom of TA(max) 
- 
TA(measured), as long as the effect of "board pow- 
er", Pb, is neglected. If however the actual system is 
exceeding this TA(max), the thermal headroom is nega- 
tive even before considering Pb, and thus the thermal 
• 
properties of the design will need improvement ("ther- 
mal mitigation"), (See Section 4 for Pb definition, and 
Section 5.3 for more information about Thermal Head- 
room). 


There are numerous techniques for thermal mitigation, 
or improving the thermal properties of a design (i.e., a 
lower voltage version of the CPU or a CPU package 
with a lower 9CA could be used). Depending on pack- 
age size and material, various 9CA values can be ob- 
tained. Various passive and active thermal management 
strategies are discussed in Section 6. 


5.2 
Example of Graph Use 


Consider the 5V PQFP SL Enhanced Intel486 SX-33 
CPU, which is likely to have special thermal needs be- 
cause of its power requirements. The graph line for it is 
indicated by the label (33 MHz PQFP 
5V) on the 
Intel486 SX CPU graph, or by the fact that its slope 
value from Table A-I is 17.0 ·CIW. The power shown 
for Figure A-I is for Case I: VCC = 
5.0V; ICC = 
0.685 mA (from Table A-I); Active Max for 100% of 
the time; P = 3.43W (from Table A-2, or calculation). 


The ordinate is drawn from the P axis at 3.43W to 
intersect the intermediate slope line, and the abscissa 
from that point intersects the Temp. axis at about 27·C. 
If we assume 4O"Cis the lowest TA that can be readily 
achieved, we get a negative thermal headroom of B·C. 
Designing a portable PC with this CPU clearly will 
require thermal mitigation. 


5.3 
Adjusting Thermal 
Headroom 
for 


Board Power 


Experiments have shown that the term R X Pb pro- 
vides a good way to model the effects on the CPU TC 
due to other heat sources inside a notebook. (Here R is 
an experimentally determined thermal coupling coeffi- 


cient, and Ph is the board power, as defined in Section 
4.). The term adds to Tc, or reduces the TA(max) that 
is required to ensure that Tc does not exceed Tc(max): 


To calculate thermal headroom adjusted for the effect 
of Ph, subtract R X Ph from the thermal headroom 
calculated as above. This of course makes the head- 
room smaller (worse), but by how much? This depends 
on the size of Ph, but also on R, which is highly depen- 
dent on the particular design. In theory, the smallest 
value possible for R is zero: no thermal coupling be- 
tween the CPU and other heat sources inside the note- 
book. From testing one system, the range measured ex- 
perimentally (with no effort made to thermally isolate 
the CPU) has been 3.9 to 4.9. Values closer to zero are 
obtained by positioning the other high-power devices 
away from the CPU, and thermally grounding them to 
the outer case. (Section 5.4 describes how to measure R 
for a given system.) 


5.4 
Experimental 
Measurements 
are 
Essential 


The 8CA values given by Intel can be used as a rough 
"rule of thumb" to estimate likely thermal margins. If 
the thermal headroom calculated from the graphs is 
positive for a given design even after correcting for the 
board power (R = 4 would be conservative for a real 
notebook design), the design is most likely satisfactory. 
But even then, Intel recommends that the CPU TC be 
measured when the complete design can be run at full 
power;with TA at the maximum allowed by the design- 
er's specifications, to be really sure that Tc(max) will 
never be exceeded. If a conservative calculation of ther- 
mal headroom (including R X Ph) is negative, it is 
essential that the system be tested, and improvements 
in thermal mitigation be made until Tc(max) is never 
exceeded. 


There are several levels of thermal experiments that can 
be used. The simplest is to just measure the CPU TCo 
and make adjustments in the design until it never ex- 
ceeds Tc(max). Then the equations and graphs can be 
ignored; if the design meets the Tc(max) specification, 
it does not matter if the (estimated) TA(max) is violat- 
ed, as far as Intel's CPU is concerned. (Consideration 
should be given, however, to other components, such as 
a disk drive, that might have trouble due to high tem- 
peratures inside the notebook.) 


Suppose, however, the design is expected to be used for 
several variations over time, i.e., an Intel486 DX CPU 
now, and an Intel486 DX2 CPU later, with perhaps 
higher power peripherals which can also increase Ph in 
later versions. These later versions with more power 
will likely require more thermal mitigation efforts, but 
simply measuring TC in the first version of the note- 
book will give little guidance about how much more 


thermal mitigation will be needed later. In this case, 
more detailed experiments on the first version, in order 
to build an accurate thermal model of the product line, 
can be very cost effective. This can be done using the 
equation from Section 5.3 and solving for Tc: 


In the preceding equation, TA is the actual air tempera- 
ture outside the notebook during the experiment; Pcpu 
is held fixed, and Ph is varied while the resulting Tc is 
measured. (TA + 8CA X PCPU) is the intercept of the 
resulting straight line, and R is the slope. The easiest 
way to measure R is to disconnect the CPU (so PcPU 
= 0) and measure TC with VCCat the upper and lower 
limits of its range (i.e., 4.5V and 5.5V). Ph is obtained 
for each Vcc value by Vcc X ICC' A third data point 
requires no measurement; when Ph = 0, TC = TA' 


When this semi-empirical model has been constructed 
for a given notebook design, it can be used to accurately 
determine thermal 
headroom 
for variations in both 
PcPU (plugging in a higher frequency CPU) and Ph 
(adding higher power peripherals). Of course, if the 
thermal mechanical design is subsequently modified, R 
should be measured again for the new version. 


If the designers expect the Ph to roughly track Pcpu, 
and R is small (as it should be in a good design), an 
approximation to the above model may make measure- 
ments easier: Assume Ph = C X PcPU, where C is the 
coefficient relating board power to CPU power. The 
preceding equation for TC then becomes: 


TC = TA + 8CA X Pcpu + R X Pb 


= TA + 8CA X Pcpu + R x C x Pcpu 


= TA + (8CA + R x C) Pcpu 


Then (8CA + R X C) becomes a new coefficient, say 
8CA" which can be measured by varying Pcpu and Ph 
together by varying Vcc over its functional range. 


Two kinds of secondary effects will be evaluated. The 
main model used in thermal analysis assumes a linear 
relationship between the temperature gradient and the 
rate of heat transfer, and also assumes a steady state 
(time independent) model. How valid are these assump- 
tions? 


Heat transfer by conduction is governed by a linear 
relationship between the temperature difference and the 
rate of heat transfer, and heat transfer by convection 
can be approximated by a linear relationship, as de- 
scribed in Section 2.J. However, heat transfer by radia- 
tion is proportional to the fourth power of the absolute 
temperatures 
involved. To demonstrate the contribu- 
tion that heat transfer by radiation makes to cooling the 
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CPU, consider the largest allowed Tc (8S°C) and the 
smallest TA that most designs would find acceptable 
(4O"C). One of the larger CPU packages is 4.4 cm 
square. Assume the largest emissivity (let E = 1). The 
temperatures must be converted to degrees Kelvin by 
adding 273°. Using the formula from Section 2.1, q, the 
radiative heat transfer in Watts is: 


q = ECTA (Tc4 - TA4) = 1 • (5.67 10-8 W/(m2K4» 


(0.044m)2«358 K)4 
- 
(313 K)4) = 0.75W 


To determine the significance of heat transfer by radia- 
tion in this case, one compares the O.7SWjust calculat- 
ed to the total heat transfer predicted by the linear ap- 
proximation using the experimentally determined 9CA' 
By rearranging the equation 


TC -TA 
9CA = --- 
for P, one obtains 
P 


P = Tc-TA. 
9CA 


Tc-TA 
= 
4SoC in this 
case, and 
9CA ranges 


IS.5"CIW to 32.O"CIW 
for the CPUs covered in this 


article. These figures give a total heat power dissipation 
ranging from 
1.41W to 2.90W. Thus, the radiative 


component varies between one fourth to one half of the 
total. This explains why the addition of black paint on a 
notebook 
case 
improved 
9JA 
measurably 
(see 


Section 6). 


The experimental determination of 9CA effectively in- 
corporates 
the radiative component, along with the 


conductive and convective components, in a combined 
linear approximation, which will be accurate for tem- 


peratures near the values used for the measurement. 
The effect of the radiative, nonlinear component will be 
beneficial in that the actual power radiated away from 
the CPU, when temperatures 
are higher than those 


used in the 9CA measurements, will be greater than 
predicted by the linear model. This means that TC will 
not increase as much as predicted by the model, for a 
given increase in power. 


The time independent assumption is fme if one is con- 
tent to build a system to tolerate Case I maximum pow- 
er, and with only passive thermal management (i.e., 
heat sinks and heat spreaders). However, if one assumes 
some power averaging over a typical mix of software, as 
in Cases 2, 3 and 4, one must consider the time depen- 
dent effects of bursts of maximum power, alternating 
with lower power periods. Also, if the design uses ac- 
tive thermal management, especially in a closed loop 
design with the system responding to a temperature 
sensor, the lag time between temperature sensing and 
response must be considered. 


The experimental measurements taken on an Intel386 
SL CPU notebook show how TCASEvaries with time at 
different CPU power levels, and allow an approximate 
determination 
of thermal time constants (see Figure 


S-l). The time constant (approximately 3 minutes for 
this notebook) is defined here as the time elapsed from 
when power was switched from Full On to Standby, to 
when the temperature has declined toward its Standby 
asymptote by lie. This means that it is reasonable to 
average the CPU power over approximately one minute 
when calculating average power generated by a mix of 
software. This is a large interval in CPU cycles (billions 
of CPU clocks). 
• 


This time constant also indicates a significant lag be- 
tween a temperature sensor reaching some action value 
(the action could be turning off the CPU clock for an 
interval), and a temperature response to that action. 
This means that the setpoint temperature (the tempera- 
ture that triggers power reduction) should be somewhat 
below Tc(max). Note also that the temperature does 
not come to equilibrium until about an hour after a 
major power change. This shows that when testing Tc 
to assure that it does not exceed Tc(max), one should 
run the notebook under worst case conditions continu- 
ally for at least an hour. 


6.0 
IMPROVING 
THERMAL 
HEADROOM 


By using the thermal management theories that have 
been reviewed here and keeping in mind the notebook 
platform limitation, the designer can apply proven ther- 
mal management techniques in several areas, including 
increasing thermal conduction and convection, opti- 
mum system layout, and power management 
tech- 
niques. 


6.1 
Improving Thermal Convection, 
Conduction 
and Radiation 


The most obvious method for improving thermal con- 
vection is by adding a fan to circulate the air. Unfortu- 
nately, fans are a compromise in mobile designs because 
of extra power and space requirements, and electrome- 
chanical noise. Intel's Thermal/Mechanical 
Tools and 
Analysis Group has collected data using more realistic 
techniques for reducing TCASE. 
Data was collected 
from an actual Inte1386 SL microprocessor notebook 
computer modified to measure TCASE. 
The case tem- 
perature was measured by applying 3W to the CPU 
with all other devices/components off. 


The inside of the notebook case was painted Flat black 
using a paint that is highly absorptive in the infra-red as 
well as visible. Figure 6-1 shows that TCASE was im- 
proved by 2.3°C, or 3%. Table 6-1 shows that heat 
transfer 
improvement 
by radiation 
reduces 9JA by 
O.5°CIW. The remaining experiments were performed 
with the inside of the notebook case painted black. 


A copper foil (1" x 3" x 1.5 mil) was attached from the 
CPU to the bottom of the keyboard (constructed of 
aluminum material) and then to the plastic case using 
thermal grease. Figure 6-1 shows a 4.8°C (6%) and a 
2.0°C (3%) improvement when the foil is connected 
from the CPU to the underside of keyboard and from 
the CPU to the case, respectively. After connecting the 
CPU to the bottom of the keyboard, 9JA improved be- 
cause of the higher thermal dispersion by the aluminum 
plate. A thicker copper foil (I" x 3" x 10 mil) was then 
connected from the CPU to the keyboard bottom which 
yielded an improvement of 11.4°C or 15%. 


6.1.3 
PER FLUOROCARBON 
FLUID 
AND 
SILICONE 
ELASTOMERS 


Liquid heat sinks containing perfluorocarbon fluid can 
offer a reasonable substitute for standard heat sinks. 
The heat transfer coefficient for natural convection in a 
perfluorocarbon fluid is greater than that of natural air 
convection. Measurements were taken using a perfluo- 
rocarbon liquid heat sink connected between the CPU 
and the plastic case. Figure 6-2 shows a 7.5"C (10%) 
improvement in temperature. 


Black 
1.5 mil 
1.5 mil 
10 mil 


Paint 
Copper 
Foil 
Copper 
Foil 
Copper 
Foil 
toCaae 
to Keyboard 
to Keyboard 


fJ.9JA (OC/W) 
0.5 
0.9 
1.8 
3.9 


A type of heat sink that helps blanket uneven surfaces 
is the silicone elastomers heat sink. These soft materials 
fill air gaps between hot components and the metal 
chassis. A piece of silicone elastomer was cut out to the 
same size as the CPU and placed between the CPU and 
the case, yielding a ISC 
(2%) improvement (see Fig- 
ure 6-2). 


Power must be optimally distributed to ensure the low- 
est TA. The objective is to reduce power density within 
the system to avoid hot spots at any particular device. 
Keeping the CPU away from batteries and power sup- 
plies is one challenge to the system designer. Thermally 


connecting the CPU case to the PC case can increase 
thermal area thus greatly improving thermal spreading. 
There are many opportunities for creativity in transfer- 
ring heat away from the CPU. 


6.3 
Effective 
System Power 
Management 


One technique which has become a standard in note- 
book designs is using Intel's System Management Mode 
(SMM) to effectively monitor system activity and shut 
off devices to slow or control clocks when low activity 
is detected. Another approach could be to monitor the 
CPU activity or temperature and slow or stop the CPU 
clock when a long period of system inactivity or a high 
temperature is detected. 


• 


System notebook designers need to pay close attention 
to the special thermal requirements for future notebook 
designs. These include demands for increasing comput- 
ing power, and thus power consumption, in decreasing 
sizes. With proper thermal design, the CPU can be kept 
below its stated maximum case temperature, even un- 
der worst case conditions. 


Experiments have shown that 9CA as measured in open 
air in factory tests can be used for a rough estimate for 
an actual notebook PC design, where TA is the air tem- 
perature outside the notebook. Using this 9CA to esti- 
mate thermal headroom provides the simplest model. 
Experiments have also shown that adding a term to the 
thermal equation that includes the board power (Pb), 
with a coupling coefficient (R), provides a better model. 
This enhanced model takes into account the effect of 
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other heat sources inside the notebook on the CPU Tc> 
and allows accurate prediction of the thermal effects of 
upgrades in a notebook design (adding a higher power 
CPU and/or peripherals on the board). Also, a number 
of suggestions have been made about how to improve 
the thermal characteristics of a design, by various pas- 
sive and active techniques. 


The information within this application note allows the 
designer to use initially a simple analytical model, as 
well as to build a semi-empirical model to correlate 
with the actual design. Techniques have been shown 
not only to increase the average thermal margin, but to 
eliminate thermal margin problems under worst case 
operating conditions. By designing for the worst case, 
and taking actual measurements to guarantee proper 
operation within spec limits, the notebook designer can 
provide the highest quality, most reliable products for 
their customers. 
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ICC 
Icc 
Icc 


ClK 
CPU Vcc 
Freq 
Pkg 
TCASE 
Vcc 
Active 
Stop 
Stop 
8JC 8JA 
8CA 


tol 
Grant 
Clock 
(calc) 


Typ 
Max 
Typ 
Max 
Typ 
Max 


1X 
SX 
3.3 
25 
SQFP 
85 
±0.30 
0.250 
0.315 
0.020 
0.040 
0.0001 
0.001 
4.0 
36.0 
32.0 
3.3 
33 
SQFP 
85 
±0.30 
0.300 
0.385 
0.025 
0.050 
0.0001 
0.001 
4.0 
36.0 
32.0 
5.0 
25 
PGA 
85 
±0.25 
0.430 
0.560 
0.035 
0.065 
0.0002 
0.002 
1.5 
17.0 
15.5 
5.0 
25 
PQFP 
85 
±0.25 
0.430 
0.560 
0.035 
0.065 
0.0002 
0.002 
3.5 
20.5 
17.0 
5.0 
33 
PGA 
85 
±0.25 
0.590 
0.685 
0.040 
0.080 
0.0002 
0.002 
1.5 
17.0 
15.5 
5.0 
33 
PQFP 
85 
±0.25 
0.590 0.685 0.040 0.080 0.0002 0.002 
3.5 20.5 
17.0 


1X 
OX 
3.3 
33 
SQFP 
85 
±0.30 
0.330 
0.415 
0.025 
0.050 
0.0001 
0.001 
3.5 
25.0 
21.5 
5.0 
33 
PGA 
85 
+0.25 
0.500 
0.630 
0.040 
0.080 
0.0002 
0.002 
1.5 
17.0 
15.5 
5.0 
33 
PQFP 
85 
±0.25 
0.500 
0.630 
0.040 
0.080 
0.0002 
0.002 
3.5 
20.5 
17.0 
5.0 
50 
PGA 
85 
±0.25 
0.775 
0.950 
0.050 
0.100 
0.0002 
0.002 
1.5 
17.0 
15.5 


1X 
OX2 
3.3 
40 
SQFP 
85 
±0.30 
0.375 
0.450 
0.020 
0.040 
0.0001 
0.001 
3.5 
24.0 
20.5 
3.3 
50 
SQFP 
85 
±0.30 
0.460 
0.550 
0.035 
0.065 
0.0001 
0.001 
3.5 
24.0 
20.5 
5.0 
50 
PGA 
85 
±0.25 
0.775 
0.950 
0.023 
0.050 
0.0002 
0.002 
1.5 
17.0 
15.5 
5.0 
66 
PGA 
85 
±0.25 
0.975 
1.200 
0.045 
0.090 
0.0002 
0.002 
1.5 
17.0 
15.5 


2X 
SX 
3.3 
25 
SQFP 
85 
±0.30 
0.250 
0.315 
0.0001 
0.001 
4.0 
36.0 
32.0 
3.3 
33 
SQFP 
85 
±0.30 
0.330 
0.415 
N/A 
N/A 
0.0001 
0.001 
4.0 
36.0 
32.0 
5.0 
25 
PQFP 
85 
±0.25 
0.430 
0.560 
0.0002 
0.002 
3.5 
20.5 
17.0 
5.0 
33 
PQFP 
85 
±0.25 
0.590 
0.685 
0.0002 
0.002 
3.5 
20.5 
17.0 


2X 
OX 
3.3 
33 
SQFP 
85 
±0.30 
0.330 
0.415 
N/A 
N/A 
0.0001 
0.001 
3.5 
25.0 
21.5 
5.0 
33 
PQFP 
85 
±0.25 
0.500 
0.630 
0.0002 
0.002 
3.5 
20.5 
17.0 


• 


Case 1 
Case 2 
Case 3 
Case 4 


ClK 
CPU 
Vcc 
Freq 
Pkg 
Tc 
Vcc 
Icc 
Power 
Thermal 
Icc 
Power 
Thermal 
Icc 
Power 
Thermal 
Icc 
Power 
Thermal 
tol 
AVG 
AVG 
Headrm 
AVG 
AVG 
Headrm 
AVG 
AVG 
Headrm 
AVG 
AVG 
Headrm 


1X 
SX 
3.3 
25 
SQFP 
85 
±0.30 
0.32 
1.04 
11.7 
0.25 
0.83 
18.6 
0.26 
0.85 
17.9 
0.23 
0.76 
20.5 
3.3 
33 
SQFP 
85 
±0.30 
0.39 
1.27 
4.3 
0.30 
0.99 
13.3 
0.31 
1.02 
12.4 
0.28 
0.92 
15.6 
5.0 
25 
PGA 
85 
±0.25 
0.56 
2.80 
1.6 
0.43 
2.15 
11.7 
0.44 
2.22 
10.7 
0.40 
2.00 
14.0 
5.0 
25 
PQFP 
85 
±0.25 
0.56 
2.80 
(2.6) 
0.43 
2.15 
8.5 
0.44 
2.22 
7.3 
0.40 
2.00 
11.0 
5.0 
33 
PGA 
85 
±0.25 
0.69 
3.43 
(8.1) 
0.59 
2.95 
(0.7) 
0.60 
3.00 
(1.5) 
0.54 
2.70 
3.1 
5.0 
33 
PQFP 
85 
±0.25 
0.69 
3.43 
(13.2) 
0.59 
2.95 
(5.2) 
0.60 
3.00 
(6.0) 
0.54 
2.70 
(1.0) 


1X 
OX 
3.3 
33 
SQFP 
85 
±0.30 
0.42 
1.37 
15.6 
0.33 
1.09 
21.6 
0.34 
1.12 
21.0 
0.31 
1.01 
23.3 
5.0 
33 
PGA 
85 
±0.25 
0.63 
3.15 
(3.8) 
0.50 
2.50 
6.3 
0.51 
2.57 
5.2 
0.46 
2.32 
9.1 
5.0 
33 
PQFP 
85 
±0.25 
0.63 
3.15 
(8.6) 
0.50 
2.50 
2.5 
0.51 
2.57 
1.4 
0.46 
2.32 
5.6 
5.0 
50 
PGA 
85 
±0.25 
0.95 
4.75 
(28.6) 
0.78 
3.88 
(15.1) 
0.79 
3.96 
(16.4) 
0.72 
3.58 
(10.4) 


1X 
OX2 
3.3 
40 
SQFP 
85 
±0.30 
0.45 
1.49 
14.6 
0.38 
1.24 
19.6 
0.38 
1.26 
19.1 
0.35 
1.14 
21.7 
3.3 
50 
SQFP 
85 
±0.30 
0.55 
1.82 
7.8 
0.46 
1.52 
13.9 
0.47 
1.55 
13.3 
0.42 
1.40 
16.4 
5.0 
50 
PGA 
85 
±0.25 
0.95 
4.75 
(28.6) 
0.78 
-3.88 
(15.1) 
0.79 
3.96 
(16.4) 
0.72 
3.58 
(10.4) 
5.0 
66 
PGA 
85 
±0.25 
1.20 
6.00 
(48.0) 
0.98 
4.88 
(30.6) 
1.00 
4.99 
(32.3) 
0.90 
4.50 
(24.8) 


2X 
SX 
3.3 
25 
SQFP 
85 
±0.30 
0.32 
1.04 
11.7 
0.25 
0.83 
18.6 
0.26 
0.85 
17.9 
0.23 
0.76 
20.5 
3.3 
33 
SQFP 
85 
±0.30 
0.42 
1.37 
1.2 
0.33 
1.09 
10.2 
0.34 
1.12 
9.3 
0.31 
1.01 
12.7 
5.0 
25 
PQFP 
85 
±0.25 
0.56 
2.80 
(2.6) 
0.43 
2.15 
8.5 
0.44 
2.22 
7.3 
0.40 
2.00 
11.0 
5.0 
33 
PQFP 
85 
±0.25 
0.69 
3.43 
(13.2) 
0.59 
2.95 
(5.2) 
0.60 
3.00 
(6.0) 
0.54 
2.70 
(1.0) 


2X 
OX 
3.3 
33 
SQFP 
85 
±0.30 
0.42 
1.37 
15.6 
0.33 
1.09 
21.6 
0.34 
1.12 
21.0 
0.31 
1.01 
23.3 
5.0 
33 
PQFP 
85 
±0.25 
0.63 
3.15 
(8.6) 
0.50 
2.54 
2.5 
0.51 
2.57 
1.4 
0.46 
2.32 
5.6 


NOTES: 
Casetemperaturespecificationsassumea heatspreaderand no heatsink. 
CASE 1: 
Ice Active (max) 
= 100% 


CASE 2: 
Ice Active (typ) 
= 100% 
CASE 3: 
Ice Active (max) 
10% 


Ice Active (typ) 
90% 


CASE 4: 
Ice Active (max) 
10% 
Ice Active (typ) 
80% 
Ice Stop Clock(max) = 
10% 
--€: 


90 


60 


70 


60 


50 


T.mblent (·C) 


40 


30 


20 


10 
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THERMAL 
ENHANCEMENT 
VENDORS 


Chomerics, Inc. 
77 Dragon Court 
Woburn, MA 01888-4014 
(617) 935-4850 
FAX: (617) 933-4318 
Product ID: CHO·THERM 
A274 
(Silicon Elastomer) 


3M Corporation 
Building 223-6S·04 
3M Center 
St. Paul, MN 55144-1000 
(612) 733-3735 or (800) 833-5045 
Product ID: Fluorinert Liquid FC· 77 
(Perfluorocarbon Fluid) 
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• 
Powerful Performance Boosters for 
Intel486TMMicroprocessor-Based 
Systems 
- 
Improve Overall System 
Performance by up to 70% 
- 
Increase Both Integer and Floating- 
Point Performance 
- 
Provides Next Level of CPU 
Performance 


• 
Intel486 DX2 OverDrive Processors 
Upgrade Systems Based on 
-lntel486TM 
SX CPUs 
-lntel486TM 
DX CPUs 


• 
Binary Compatible with Large Installed 
Software Base 
-MS-DOS·, 
OS/2*, Windows* 
- 
UNIX* System V/386 
-iRMX®, 
IRMK Kernals 


Intel OverOrive 
processors 
are powerful, 
single-chip 
upgrades 
for Intel486 
microprocessor-based 
systems. 
The Intel486 
OX2 OverOrive 
processor 
upgrades 
Intel486 SX and OX CPU-based 
systems with Intel486 
OX2's 
"speed 
doubling" 
technology. 
OverOrive 
processors 
accelerate 
integer 
and math performance, 
delivering 
an 
overall 
performance 
boost of up to 70 percent. 
Users see a performance 
boost for all 005, Windows, 
OS/2 
and UNIX applications 
from AutoCAO' 
to WordPerfect'. 
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intel~ 


This 
data 
sheet 
describes 
the 
Intel486 
DX2 


OverDrive processor. The Intel486 DX2 OverDrive 
processor is designed as an end-user upgrade for 
Intel486 SX and Intel486 DX CPU·based systems. 
This data sheet is intended to be used with the data 
sheet for the 
original CPU in the 
system-the 


Intel486 SX or Intel486 DX CPU-which 
describes 


the Intel486 Family Architecture and functionality. All 
enhancements 
or 
differences 
in 
the 
Intel486 


DX2 OverDrive processor from the Intel486 SX or 
Intel486 DX CPU are described in this data sheet. 
Intel486 SX or Intel486 DX CPU-based systems that 
are compatible to the Intel OverDrive processor 
must be designed to both the original CPU specifica- 
tions and the Intel OverDrive processor specifica- 
tions. 


The Intel486 DX2 OverDrive processor is based 
on Intel486 DX2 Microprocessor technology. Intel 


A)B 


• 


B)El 


Intel 
Intel486 
OX2 


OverDrive 
~ 
OverDrive 
Processor 
Processor 


Socket 


Intel 
Intel486 
OX2 


OverDrive 
~ 
OverDrive 
Processor 
Processor 


Socket 


Intel486 
OX2 


i486 
OX 
OverDrive 
Processor 


Intel 


Intel 
PentiumlN 


OverDrive 
+- 
OverDrive 


Processor 
Processor 


Socket 


OverDrive processor are designed as a single-chip, 
powerful performance booster for Intel486 Micro- 
processor-based systems. 


The Intel486 DX2 OverDrive processor is currently 
available in two basic product variants. The first 
product variant (ODP), shown in Figures 1A and 1B, 
is referred to 
as the 
OverDrive processor 
for 


Intel486 SX and DX CPU-based systems that have 
an OverDrive processor socket. This product, ODP, 
is described specifically in Section 1.2. 


The second i486 DX2 OverDrive processor variant 
(ODPR), shown in Figure 1C, is referred to as the 
OverDrive processor for Intel486 DX CPU-based 
systems that do not have an OverDrive processor 
socket. This product is designed to replace the 
Intel486 DX CPU in systems that do not have an 
"End-User Easy" OverDrive processor socket. Sec- 
tion 1.3 describes this product variant, ODPR. 


Description 
• For Intel486 SX CPU-based Systems 
• Based on Intel486 DX2 Technology 
• Intel487™ Math CoProcessor Pinout 


• For Intel486 DX CPU-based Systems with an 


Intel OverDrive Processor Socket 


• Based on Intel486 DX2 Technology 
• Intel487 Math CoProcessor Pinout 


• For Intel486 DX CPU-based Systems without 


an Intel OverDrive Processor Socket 


• Based on Intel486 DX2 Technology 
• Intel486 DX CPU Pinout 


• For 486 DX2 CPU-based Systems 
• Based on Pentium™ CPUTechnology 
• Expected Availability 1994 


1.2 
Intel486 OX2 OverOrive™ 
Processor 
for Intel486™ 
SX and OX 
Microprocessor-Based 
Systems (with an OverOrive™ 
Processor 
Socket) 


• 
Powerful Performance Booster for 
• 
Utilizes Intel486TMDX2 Speed-Doubling 
Intel486™ SX and OX CPU-Based 
Technology 
Systems 
- 
CPU Core Runs at Twice the 
-Improves 
Overall System 
Frequency of the System Bus 
Performance by up to 70% 
- 
Compatible with 33, 25, 20, and 
-Increases 
Both Integer and Floating- 
16 MHz Systems 
Point Performance 
• 
Binary Compatible with Large Installed 
• 
169·Lead Pin Grid Array Package 
Software Base 
- 
Pin Compatible with Intel487™ SX 
- 
MS-DOS, OS/2, Windows 
Math CoProcessor 
- 
UNIX System V/386 
- 
"End-User Easy", Single-Chip 
- 
iRMX®, iRMK Kernals 
Upgrade 
• 
High Performance Design 
-169th 
Alignment Pin Ensures Proper 
-Core 
Clock Speed up to 66 MHz 
Chip Orientation 
-106 
Mbyte/sec Burst Bus 
• 
Math CoProcessor Included On-Chip 
- 
CHMOS V Process Technology 


• 
High Integration Enables On-Chip 
• 
Complete 32-Bit Architecture 
- 
8 Kbyte Code and Data Cache 
- 
Address and Data Busses 
- 
Floating Point Unit 
- 
Registers 
- 
Paged, Virtual Memory Management 
- 
8-, 16-, 32-Blt Data Types 


• 
Compatible with Intel SL Enhanced 
Features 


The Intel486 
DX2 OverDrive 
processor 
for Intel486 
SX and DX microprocessor-based 
personal 
computers 
is a 
powerful, 
single-chip 
upgrade 
that is intrinsically 
cost effective 
and that offers 
excellent 
price/performance. 


Based on Intel486 
DX2 technology, 
the Intel OverDrive 
processor 
integrates 
an integer 
unit, a floating 
point 
unit, a memory 
management 
unit, SL Enhanced 
features, 
and an 8 Kbyte cache 
on a single chip. 


Using the Intel486 
DX2's 
"speed 
doubling" 
technology, 
the Intel OverDrive 
processor 
operates 
internally 
at 
twice the speed of the system bus. This allows users of Intel486 SX and DX microprocessor-based 
systems 
to 
double 
the frequency 
of their computer's 
CPU by adding 
a single 
chip, without 
upgrading 
or modifying 
any 
other 
system 
components. 
For example, 
adding 
an Intel 486 
DX2 OverDrive 
processor 
to an Intel486 
DX 
33 MHz system 
will double 
the CPUs internal 
operating 
speed to 66 MHz. Depending 
on the application, 
the 
OverDrive 
processor 
can provide 
an overall 
system 
performance 
boost of up to 70%. 


The Intel486 
DX2 OverDrive 
processor 
(ODP) is designed 
to be installed 
in the OverDrive 
processor 
socket 
of 
Intel486 
microprocessor-based 
systems. 
In Intel486 SX systems, this is the same socket that was designed 
for 
the Intel487™ 
SX Math CoProcessor. 


The Intel486 DX2 OverDrive processor is available 
in 
three 
product 
versions. 
The 
20 
MHz 
Intel 
OverDrive processor is designed to upgrade both 
20 MHz and 16 MHz Intel486 SX Microprocessor- 
based systems. This product utilizes the standard 
169-lead PGA package. The 25 MHz and 33 MHz 


Intel OverDrive processor is designed to upgrade 
25 MHz and 33 MHz Intel486 SX or OXMicroproces- 
sor-based systems. This product has a heat sink at- 
tached to the standard 169-lead PGA package to aid 
in the heat dissipation in 25 MHz and 33 MHz sys- 
tems. 
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TOP SIDE VIEW 
169 Lead PGA 
(OOP) 
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Address 
Data 
Control 
N/C(1) 
Vcc 
Vss 


A2 
014 
00 
P1 
A20MIF 
015 
A10 
B7 
A7 
A3 
R15 
01 
N2 
A051F 
517 
A12 
B9 
A9 
A4 
516 
02 
N1 
AHOLO 
A17 
A14 
B11 
A11 
A5 
012 
03 
H2 
BEOIF 
K15 
B12 
C4 
B3 
A6 
515 
04 
M3 
BE1# 
J16 
B13 
C5 
B4 
A7 
013 
05 
J2 
BE21F 
J15 
C13 
E16 
B5 
A8 
R13 
06 
L2 
BE31F 
F17 
R17 
G16 
E1 
A9 
011 
07 
L3 
BLA5TIF 
R16 
54 
H16 
E17 
AlO 
513 
08 
F2 
BOFFIF 
017 
A3 
J1 
G1 
An 
R12 
09 
01 
BROYIF 
H15 
B16 
K16 
G17 
A12 
57 
010 
E3 
BREOIF 
015 
C11 
L16 
H1 
A13 
010 
011 
C1 
B581F 
016 
M2 
Hl7 
A14 
55 
012 
G3 
B5161F 
C17 
M16 
K1 
A15 
R7 
013 
02 
CLK 
C3 
P16 
K17 
A16 
09 
014 
K3 
O/CIF 
M15 
INC(2) 
R3 
L1 
A17 
03 
015 
F3 
OPO 
N3 
L17 
A18 
R5 
016 
J3 
OP1 
F1 
C14 
R6 
M1 
A19 
04 
017 
03 
OP2 
H3 
R8 
M17 
A20 
08 
018 
C2 
OP3 
A5 
R9 
P17 
A21 
05 
019 
B1 
EA051F 
B17 
PLUG 
R10 
02 
A22 
07 
020 
A1 
FERRIF 
A13 
R4 
A23 
53 
021 
B2 
FLU5HIF 
C15 
05 
R11 
56 
A24 
06 
022 
A2 
HLOA 
P15 
06 
R14 
58 
A25 
R2 
023 
A4 
HOLO 
E15 
012 
59 
A26 
52 
024 
A6 
IGNNEIF 
A15 
013 
510 
A27 
51 
025 
B6 
INTR 
A16 
014 
511 
A28 
R1 
026 
C7 
KEN IF 
F15 
E4 
512 
A29 
P2 
027 
C6 
LOCK IF 
N15 
E14 
514 
A30 
P3 
028 
C8 
M/IOIF 
N16 
F4 


A31 
01 
029 
A8 
NMI 
B15 
F14 


030 
C9 
PCO 
J17 
M4 


031 
B8 
PCHKIF 
017 
M14 


PWT 
L15 
N4 


PLOCK IF 
016 
N14 


ROYIF 
F16 
P4 


RE5ET 
C16 
P5 


5MIIF 
B10 
P6 


5MIACTIF 
C12 
P12 


5RE5ET 
C10 
P13 


5TPCLKIF 
G15 
P14 


UPIF 
B14 


W/RIF 
N17 
KEY 
04 


NOTES: 
1. All NC pins must remain 
unconnected. 


2. The INC pin is defined 
to be an internal 
no-connect. 
This means 
that the pin is not internally 
connected 
and may be used 
for the routing 
of external 
signals. 


• 


1.3 
Intel486 OX2 OverOrive™ 
Processor 
for PGA Intel486™ 
OX Microprocessor- 
Based Systems (without an OverOrive™ 
Processor 
Socket) 


• 
Powerful Performance Booster for PGA 
Intel486™ DX CPU-Based Systems 
-Improves 
Overall System 
Performance by up to 70% 
-Increases 
Both Integer and Floating- 
Point Performance 


• 
168-Lead Pin Grid Array Package 
- 
Pin Compatible with Intel486™ DX 
CPU 


• 
Math CoProcessor Included On-Chip 


• 
High Integration Enables On-Chip 
- 
8 Kbyte Code and Data Cache 
- 
Floating Point Unit 
- 
Paged, Virtual Memory Management 


• 
Compatible with SL Enhanced Features 


• 
Utilizes Intel486TMDX2 Speed-Doubling 
Technology 
- 
CPU Core Runs at Twice the 
Frequency of the System Bus 
- 
Compatible with 33 and 25 MHz 
Systems 


• 
Binary Compatible with Large Installed 
Software Base 
- 
MS-DOS, OS/2, Windows 
- 
UNIX System V/386 
-IRMX@, 
IRMK Kernals 


• 
High Performance Design 
- 
Core Clock Speed up to 66 MHz 
-106 
Mbyte/sec Burst Bus 
- 
CHMOS V Process Technology 


• 
Complete 32-Bit Architecture 
- 
Address and Data Busses 
-Registers 
- 
8-, 16-, 32-Blt Data Types 


The Intel486 
DX2 OverDrive 
processor 
for Intel486 
OX microprocessor-based 
personal 
computers 
is a power- 
ful, single-chip 
upgrade 
that is intrinsically 
cost effective 
and that offers 
excellent 
price/performance. 
Based 
on Intel486 
DX2 technology, 
the Intel OverDrive 
processor 
integrates 
an integer 
unit, a floating 
point unit, a 
memory 
management 
unit and an 8 Kbyte cache on a single chip. 


Using the Intel486 
DX2's 
"speed 
doubling" 
technology, 
the Intel OverDrive 
processor 
operates 
internally 
at 
twice the speed of the system bus. This allows users of Intel486 
OX microprocessor-based 
systems 
to double 
the frequency 
of their computer's 
CPU by replacing 
a single chip, without 
upgrading 
or modifying 
any other 
system 
components. 
For example, 
adding 
an Intel486 
DX2 OverDrive 
processor 
to an Intel486 
OX 33 MHz 
system will double the CPUs internal 
operating 
speed to 66 MHz. Depending 
on the application, 
the OverDrive 
processor 
can provide 
an overall 
system 
performance 
boost of up to 70%. 


The Intel486 
DX2 OverDrive 
processor, 
ODPR, is designed 
to be installed 
in the Intel486 
OX CPU socket 
after 
the Intel486 
OX CPU has been removed. 


Both 
the 25 MHz and 33 MHz versions 
of the OverDrive 
processor 
for Intel486 
OX CPU-based 
Systems 
without 
an OverDrive 
processor 
socket 
have a heat sink attached 
to the standard 
168-lead 
PGA package. 


HLOA 
VCC 
VSS 
a 
a 
a 


LOCK# 
~/IO# 
W/R# 
a 
a 
a 


o/c# 
VCC 
VSS 
a 
a 
a 


PWT 
VCC 
VSS 
a 
a 
a 


VSS 
06 
07 
a 
a 
a 


VSS 
VCC 
01. 
a 
a 
a 


VCC 
05 
016 
a 
a 
a 


vss 
03 
OP2 
a 
a 
a 


vss 
VCC 
012 
a 
a 
a 


BRDVN: 
vec 
vss 
a 
a 
a 


STPCLK# 
VCC 
VSS 
a 
a 
a 


Intel486™ 
DX2 OverDrive™ 
Processor 


PIN SIDE VIEW 


(168-LEAD 
PGA) 


(ODPR) 


KEN" 
ROY. 
aE3- 
a 
a 
a 


HOLO 
VCC 
VSS 
a 
a 
a 


vss 
vec 
DID 
a 
a 
a 


09 
013 
017 
a 
a 
a 


019 
021 
vss 
VSS 
vss 
025 
vec 
031 
vec 
SWI- 
vec 
Ne 
a 
a 
a 
a 
a 
a 
a 
a 
a 
a 
a 
a 


020 
022 
NC 
023 
OP3 
02. 
VSS 
029 
VSS 
NC 
VSS 
NC 
a 
a 
a 
a 
a 
a 
a 
a 
a 
a 
a 
a 


10 
11 
12 
13 
14 
15 
16 
17 
290436-22 


HC 
BLAST' 
A3 
VCC 
A8 
All 
VCC 
VCC 
VCC 
VCC 
A 15 
VCC 
A 18 
VSS 
VCC 
A25 
A28 
o 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


PCHK' 
PLOCK' 
BREQ 
A2 
A7 
A5 
A9 
A13 
AI6 
A20 
A22 
AH 
A21 
A19 
AI7 
VSS 
A31 


o 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 
0 


VSS 
VCC 
HLDA 
000 


W/R' 
M/IO' 
LOCK' 
000 


VSS 
VCC 
D/C' 
000 


VSS 
VCC 
PWT 


000 


VSS 
VCC 
BED' 
000 


PCD 
BE10 
BE2' 
000 


VSS 
VCC 
BRDY" 
000 


Intel486™ 
DX2 OverDrive™ 
Processor 


TOP SIDE VIEW 


(168-LEAD 
PGA) 


(ODPR) 


D7 
D6 
VSS 
000 
Dl. 
vcc 
vss 


000 


8E3- 
ROY. 
KEN# 
000 


VSS 
VCC 
HOLD 
000 


OP2 
03 
VSS 
000 


D12 
VCC 
VSS 
000 


D15 
D8 
DPI 
000 


DID 
VCC 
VSS 
000 


1 
290436-23 


Address 
Data 
Control 
N/C(1) 
Vcc 
Vss 


A2 
014 
00 
P1 
A20M# 
015 
A3 
87 
A7 
A3 
R15 
01 
N2 
A08# 
817 
A10 
89 
A9 
A4 
816 
02 
N1 
AHOLO 
A17 
A12 
811 
A11 
A5 
012 
03 
H2 
8EO# 
K15 
A14 
C4 
83 
A6 
815 
04 
M3 
8EH 
J16 
812 
C5 
84 
A7 
013 
05 
J2 
8E2# 
J15 
813 
E2 
85 
A8 
R13 
06 
L2 
8E3# 
F17 
814 
E16 
E1 
A9 
011 
07 
L3 
8LA8T# 
R16 
816 
G2 
E17 
A10 
813 
08 
F2 
80FF# 
017 
C11 
G16 
G1 
Al1 
R12 
09 
01 
8ROY# 
H15 
C13 
H16 
G17 
A12 
87 
010 
E3 
8REO# 
015 
R17 
J1 
H1 
A13 
010 
011 
C1 
888# 
016 
84 
K2 
H17 
A14 
85 
012 
G3 
8816# 
C17 
K16 
K1 
A15 
R7 
013 
02 
CLK 
C3 
L16 
K17 
A16 
09 
014 
K3 
O/C# 
M15 
INC(2) 
M2 
L1 
A17 
03 
015 
F3 
OPO 
N3 
M16 
L17 
A18 
R5 
016 
J3 
OP1 
F1 
A13 
P16 
M1 
A19 
04 
017 
03 
OP2 
H3 
R3 
M17 
A20 
08 
018 
C2 
OP3 
A5 
R6 
P17 
A21 
05 
019 
81 
EA08# 
817 
R8 
02 
A22 
07 
020 
A1 
FERR# 
C14 
R9 
R4 
A23 
83 
021 
82 
FLU8H# 
C15 
R10 
86 
A24 
06 
022 
A2 
HLOA 
P15 
R11 
88 
A25 
R2 
023 
A4 
HOLO 
E15 
R14 
89 
A26 
82 
024 
A6 
IGNNE# 
A15 
810 
A27 
81 
025 
86 
INTR 
A16 
811 
A28 
R1 
026 
C7 
KEN# 
F15 
812 
A29 
P2 
027 
C6 
LOCK # 
N15 
814 
A30 
P3 
028 
C8 
MIIO# 
N16 
A31 
01 
029 
A8 
NMI 
815 
030 
C9 
PCO 
J17 
031 
88 
PCHK# 
017 
PWT 
L15 
PLOCK# 
- 
016 
ROY# 
F16 
RE8ET 
C16 
8MI# 
810 
8MIACT# 
C12 
8RE8ET 
C10 
8TPCLK# 
G15 
W/R# 
N17 


NOTES: 
1. All NC pins must remain 
unconnected. 


2. The INC pin is defined 
to be internal 
no-connect. 
This means that the pin is not internally 
connected 
and may be used for 
the routing 
of external 
signals. 


• 


Symbol 
Type 
Name and Function 


CLK 
I 
Clock provides 
the fundamental 
timing for the bus interface 
unit and is multiplied 
by 
two (2x) to provide the internal frequency 
for the Intel OverDrive 
processor. 
All 
external 
timing parameters 
are specified 
with respect 
to the rising edge of CLK. 


ADDRESS 
BUS 


A31-A4 
1/0 
A31-A2 
are the address 
lines of the processor. 
A31-A2, 
together 
with the by1e 
A2-A3 
0 
enables 
BEO# -BE3#, 
define the physical 
area of memory or input/output 
space 
accessed. 
Address 
lines A31-A4 
are used to drive addresses 
into the processor 
to 
perform 
cache line invalidations. 
Input signals must meet setup and hold times t22 
and t23. A31-A2 
are not driven during bus or address 
hold. 


BEO-3# 
0 
The byte enable signals indicate 
active by1es during read and write cycles. 
During 
the first cycle of a cache fill, the external 
system should assume that all by1e 
enables 
are active. BE3# 
applies to D24-D31, 
BE2# 
applies to 016-D23, 
BE1 # 
applies to D8-D15 
and BEO# applies to DO-D7. 
BEO#-BE3# 
are active LOW and 
are not driven during bus hold. 


DATA 
BUS 


D31-DO 
I/O 
These are the data lines for the Intel OverDrive 
processor. 
Lines DO-D7 
define the 
least significant 
by1e of the data bus while lines D24-D31 
define the most 
significant 
by1e of the data bus. These signals must meet setup and hold times t22 
and t23 for proper operation 
on reads. These pins are driven during the second 
and 
subsequent 
clocks of write cycles. 


DATA 
PARITY 


DPO-DP3 
I/O 
There is one data parity pin for each by1e of the data bus. Data parity is generated 
on all write data cycles with the same timing as the data driven by the Intel 
OverDrive 
processor. 
Even parity information 
must be driven back into the 
microprocessor 
on the data parity pins with the same timing as read information 
to 
insure that the correct 
parity check status is indicated 
by the Intel OverDrive 
processor. 
The signals read on these pins do not affect program 
execution. 


Input signals must meet setup and hold times t22 and t23. DPO-DP3 
should be 
connected 
to Vcc through a pullup resistor 
in systems which do not use parity. 
DPO-DP3 
are active HIGH and are driven during the second and subsequent 
clocks 
of write cycles. 


PCHK# 
0 
Parity Status is driven on the PCHK # pin the clock after ready for read operations. 
The parity status is for data sampled 
at the end of the previous clock. A parity error 
is indicated 
by PCHK # being LOW. Parity status is only checked 
for enabled 
by1es 
as indicated 
by the by1e enable and bus size signals. PCHK # is valid only in the 
clock immediately 
after read data is returned 
to the microprocessor. 
At all other 
times PCHK# 
is inactive (HIGH). PCHK# 
is never floated. 


intel~ 


Symbol 
Type 
Name and Function 


BUS CYCLE DEFINITION 


M/IO# 
0 
The memory/input-output, 
data/contro/ 
and write/read 
lines are the primary bus 
D/C# 
0 
definition 
signals. These signals are driven valid as the ADS# 
signal is asserted. 


W/R# 
0 
M/IO# 
D/C# 
W/R# 
Bus Cycle Initiated 


0 
0 
0 
Interrupt Acknowledge 
0 
0 
1 
Halt/Special 
Cycle 
0 
1 
0 
I/O Read 
0 
1 
1 
I/O Write 
1 
0 
0 
Code Read 
1 
0 
1 
Reserved 
1 
1 
0 
Memory 
Read 
1 
1 
1 
Memory Write 


The bus definition 
signals are not driven during bus hold and follow the timing of the 
address 
bus. Refer to Section 
7.2.11 for a description 
of the special bus cycles. 


LOCK # 
0 
The bus lock pin indicates 
that the current bus cycle is locked. The Intel OverDrive 
processor 
will not allow a bus hold when LOCK # is asserted 
(but address 
holds are 
allowed). 
LOCK # goes active in the first clock of the first locked bus cycle and goes 
inactive after the last clock of the last locked bus cycle. The last locked cycle ends 
when ROY # is returned. 
LOCK # is active LOW and is not driven during bus hold. 
Locked read cycles will not be transformed 
into cache fill cycles if KEN # is returned 
active. 


PLOCK# 
0 
The pseudo-lock 
pin indicates 
that the current bus transaction 
requires 
more than 
one bus cycle to complete. 
Examples 
of such operations 
are floating 
point long 
reads and writes (64 bits), segment 
table descriptor 
reads (64 bits), in addition to 
cache line fills (128 bits). The Intel OverDrive 
processor 
will drive PLOCK # active 
until the addresses 
for the last bus cycle of the transaction 
have been driven 
regardless 
of whether 
ROY # or BRDY # have been returned. 
Normally 
PLOCK # and BLAST # are inverse of each other. However 
during the first 
bus cycle of a 64-bit floating 
point write, both PLOCK # and BLAST # will be 
asserted. 
PLOCK # is a function 
of the BS8 #, BS16 # and KEN # inputs. PLOCK # should be 
sampled 
only in the clock ROY # is returned. 
PLOCK # is active LOW and is not 
driven during bus hold. 


BUS CONTROL 


ADS# 
0 
The address status output indicates 
that a valid bus cycle definition 
and address are 
available 
on the cycle definition 
lines and address bus. ADS# 
is driven active in the 
same clock as the addresses 
are driven. ADS# 
is active LOW and is not driven 
during bus hold. 


RDY# 
I 
The non-burst 
ready input indicates 
that the current bus cycle is complete. 
RDY # 
indicates 
that the external 
system has presented 
valid data on the data pins in 
response 
to a read or that the external 
system has accepted 
data from the Intel 
OverDrive 
processor 
in response 
to a write. RDY # is ignored when the bus is idle 
and at the end of the first clock of the bus cycle. 
RDY # is active during address 
hold. Data can be returned 
to the processor 
while 
AHOLD is active. 
ROY # is active LOW, and is not provided with an internal 
pullup resistor. 
RDY # 
must satisfy setup and hold times t16 and t17 for proper chip operation. 


• 


Symbol 
Type 
Name and Function 


BURST CONTROL 


BRDY*" 
I 
The burst ready input performs 
the same function 
during a burst cycle that RDY *" 


performs 
during a non-burst 
cycle. BRDY *" indicates 
that the external 
system has 


presented 
valid data in response 
to a read or that the external 
system has accepted 


data in response 
to a write. BRDY *" is ignored when the bus is idle and at the end of 
the first clock in a bus cycle. 
BRDY *" is sampled 
in the second 
and subsequent 
clocks of a burst cycle. The data 


presented 
on the data bus will be strobed 
into the microprocessor 
when BRDY *" is 


sampled 
active. If RDY *" is returned 
simultaneously 
with BRDY *", BRDY *" is ignored 


and the burst cycle is prematurely 
interrupted 
BRDY *" is active lOW 
and is provided 
with a small pullup resistor. 
BRDY *" must 


satisfy the setup and hold times t16 and tn. 


BLAST*" 
a 
The burst last signal indicates 
that the next time BRDY *" is returned 
the burst bus 


cycle is complete. 
BLAST *" is active for both burst and non-burst 
bus cycles. 
BLAST *" 
is active lOW 
and is not driven during bus hold. 


INTERRUPTS 


RESET 
I 
The RESET input forces the CPU to begin execution 
at a known state. Reset is 


asynchronous, 
but must meet setup and hold times t20 and t21 for recognition 
in any 


specific 
clock. The CPU cannot 
begin execution 
of instructions 
until at least 1 ms after 


Vcc and ClK 
have reached 
their proper AC and DC specifications. 
However, 
for soft 


resets, RESET should remain active for at least 15 ClK 
periods. The RESET pin 
should remain active during this time to ensure proper CPU operation. 
RESET is active 


HIGH. 
RESET sets the 5MBASE 
descriptor 
to a default address of 30000H. 
If the system 


uses 5MBASE 
relocation, 
then the SRESET pin should be used for soft resets. 


SRESET 
I 
The SRESET pin duplicates 
all the functionality 
of the RESET pin with the following 
two 


exceptions: 
1. The 5MBASE 
register will retain its previous value. 


2. If UP*" (I) is asserted, 
SRESET will not have an effect on the host microprocessor. 


For soft resets, SRESET should remain active for at least 15 ClK 
periods. SRESET 
is 


active HIGH. SRESET is asynchronous 
but must meet setup and hold times t20 and t21 
for recognition 
in any specific clock. 


SMI*" 
I 
The System 
Management 
Interrupt 
input is used to invoke the System Management 


Mode (SMM). SMI *" is a falling edge triggered 
signal which forces the CPU into SMM at 


the completion 
of the current instruction. 
SMI *" is recognized 
on an instruction 


boundary 
and at each iteration for repeat string instructions. 
SMI *" does not break 


lOCKed 
bus cycles and cannot 
interrupt 
a currently 
executing 
SMM. The CPU will 


latch the falling edge of one pending SMI *" signal while the CPU is executing 
an 


existing SMI. The nested SMI will not be recognized 
until after the execution 
of a 


Resume 
(RSM) instruction. 


SMIACT*" 
a 
The System 
Management 
Interrupt 
ACTive 
is an active low output, indicating 
that 


the processor 
is operating 
in SMM. It is asserted 
when the CPU begins to execute 
the 


SMI state save sequence 
and will remain active lOW 
until the processor 
executes 
the 


last state restore cycle out of SMRAM. 


STPClK*" 
I 
The SToP CLocK request 
input signal indicates 
a request has been made to turn off 


the ClK 
input. When the CPU recognizes 
a STPClK 
*", the processor 
will stop 


execution 
on the next instruction 
boundary, 
unless superseded 
by a higher priority 


interrupt, 
empty all internal 
pipelines 
and the write buffers and generate 
a Stop Grant 


acknowledge 
bus cycle. STPClK 
*" is active lOW 
and is provided 
with an internal pull- 


up resistor. STPClK*" 
is asynchronous 
but setup and hold times t20 and t21 must be 


met to ensure recognition 
in any specific 
clock. 


Symbol 
Type 
Name and Function 


INTERRUPTS 
(Continued) 


INTR 
I 
The maskable 
interrupt 
indicates 
that an external 
interrupt 
has been generated. 
If the 
internal interrupt 
flag is set in EFLAGS, active interrupt 
processing 
will be initiated. The 
Intel OverDrive 
processor 
will generate 
two locked interrupt 
acknowledge 
bus cycles in 


response 
to the INTR pin going active. INTR must remain active until the interrupt 


acknowledges 
have been performed 
to assure that the interrupt 
is recognized. 


INTR is active HIGH and is not provided 
with an internal 
pulldown 
resistor. 
INTR is 


asynchronous, 
but must meet setup and hold times t20 and t21 for recognition 
in any 


specific clock. 


NMI 
I 
The non-maskable 
interrupt 
request signal indicates 
that an external 
non-maskable 


interrupt 
has been generated. 
NMI is rising edge sensitive. 
NMI must be held LOW for at 


least four CLK periods before this rising edge. NMI is not provided 
with an internal 


pulldown 
resistor. NMI is asynchronous, 
but must meet setup and hold times t20 and t21 


for recognition 
in any specific 
clock. 


BUS ARBITRATION 


BREa 
0 
The internal 
cycle pending 
signal indicates 
that the Intel OverDrive 
processor 
has 
internally 
generated 
a bus request. 
BREa 
is generated 
whether 
or not the Intel OverDrive 


processor 
is driving the bus. BREa 
is active HIGH and is never floated. 


HOLD 
I 
The bus hold request 
allows another bus master complete 
control 
of the Intel OverDrive 


processor 
bus. In response 
to HOLD going active the Intel OverDrive 
processor 
will float 


most of its output and input/output 
pins. HLDA will be asserted 
after completing 
the 


current bus cycle, burst cycle or sequence 
of locked cycles. The Intel OverDrive 
processor 
will remain in this state until HOLD is deasserted. 
HOLD is active high and is 


not provided 
with an internal pulldown 
resistor. 
HOLD must satisfy setup and hold times 


t18 and t19 for proper operation. 


HLDA 
0 
Hold acknowledge 
goes active in response 
to a hold request presented 
on the HOLD pin. 


HLDA indicates 
that the Intel OverDrive 
processor 
has given the bus to another 
local bus 
master. HLDA is driven active in the same clock that the Intel OverDrive 
processor 
floats 
its bus. HLDA is driven inactive when leaving bus hold. HLDA is active HIGH and remains 
driven during bus hold. 
• 


intel~ 


Symbol 
Type 
Name and Function 


BUS ARBITRATION 
(Continued) 


BOFF# 
I 
The backoffinput 
forces the Intel OverDrive 
processor 
to float its bus in the next clock. 


The microprocessor 
will float all pins normally floated 
during bus hold but HLDA will not 
be asserted 
in response 
to BOFF #. BOFF # has higher priority than ROY II or BRDY II; 
if both are returned 
in the same clock, BOFF # takes effect. The microprocessor 
remains in bus hold until BOFF # is negated. 
If a bus cycle was in progress 
when 
BOFF # was asserted 
the cycle will be restarted. 
BOFF # is active LOW and must meet 
setup and hold times t18 and t19 for proper operation. 


CACHE INVALIDATION 


AHOLD 
I 
The address hold request allows another 
bus master access to the Intel OverDrive 
processor's 
address bus for a cache invalidation 
cycle. The Intel OverDrive 
processor 
will stop driving its address 
bus in the clock following 
AHOLD 
going active. Only the 
address 
bus will be floated 
during address 
hold, the remainder 
of the bus will remain 
active. AHOLD is active HIGH and is provided with a small internal 
pulldown 
resistor. 
For 
proper operation 
AHOLD 
must meet setup and hold times t18 and t19. 


EADS# 
I 
This signal indicates 
that a valid external address 
has been driven onto the Intel 
OverDrive 
processor 
address 
pins. This address will be used to perform 
an internal 
cache invalidation 
cycle. EADS# 
is active LOW and:s 
provided with an internal 
pullup 
resistor. 
EADS# 
must satisfy setup and hold times t12 and t13 for proper operation. 


CACHE CONTROL 


KEN# 
I 
The cache enable pin is used to determine 
whether 
the current cycle is cacheable. 
When the Intel OverDrive 
processor 
generates 
a cycle that can be cached 
and KEN # is 
active, the cycle will become 
a cache line fill cycle. Returning 
KEN # active one clock 
before ROY # during the last read in the cache line fill will cause the line to be placed in 
the on-chip cache. KEN # is active LOW and is provided 
with a small internal 
pull up 
resistor. 
KEN# 
must satisfy setup and hold times t14 and t15 for proper operation. 


FLUSH# 
I 
The cache flush input forces the Intel OverDrive 
processor 
to flush its entire internal 
cache. FLUSH II is active low and need only be asserted 
for one clock. FLUSH # is 
asynchronous 
but setup and hold times t20 and t21 must be met for recognition 
in any 
specific 
clock. FLUSH # being sampled 
low in the clock before the falling edge of 
RESET causes the Intel OverDrive 
processor 
to enter the tri-state 
test mode. 


PAGE CACHEABILITY 


PWT 
0 
The page write-through 
and page cache disable pins reflect the state of the page 
PCD 
0 
attribute 
bits, PWT and PCD, in the page table entry or page directory 
entry. If paging is 
disabled 
or for cycles that are not paged, PWT and PCD reflect the state of the PWT and 
PCD bits in control 
register 3. PWT and PCD have the same timing as the cycle definition 
pins (MIIO#, 
D/C# 
and W/R#). 
PWT and PCD are active HIGH and are not driven 
during bus hold. PCD is masked by the cache disable bit (CD) in Control 
Register o. 


NUMERIC 
ERROR REPORTING 


FERR# 
0 
The floating point error pin is driven active when a floating 
point error occurs. 
FERR # is 
similar to the ERROR # pin on the Intel387™ 
math coprocessor. 
FERR # is included 
for 
compatibility 
with systems 
using DOS type floating 
point error reporting. 
FERR # will not 
go active if FP errors are masked in FPU register. 
FERR# 
is active LOW, and is not 
floated 
during bus hold. 


Symbol 
Type 
Name and Function 
.' 


NUMERIC 
ERROR REPORTING 
(Continued) 


IGNNE# 
I 
When the ignore numeric 
error pin is asserted 
the Intel OverDrive 
processor 
will ignore a 
numeric error and continue 
executing 
non-control 
floating 
point instructions, 
but FERR # 
will still be activated 
by the Intel OverDrive 
processor. 
When IGNNE # is deasserted 
the 
Intel OverDrive 
processor 
will freeze on a non-control 
floating 
point instruction, 
if a 
previous floating 
point instruction 
caused an error. IGNNE # has no effect when the NE 
bit in control 
register 0 is set. IGNNE # is active LOW and is provided with a small 
internal 
pullup resistor. 
IGNNE# 
is asynchronous 
but setup and hold times t20 and t21 
must be met to insure recognition 
on any specific 
clock. 


BUS SIZE CONTROL 


8516# 
I 
The bus size 16 and bus size 8 pins (bus sizing pins) cause the Intel OverDrive 
858# 
I 
processor 
to run multiple bus cycles to complete 
a request from devices that cannot 
provide or accept 32 bits of data in a single cycle. The bus sizing pins are sampled 
every 
clock. The state of these pins in the clock before ready is used by the Intel OverDrive 
processor 
to determine 
the bus size. These signals are active LOW and are provided 
with internal pullup resistors. 
These inputs must satisfy setup and hold times t14 and t15 
for proper operation. 


ADDRESS 
MASK 


A20M# 
I 
When the address bit 20 mask pin is asserted, 
the Intel OverDrive 
processor 
masks 
physical 
address bit 20 (A20) before performing 
a lookup to the internal cache or driving 
a memory cycle on the bus. A20M # emulates 
the address wraparound 
at one Mbyte 
which occurs on the 8086. A20M # is active LOW and should be asserted 
only when the 
processor 
is in real mode. This pin is asynchronous 
but should meet setup and hold 
times t20 and t21 for recognition 
in any specific 
clock. For proper operation, 
A20M # 
should be sampled 
high at the falling edge of RESET. 


i486 DX AND 1486 SX CPU INTERFACE 


UP#(1,2) 
0 
The upgrade present 
pin is used to signal the Intel486 
Microprocessor 
to float its outputs 
and get off the bus. It is active low and is never floated. 
UP# 
is driven low at power-up 
and remains active for the entire duration 
of the Upgrade 
Processor 
operation. 


KEY PIN 


KEy(2) 
The KEY pin is an electrically 
non-functional 
pin which is used to ensure correct 
Upgrade 
Processor 
orientation 
in a 169-pin socket. 


NOTE: 
1. The UPii' pin was previously named the MPiI' pin in the i486 SX Microprocessorli487 SX Math CoProcessor data book. 
The functionality is the same, only the name has changed. 
2. The UPii' input pin and KEY pin are not defined on the OverDrive processor for replacement of PGA Intel486 DX Micro- 
processor (ODPR). 


Name 
Active 
When 
Level 
Floated 


BREQ 
HIGH 
HLDA 
HIGH 
BEO#-BE3# 
LOW 
Bus Hold 
PWT,PCD 
HIGH 
Bus Hold 


W/R#, 
D/C#, 
M/IO# 
HIGH 
Bus Hold 
LOCK# 
LOW 
Bus Hold 
PLOCK# 
LOW 
Bus Hold 


ADS# 
LOW 
Bus Hold 
BLAST# 
LOW 
Bus Hold 
PCHK# 
LOW 
FERR# 
LOW 
SMIACT# 
LOW 
UP# 
LOW 


A2-A3 
HIGH 
Bus, Address 
Hold 


Name 
Active 
When 
Level 
Floated 


DO-D31 
HIGH 
Bus Hold 


DPO-DP3 
HIGH 
Bus Hold 


A4-A31 
HIGH 
Bus, Address 
Hold 
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Name 
Active 
Synchronous/ 


Level 
Asynchronous 


CLK 
RESET 
HIGH 
Asynchronous 


HOLD 
HIGH 
Synchronous 


AHOLD 
HIGH 
Synchronous 


EADS# 
LOW 
Synchronous 


BOFF# 
LOW 
Synchronous 


FLUSH# 
LOW 
Asynchronous 


A20M# 
LOW 
Asynchronous 


BS16#, 
BS8# 
LOW 
Synchronous 
KEN# 
LOW 
Synchronous 


RDY# 
LOW 
Synchronous 


BRDY# 
LOW 
Synchronous 


INTR 
HIGH 
Asynchronous 


NMI 
HIGH 
Asynchronous 


SRESET 
HIGH 
Asynchronous 


SMI# 
LOW 
Asynchronous 


STPCLK# 
LOW 
Asynchronous 
IGNNE# 
LOW 
Asynchronous 


Intel OverDrlve™ Processor Plpellned 32-Bit Mlcroarchltecture 
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Transfer Bus 
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Index 
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Translation 
Lookaside 
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2.0 
DIFFERENCES FROM 
Intel486™ SX AND 
Intel486™ OX CPU 
FUNCTIONALITY 


The Intel486 DX2 OverDrive processor is essentially 
an enhanced Intel486 Microprocessor. There are 
three functional differences between the Intel Over- 
Drive processor and Intel486 Microprocessors. First, 
the Intel OverDrive processor has an internal clock 
doubling circuit which decreases the time required to 
execute instructions. Second, the Intel OverDrive 
processor does not support the JTAG boundary 
scan test feature. Third, the Intel OverDrive proces- 
sor has a different CPU revision identification than 
the Intel486 SX or Intel486 DX CPUs. These three 
differences are described in the following sections 
according to how they effect the CPU functionality. 


2.1 
Hardware Interface 


The Intel OverDrive processor bus has been de- 
signed to be identical with the Intel486 Microproces- 
sor bus. Although the external clock is internally 
doubled and data and instructions are manipulated 
in the CPU core at twice the external frequency, the 
external bus is functionally identical with the Intel486 
CPU. 


The four boundary scan test signals (TCK, Test 
clock; TMS, Test Mode select; TDI, Test Data Input; 
TDO, Test Data Output), defined for some Intel486 
CPUs, are 
not 
specified for 
the 
Intel486 
DX2 
OverDrive processor. 


The UP# (Upgrade Present) signal, which is defined 
as an input for some Intel486 CPUs, is an output 
signal on the Intel OverDrive processor. The UP# 
pin on the Intel OverDrive processor provides a logi- 
cal low output signal which can be used to enable 
logic to recognize and configure the system for the 
Intel OverDrive processor. This signal is identical to 
the MP# output defined for the Intel487 SX Math 
CoProcessor. Refer to Section 3 for examples of use 
of the UP# signal. 


The DX register always contains the component 
identifier at the conclusion of RESET. The Intel 
OverDrive processor has a different revision identifi- 
er in the DL register than the Intel486 SX or Intel486 
DX Microprocessors (refer to Section 4.1). When the 
OverDrive processor is installed in a system the 
component identifier is supplied by the OverDrive 
processor, rather than the original CPU. The step- 
ping identification portion of the component identifi- 
cation will change with different revisions of the 
OverDrive processor. The designer should only as- 
sume that the component 
identification for the 
OverDrive processor will be 043xH, where 'x' is the 
stepping identifier. 


2.2 Testability 


As detailed in Section 2.1, the Intel OverDrive proc- 
essor does not support the JTAG boundary scan 
testability feature. 


2.3 
Instruction Set Summary 


The Intel OverDrive processor supports all Intel486 
extensions to the 8086/80186/80286 
instruction 
set. In general, instructions will execute faster on the 
Intel OverDrive processor than the Intel486 Micro- 
processor. Specifically, an instruction that only uses 
memory from the on-chip cache executes at the full 
core clock rate while all bus accesses execute at the 
bus clock rate. To calculate the elapsed time of an 
instruction, the number of clock counts for that in- 
struction must be multiplied by the clock period for 
the system. The instruction set clock count summary 
tables from the Intel486 SX and Intel486 DX Micro- 
processor 
Data 
Sheets 
can 
be 
used 
for 
the 
OverDrive processor with the following modifica- 
tions: 
- 
Clock counts for a cache hit: This value repre- 
sents the number of internal CPU core clocks for 
an instruction that requires no external bus ac- 
cesses or the base core clocks for an instruction 
requiring external bus accesses. 
- 
Penalty clock counts for a cache miss: This value 
represents the worst-case approximation of the 
additional number of external clock counts that 
are required for an instruction which must access 
the external bus for data (a cache miss). This 
number must be multiplied by 2 to convert it to 
an equal number of internal CPU core clock 
counts and added to the base core clocks to 
compute the total number of core clocks for this 
instruction. 


The actual number of core clocks for an instruction 
with a cache miss may be less than the base clock 
counts (from the cache hit column) plus the penalty 
clock counts (2 times the cache miss column num- 
ber). The clock counts in the cache miss penalty 
column can be a cumulative value of external bus 
clocks (for data reads) and internal clocks for manip- 
ulating the data which has been loaded from the ex- 
ternal bus. The number of clocks which are related 
to external bus accesses are correctly represented 
in terms of internal core clocks by multiplying by two. 
However, the clock counts related to internal data 
manipulation should not be multiplied by two. There- 
fore the total number of CPU core clock counts for 
an instruction with a cache miss represents a worst- 
case approximation. 


To calculate the execution time for an OverDrive 
processor instruction, multiply the total CPU core 
clock counts by the core clock period. For example, 
in a 25 MHz system the core clock period is 20 ns 
(1/50 MHz). 
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Additionally, 
the 
assumptions 
specified 
below 
should be understood in order to estimate instruc- 
tion execution time. 


A cache miss will force the OverDrive processor to 
run an external bus cycle. The Intel486 microproces- 
sor 32-bit burst bus is defined as r-b-w. 


Where: 
r = 
The number of bus clocks in the first cycle of a 
burst read or the number of clocks per data 
cycle is a non-burst read. 
b = 
The number of bus clocks for the second and 
subsequent cycles in a burst read. 
w = The number of bus clocks for a write. 


The fastest bus the OverDrive processor can sup- 
port is 2 -1 - 2 assuming 0 waits states. The clock 
counts in the cache miss penalty column assume a 
2- 1- 2 bus. For slower busses add r- 2 clocks to 
the cache miss penalty for the first dword accessed. 
Other factors also affect instruction clock counts. 


Instruction 
Clock Count Assumptions 
1. The external bus is available for reads or writes at 
all times. Else add bus clocks to reads until the 
bus is available 
2. Accesses are aligned. Add three core clocks to 
each misaligned access. 
3. Cache fills complete before subsequent accesses 
to the same line. If a read misses the cache dur- 
ing a cache fill due to a previous read or prefetch, 
the read must wait for the cache fill to complete. If 
a read or write accesses a cache line still being 
filled, it must wait for the fill to complete. 
4. If an effective address is calculated, the base reg- 
ister is not the destination register of the preced- 
ing instruction. If the base register is the destina- 
tion register of the preceding instruction add 1 to 
the core clock counts shown. Back-to-back PUSH 
and POP instructions are not affected by this rule. 


5. An effective address calculation uses one base 
register and does not use an index register. 
However, if the effective address calculation 
uses an index register. 1 core clock may be add- 
ed to the clock shown. 
6. The target of a jump is in the cache. If not, add r 
clocks for accessing the destination instruction 
of a jump. If the destination instruction is not 
completely contained in the first dword read, add 
a maximum of 3b bus clocks. If the destination 
instruction is not completely contained in the first 
16 byte burst, add a maximum of another r+ 3b 
bus clocks. 
7. If no write buffer delay, w bus clocks are added 
only in the case in which all write buffers are full. 


8. Displacement and immediate not used together. 


If displacement and immediate used together, 1 
core clock may be added to the core clock count 
shown. 
9. No invalidate cycles. Add a delay of 1 bus clock 
for each invalidate cycle if the invalidate cycle 
contends for the internal cache/external 
bus 
when the OverDrive processor needs to use it. 


10. Page translation hits in TLB. A TLB miss will add 
13,21 or 28 bus clocks + 1 possible core clock 
to the instruction depending on whether the Ac- 
cessed and/or Dirty bit in neither, one or both of 
the page entries needs to be set in memory. This 
assumes that neither page entry is in the data 
cache and a page fault does not occur on the 
address translation. 
11. No exceptions are detected during instruction 
execution. Refer to interrupt core Clock Counts 
Table for extra clocks if an interrupt is detected. 


12. Instructions that read multiple consecutive data 
items (i.e., task switch, POPA, etc.) and miss the 
cache are assumed to start the first access on a 
16-byte boundary. If not, an extra cache line fill 
may be necessary which may add up to (r+3b) 
bus clocks to the cache miss penalty. 
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3.0 
IntelOverDrive™ 
PROCESSOR 
CIRCUIT DESIGN 


Figures 3.1, 3.2 and 3.3 show the circuits which in- 
terface the original CPU with the Intel OverDrive 
processor socket. These circuits allow Intel486 OX 
and Intel486 SX CPU based systems to be upgraded 
with the Intel OverDrive processor. 


The circuits shown in Figures 3.1 and 3.2 may only 
be used for Intel486 CPUsthat do not have the UP# 
input pin. The circuit shown in Figure 3.3 should be 
used for all Intel486 CPUs that have the UP# input 
pin. 


3.1 
Upgrade Circuit for PGA 
Intel486TMOX Based Systems 


The Intel OverDrive processor socket Circuit for 
PGA Intel486 OX CPU based systems allows the 
Intel486 OX CPU complete control of the system 
when the Intel OverDrive processor socket is unpop- 
ulated. The HLDA signal from the Intel OverDrive 
processor socket should be tied low through a resis- 
tor while the UP# and FERR# signals from the Intel 
OverDrive processor socket should be tied high 
through a resistor to insure that the Intel486 OXCPU 
functions correctly when an Intel OverDrive proces- 
sor socket component is not installed. 


When the Intel OverDrive rocessor is installed, the 
Upgrade Present output, UP# 
pin, causes the 
FLUSH" and BOFF# signals to be driven active to 
the Intel486 OX CPU. When the Intel486 OX CPU 
samples FLUSH# active during reset, the Intel486 
OXCPU enters tri-state output test mode after reset, 
which causes the Intel486 OX CPU to float all of its 
output signals. To float most of the Intel486 OX 
CPU's output pins before the end of reset, BOFF# 
is also driven active to the Intel486 OXCPU. BOFF# 
immediately causes all output signals to float except 
PCHK#, BREQ, HLDA and FERR#. 


In addition to floating the PGA Intel486 OX CPU's 
outputs, the Intel486 OX CPU's HLDA and FERR# 
signals must be gated to prevent potential bus con- 
tention with the Intel OverDrive processor's HLDA 
and FERR# signals during reset. During reset the 
Intel486 OX CPU may not recognize HOLD active 
because BOFF# is driven active to the Intel486 OX 
CPU by the Intel OverDrive processor. If the Intel486 
OX CPU does not recognize HOLD active, it will not 
drive HLDA active. However, the Intel OverDrive 
processor will recognize HOLD active and drive 
HLDA. By gating the HLDA signals from the Intel486 
OX CPU and Intel OverDrive processor socket, bus 
contention is avoided if HOLD is driven active during 
reset. Because the state of FERR# 
is undefined 
during reset, bus contention is also avoided by gat- 
ing FERR#. 
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Figure 3.2. Intel OverDrive 
Socket 
Circuit Diagram for PGA Intel486 
SX CPU Based Systems 


Upgrade Circuit for PGA Intel486 
the PGA Intel486 SX CPU enters tri-state output test 
SX CPU Based Systems 
mode after reset which causes the PGA Intel486 SX 
CPU to float all of its output signals. To float most of 
the PGA Intel486 SX CPU's output pins before the 
end of reset, BOFF# is also driven active to the 
PGA Intel486 SX CPU. BOFF# immediately causes 
all output signals to float except PCHK#, BREQ, 
and HLDA. 


The Intel OverDrive processor socket Circuit for 
PGA Intel486 SX CPU based systems is the same 
as the Intel OverDrive processor socket Circuit for 
the PGA Intel486 OXCPU based system except that 
no gates are needed for the FERR# signal. The 
OverDrive processor socket Circuit allows the PGA 
Intel486 SX CPU complete control of the system 
when the Intel OverDrive processor socket is unpop- 
ulated. The HLDA signal from the Intel OverDrive 
processor socket should be tied low through a resis- 
tor while the UP# and FERR# signals from the Intel 
OverDrive processor socket should be tied high 
through a resistor to insure that the PGA Intel486 SX 
CPU functions correctly when an Intel OverDrive 
processor is not installed. 


When the Intel OverDrive processor is installed, the 
Upgrade Present output, UP# 
pin, causes the 
FLUSH# and BOFF# signals to be driven active to 
the PGA Intel486 SX CPU. When the PGA Intel486 
SX CPU samples FLUSH# 
active during reset, 


In addition to floating the PGA Intel486 SX CPU's 
outputs, the PGA Intel486 SX CPU's HLDA signal 
must be gated to prevent potential bus contention 
with the Intel OverDrive processor's HLDA signal 
during reset. During reset, the PGA Intel486 SX CPU 
may not recognize HOLD active because BOFF# is 
driven active to the PGA Intel486 SX CPU by the 
Intel OverDrive processor. If the PGA Intel486 SX 
CPU does not recognize HOLD active, it will not 
drive HLDA active. However, the Intel OverDrive 
processor will recognize HOLD active and drive 
HLDA. By gating the HLDA signals from the PGA 
Intel486 SX CPU and Intel OverDrive processor, bus 
contention is avoided if HOLD is driven active during 
reset. 
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Figure 3.3. Intel OverDrlve™ 
Socket 
Circuit Diagram for Systems 
Based on Intel486 
CPUs That Have the UP# 
Input Pin 


3.3 
Upgrade Circuit for Intel486 CPU 
Based Systems with UP # 


allows the Intel486 CPU to directly recognize when 
the Intel OverDrive processor socket is populated. 
When the UP# pin is driven active to the Intel486 
CPU, the Intel486 CPU tri-states all of its output pins 
and enters power-down mode. 


The Intel OverDrive processor socket circuit for 
UP# Intel486 CPU based systems requires no addi- 
tional gates. The Upgrade Present input, UP# pin, 


The following should be considered when designing 
a system for upgrade with an Intel OverDrive proces- 
sor. 


4.1 
Intel OverDrive Processor 
Detection 


The 
component 
identifier 
and 
stepping/revision 
identifier for the Intel OverDrive processor is read- 
able in DH and DL registers respectively, immediate- 
ly after RESET, where 


DH = 04h 


DL = 30h-3Fh. 


As it is difficult to differentiate between the Intel486 
OX CPU and the Intel OverDrive processor in soft- 
ware, it is recommended that the BIOS save the 
contents of the OX register, immediately after RE- 
SET, so that this information can be used later, if 
required, to identify an Intel OverDrive processor in 
the system. 


NOTE: 
Initialization routines for Intel486 SX CPU systems 
should check for the presence of a floating point 
unit and set the CRD register accordingly. (Refer to 
the Intel486 SX Microprocessor Data Book for spe- 
cific details.) 


The Intel OverDrive processor executes instructions 
at twice the frequency of the input clock. Thus, soft- 
ware (or instruction based) timing loops will execute 
faster on the Intel OverDrive processor than on the 
Intel486 OX or Intel486 SX CPU (at the same input 
clock frequency). Instructions such as NOP, LOOP, 
and JMP $+ 2, have been used by BIOS to imple- 
ment timing loops that are required, for example, to 
enforce recovery time between consecutive access- 
es for I/O devices. These instruction based timing 
loop implementations may require modification for 
systems intended to be upgradable with the Intel 
OverDrive processor. 


In order to avoid any incompatibilities, it is recom· 
mended that timing requirements be implemented in 
hardware rather than in software. This provides 
transparency and also does not require any change 
in BIOS or I/O device drivers in the future when 
moving to higher processor clock speeds. As an ex- 3 
ample, a timing routine may be implemented as fol· 
lows: The software performs a dummy I/O instruc- 
tion to an unused I/O port. The hardware for the bus 
controller logic recognizes this I/O instruction and 
delays the termination of the I/O cycle to the CPU 
by keeping ROYII or BRDYII deasserted for the ap- 
propriate amount of time. 


The following sections describe recommended elec- 
trical connections for the Intel OverDrive processor, 
and its electrical specifications. 


Power and ground connections must be made to all 
external Vcc and GND pins of the Intel OverDrive 
processor. On the circuit board, all Vcc pins must be 
connected on a Vcc plane. All Vss pins must be 
likewise connected on a GND plane. 


5.1.2 
POWER 
DECOUPLING 
RECOMMENDATIONS 


Liberal decoupling capacitance should be placed 
near 
the 
Intel 
OverDrive 
processor. 
The 
Intel 
OverDrive processor driving its 32-bit parallel ad- 
dress and data busses at high frequencies can 
cause transient power surges, particularly when driv- 
ing large capacitive loads. 


Low inductance capacitors and interconnects are 
recommended for best high frequency electrical per- 
formance. Inductance can be reduced by shortening 
circuit board traces between the Intel OverDrive 
processor and decoupling capacitors as much as 
possible. Capacitors specifically for PGA packages 
are also commercially available. 


5.1.3 
OTHER 
CONNECTION 
RECOMMENDATIONS 


For reliable operation, always connect unused in- 
puts to an appropriate signal level. Active LOW in- 
puts should be connected to Vcc through a pullup 
resistor. Pullups in the range of 20 K.l1 are recom- 
mended. Active HIGH inputs should be connected to . 
GND. 


Table 5.1 is a stress rating only, and functional oper- 
ation at the maximums is not guaranteed. Function 
operating conditions are given in Section 5.3 D.C. 
Specifications and Section 5.4 A.C. Specifications. 


Extended exposure to the Maximum Ratings may af- 
fect device reliability. Furthermore, although the Intel 
OverDrive processor contains protective circuitry to 
resist damage from static electric discharge, always 
take precautions to avoid high static voltages or 
electric fields. 


Table 5.1. Absolute 
Maximum 
Ratings 
Case Temperature under Bias 
- 65·C to + 11O·C 


Storage Temperature 
- 65·C to + 150·C 


Voltage on Any Pin with 
Respect to Ground 
-0.5 to Vcc + 0.5V 


Supply Voltage with 
Respect to Vss 
- 0.5V to + 6.5V 
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5.3 
D.C. Specifications 
Functional 
Operating 
Range: VCC = 5V ±5%; 
(Note 
1) 


Table 5-2. DC Parametric 
Values 


Symbol 
Parameter 
Mln 
Max 
Unit 
Notes 


Vil 
Input Low Voltage 
-0.3 
+0.8 
V 


VIH 
Input High Voltage 
2.0 
V 


Val 
Output Low Voltage 
V 
(Note 2) 


VOH 
Output High Voltage 
2.4 
V 
(Note 3) 


Icc 
Power Supply Curre'nt 
CLK = 33 MHz 
CLK = 25 MHz 
mA 
(Note 4) 
CLK = 20 MHz 
CLK = 16 MHz 


III 
Input Leakage Current 
~ 
IJ-A 
(Note 5) 


IIH 
Input Leakage Current 
IJ-A 
(Note 6) • 


III 
Input Leakage Current 
IJ-A 
(Note 7) 


ILO 
Output Leakage 
Cur 
±15 
IJ-A 


CIN 
Input Capacitance 
13 
pF 
Fc = 1 MHz (Note 8) 


Co 
I/O or Output Capacitance 
17 
pF 
Fc = 1 MHz (Note 8) 


CClK 
CLK Capacitance 
15 
pF 
Fc = 1 MHz (Note 8) 


NOTES: 
1. The function operating temperature range is: 


OverDrive processor-20 
MHz, Tcase = O·Cto +95·C 
OverDrive processor-25 
MHz, Tsink = O·Cto +85·C 
OverDrive processor-33 
MHz, Tsink = O·Cto + 85·C 
2. This parameter is measured at: 


Address, Data, BEn 
4.0 mA 
Definition, Control 
5.0 mA 
3. This parameter is measured at: 
Address, Data, BEn 
-1.0 
mA 
Definition, Control 
-0.9 
mA 
4. Typical supply current: 


525 mA @ ClK = 16 MHz 
625 mA @ ClK = 20 MHz 
775 mA @ ClK 
= 25 MHz 
975 mA @ ClK = 33 MHz 
5. This parameter is for inputs without internal pullups or pulldowns and 0 S; VIN S; Vcc. 
6. This parameter is for inputs with internal pulldowns and VIH = 2.4V. 
7. This parameter is for inputs with internal pullups and VIL = 0.45V. 
8. Not 100% tested. 


5.4 
A.C. Specifications 


The AC. 
specifications, 
given in Tables 
5.3.1, 5.3.2, 
5.3.3, and 5.3.4, consist of output delays, input setup 
requirements 
and input 
hold requirements. 
All AC. 
specifications 
are relative 
to the rising edge 
of the 
CLK signal. 


A.C. specifications 
measurement 
is defined 
by Fig- 
ures 5.1-5.6. 
All timings 
are referenced 
to 1.5V un- 
less otherwise 
specified. 
Inputs 
must 
be driven 
to 


the voltage 
levels indicated 
by Figure 5.3 when AC. 
specifications 
are measured. 
Intel OverDrive 
proces- 


sor output 
delays 
are specified 
with 
minimum 
and 
maximum 
limits, measured 
as shown. 
The minimum 
Intel OverDrive 
processor 
delay times are hold times 
provided to external 
circuitry. 
Intel OverDrive 
proces- 
sor input setup and hold times are specified 
as mini- 
mums, 
defining 
the 
smallest 
acceptable 
sampling 
window. 
Within the sampling 
window, 
a synchronous 
input 
signal 
must 
be 
stable 
for 
correct 
Intel 
OverDrive 
processor 
operation. 
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VCC = 5V ±5%; 
Tsink = O°C to +85°C; 
CI = 50 pF unless otherwise 
specified 
(Note 2) 


Symbol 
Parameter 
Mln 
Max 
Unit 
Figure 
Notes 


Frequency 
8 
33 
MHz 
1X Clock Driven to 
OverDrive 
processor 


t1 
CLK Period 
30 
125 
ns 
5.1 


t1a 
CLK Period Stability 
0.1% 
A 
Adjacent 
Clocks 


t2 
CLK High Time 
11 
ns 
5.1 
at2V 


t3 
CLKLowTime 
11 
ns 
5.1 
atO.8V 
4 
CLK Fall Time 
3 
ns 
5. Lej.,. to 0.8V 


t5 
CLK Rise Time 
.-3. 
ns 
"'~#.••••0.8V to 2V 


te 
A2-A31, 
PWT, PCD, BEO-3#, 
M/IO#, 


~ 
•• 14 
~~~~ 


(Note 3) 
D/C#, 
W/R#, 
ADS#, 
LOCK#, 
SMIACT#, 
FERR#, 
BREQ, HLDA Valid Delay 


t7 
A2-A01 , PWT, PCD, 8EO-3', 
MI10'~il'o'" 
~ 
:;:~ 
5.6 
(Note 1) 
D/C#, 
W/R#, 
ADS#, 
LOCK# 
Float D 
~ 


ta 
PCHK # Valid Delay 
3 ~ ~..if2~ 
ti 
5.4 
(Note 3) 


t8a 
BLAST #, PLOCK # Valid DelaYl\..~~ .••v 
~ 


,)Jt li~s 
5.5 
(Note 3) 
..- 


t9 
BLASH, 
PLOCK# 
Float 9..-fllN~~ 
~~ 


~. 
ns 
5.6 
(Note 1) 


t10 
DO D31, DPO 3 Write 
Id De~ 
••~ 
~'~8 
ns 
5.5 
(Note 3) 


t11 
DO D31, DPO 3 W' 
Float Delay9 .L"~ 
20 
ns 
5.6 
(Note 1) 


t12 
EADS # Setup Tim~"". 
"':V ~ 
"'C 
5 
ns 
5.2 


t13 
EADS # Hold TIme 
/~...'~'" 
;.v 
3 
ns 
5.2 


t14 
KEN#, 
BS16#, 
BS8#c_~t~ 
5 
ns 
5.2 


t15 
KEN#,BS16#,BS8~ld~~c 
3 
ns 
5.2 


t16 
RDY #, BRDY.~...iJP 
Time 


T 


5 
5.3 
ns 


t17 
RDY #, BR!Ii 
.• }ro"ld Time 
3 
ns 
5.3 


t18 
HOLD, ~1r 
Setup Time 
6 
ns 
5.2 


t18a 
BOFF # Setup Time 
7 
ns 
5.2 


t19 
HOLD, AHOLD, 
BOFF # Hold Time 
3 
ns 
5.2 


t20 
RESET, FLUSH#, 
A20M#, 
NMI, INTR, 
5 
ns 
5.2 
SMI#, 
STPCLK#, 
SRESET,IGNNE# 
Setup Time 


t21 
RESET, FLUSH#, 
A20M#, 
NMI, INTR, 
3 
ns 
5.2 
SMI#, 
STPCLK#, 
SRESET, 
IGNNE# 
Hold Time 


t22 
DO-D31, 
DPO-3, 
A4-A31 
Read Setup Time 
5 
ns 
5.2,5.3 


t23 
DO-D31, 
DPO-3, 
A4-A31 
Read Hold Time 
3 
ns 
5.2,5.3 


NOTES: 
1. Not 100% tested. Guaranteed by design characterization. 
2. All timing specifications assume CL = 50 pF. Charts 5.7.1-5.7.3 provides the charts that may be used to determine the 
delay due to derating, depending on the lumped capacitive loading, that must be added to these specification values. 
3. The minimum Intel OverDrive processor output valid delays are hold times provided to external circuitry. 
4. A reset pulse width of 15 ClK cycles is required for warm resets. Power-up resets require RESET to be asserted for at 
least 1 ms after Vcc and ClK are stable. 


Table 
5.3.2. 25 MHz Intel OverDrive 
Processor 
A.C. Characteristics 


Vcc = 5V ± 5%; Tsink = O·C to + 85·C; CI = 50 pF unless otherwise 
specified 
(Note 2) 


Symbol 
Parameter 
Min 
Max 
Unit 
Figure 
Notes 


Frequency 
8 
25 
MHz 
1X Clock Driven to 
OverDrive 
Processor 


t1 
CLK Period 
40 
125 
ns 
5.1 


t1a 
CLK Period Stability 
0.1% 
f,. 
Adjacent 
Clocks 


t2 
CLK High Time 
14 
ns 
5.1 
at2V 


t3 
CLKLowTime 
14 
ns 
5.1 
atO.8V 


~ 
CLKFaliTime 
4 
ns 
5.1 
'" ""toO.8V 


t5 
CLK Rise Time 
4 
ns 
~.1"S.,; -o.8Vto2V 


~ 
A2-A31, 
PWT, PCD, BEO-3#>, MIlO#>, 
3 
9 
ns 


~ 


(Note 3) 
D/C#>, 
W/R#>, 
ADS#>, LOCK#, 
,..' 


~ 


FERR#>, BREQ, HLDA, SMIACT#>, 
--~~ 
Valid Delay 


t7 
A2-A31, 
PWT, PCD, BEO-3#>, M/JO#>,4'i~~.~ 
~~ ~~< 
5.6 
(Note 1) 
D/C#>, 
W/R#>, 
ADS#>, LOCK#> Floa 
.~ 
<r 
~~~ '. 
Delay 


te 
PCHK #>Valid Delay 
..••.. 
',oN' 


~' 
l4~ ':01 
5.4 
(Note 3) 


tea 
BLAST #>,PLOCK #>Valid ~~ 
y 
"41) 
_24 
ns 
5.5 
(Note 3) 


t9 
BLAST #>,PLOCK #>Flo ~ 
/ .. 
~ ,~ 
ns 
5.6 
(Note 1) 


t10 
00-031, 
DPO-3 wr!tl IIf! 
Valid Delay 
,tJ ~~20 
ns 
5.5 
(Note 3) 


t11 
00-031, 
DPO-3Wile 
ata~~la~ 
ET.'"' 
28 
ns 
5.6 
(Note 1) 


t12 
EADS#> Setu~time'" 
~~'f'~"Aof.::' 
~""8 
ns 
5.2 


t13 
EADS#> Hold Time 
_',7') ~~ 
3 
ns 
5.2 


t14 
KEN#>,BS16#>,BS~~~~~~' 
8 
ns 
5.2 


t15 
KEN #>,BS 16 #>..IeS8#>~HoldT~e 
3 
ns 
5.2 


t16 
ROY #>,Be ~'&.Qbtup 
Time 
8 
ns 
5.3 


t17 
ROY#>, DIO 'W'tt' Hold Time 
3 
ns 
5.3 


t1e 
HOLD, A~ 
LD, BOFF #>Setup Time 
8 
ns 
5.2 


t19 
HOLD, AHOLD, 
BOFF #>Hold Time 
3 
ns 
5.2 


t20 
RESET, FLUSH #>,A20M #>,NMI, SMI #>, 
8 
ns 
5.2 
STPCLK #>,SRESET, 
INTR, IGNNE #> 


Setup Time 


t21 
RESET, FLUSH #>,A20M #>,NMI, SMI #>, 
3 
ns 
5.2 
STPCLK#>, SRESET,INTR,IGNNE#> 
Hold Time 


t22 
00-031, 
DPO-3, 
A4-A31 
Read 
5 
ns 
5.2,5.3 
Setup Time 


t23 
00-031, 
DPO-3, 
A4-A31 
Read 
3 
ns 
5.2,5.3 
Hold Time 


NOTES: 
1. Not 1000/0tested. Guaranteed by design characterization. 
2. All timing specifications assume CL = 50 pF. Charts 5.7.1-5.7.3 provides the charts that may be used to determine the 
delay due to derating, depending on the lumped capacitive loading, that must be added to these specification values. 
3, The minimum Intel OverDrive processor output valid delays are hold times provided to external circuitry, 
4. A reset pulse width of 15 ClK cycles is required for warm resets. Power-up resets require RESET to be asserted for at 
least 1 ms after Vcc and ClK are stable, 
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Table 
5.3.3. 20 MHz Intel OverDrive 
Processor 
A.C. Characteristics 


Vcc = 5V ±5%; 
Tease = O°C to + 95°C; CI = 50 pF unless 
otherwise 
specified 
(Note 2) 


Symbol 
Parameter 
Min 
Max 
Unit 
Figure 
Notes 


Frequency 
8 
20 
MHz 
1X Clock Driven to 
OverDrive 
Processor 


t1 
ClK 
Period 
50 
125 
ns 
5.1 


t1a 
ClK 
Period Stability 
0.1% 
tJ. 
Adjacent 
Clocks 


t2 
ClK 
High Time 
16 
ns 
5.1 
at2V 


t3 
ClK 
low Time 
16 
ns 
5.1 
at 0.8V 


t4 
ClKFallTime 
6 
ns 
5.1 
~VtoO.8V 


t5 
ClK 
Rise Time 
6 
ns 
5.~" 
~8Vt02V 


te 
A2-A31, 
PWT, PCD, BEO-3#, 
M/IO#, 
3 
23 
ns 
b···.••••(Note 3) 


D/C#, 
W/R#, 
ADS#, 
lOCK#, 
i;;~~.J:"'~ 
FERR#, 
BREQ, HlDA, 
SMIACT# 


!of 
Valid Delay 


t7 
A2-A31, 
PWT, PCD, BEO-3'*', 
M/IO#, 
~ 
... ~. 
37 l"~ 
5.6 
(Note 1) 


D/C#, 
W/R#, 
ADS#, 
lOCK# 
~ 
rt.~~ 
Float Delay 
~.' ~~ 
.." 


.•.. ~. 
~ 


t8 
PCHK# 
Valid Delay 
\~ 
3 
y. I(!'"cL1i" 
5.4 
(Note 3) 


taa 
BLAST #, PLOCK # Valid Dela~ 
;- 
~. 


~~. 
f"Lns 
5.5 
(Note 3) 


t9 
BLAST #, PLOCK # Flo 


,.~~ 
~'''.1 
5.6 
(Note 1) 
At. 
-," 
ns 


tlO 
DO D31, DPO 3 Wri 
'"alid 
De" 
••. ~'f6 
ns 
5.5 
(Note 3) 


t11 
DO-D31, 
DPO-3~ata 
FIQ&t Delay 
I,~ 
37 
ns 
5.6 
(Note 1) 


t12 
EADS# 
SetupTim90 
~"":" ,,.. 
.\l 
~O 
ns 
5.2 


t13 
EADS # Hold Time 
•• 
'T~ 
""' 
41f;"""':f, 
3 
ns 
5.2 


t14 
KEN#, 
BS16#, 
BSIi'~p 
.l~"" 
10 
ns 
5.2 


t15 
KEN#, 
BS16#, 
BsaMold 
"r1O!e 
3 
ns 
5.2 


t16 
RDY #, BRDX{ 
~up 
Time 
10 
ns 
5.3 


t17 
RDY #, E!~YIl-lold 
Time 
3 
ns 
5.3 


t18 
HOLD, Af:I~D, 
Setup Time 
12 
ns 
5.2 


t19 
HOLD, AHOlD, 
BOFF # Hold Time 
3 
ns 
5.2 


t20 
RESET, FlUSH#, 
A20M#, 
NMI, SMI#, 
12 
ns 
5.2 
(Note 4) 


STPClK#, 
SRESET,INTR, 
IGNNE# 
Setup Time 


t21 
RESET, FlUSH#, 
A20M#, 
NMI, SMI#, 
3 
ns 
5.2 
(Note 4) 


STPClK#, 
SRESET, 
INTR,IGNNE# 
Hold Time 


t22 
DO-D31, 
DPO-3, 
A4-A31 
Read 
6 
ns 
5.2,5.3 
Setup Time 


t23 
DO-D31, 
DPO-3, 
A4-A31 
Read 
3 
ns 
5.2,5.3 
Hold Time 


NOTES: 
1. Not 100% tested. Guaranteed by design characterization. 
2. All timing specifications assume CL = 50 pF. Charts 5.7.1-5.7.3 provides the charts that may be used to determine the 
delay due to derating, depending on the lumped capacitive loading, that must be added to these specification values. 
3. The minimum Intel OverDrive processor output valid delays are hold times provided to external circuitry. 
4. A reset pulse width of 15 ClK cycles is required for warm resets. Power-up resets require RESET to be asserted for at 
least 1 ms after Vcc and ClK are stable. 


intel~ 


Table 5.3.4.16MHzIntel OverDrive Processor A.C.Characteristics 


VCC = 5V ± 5%; Tcase = O°C to + 95°C; C, = 50 pF unless otherwise 
specified 


Symbol 
Parameter 


Frequency 


Min 
Max 
Unit 
Figure 
Notes 


8 
16 
MHz 
1X Clock Driven to 
OverDrive 
Processor 


ClK 
Period 
62.5 
125 
ns 
5.1 


ClK 
Period Stability 
0.1 % 
f:1 
Adjacent 
Clocks 


ClK 
High Time 
20 
ns 
5.1 
at 2V 


ClK 
low 
Time 
20 
ns 
5.1 
at 0.8V 


ClK 
Fall Time 
8 
ns 
5.1 
?,V to 0.8V 


ClK 
Rise Time 
8 
ns 
5. ~~ 
~8V 
to 2V 


A2-A31, 
PWT, PCO, BEO-3#, 
M/IO#, 
3 
2&1- 
nsr,~ 
(Note 3) 
O/C#, 
W/R#, 
AOS#, 
lOCK#, 
_, 


FERR#, 
BREQ, HlOA, 
SMIACH 
.••••~ 
t 


Valid Delay 
4", •• 
,.••~ 


A2-A31, 
PWT, PCO, BEO-3#, 
M/lO#,~~~ 
42~~5'6 
O/C#,W/R#,AOS#,lOCK# 
~ 
. 
~ 
. 


Float Delay 
6:"~ 
6" .~ 


PCHK# 
Valid Delay 
~~' 
3 
~ 
5.4 


BLASH, 
PlOCK# 
Valid oela~ 
_Q 
'JI'";,.ns 
5.5 


BLAST#, 
PlOCK# 
Flo 
., 
~ 
A~» T 
~ 
~ 
ns 
5.6 


DO 031,OPO 
3 Wri 
alid OeV 
~~O 
ns 
5.5 


00-031, 
OPO-3 ~8ta 
FI()A! O~lay 
~ 
.. 42 
ns 
5.6 


EAOS# 
SetuRifim~ 
.67~~2 
ns 
5.2 


EAOS # Hold Time 
~~;~ •. -iTh~ 
4 
ns 
5.2 


KEN#, 
BS16#, 
BS 
':"\i. p""'> 
12 
ns 
5.2 


KEN#, 
BS16#,.BS~old~~ 
4 
ns 
5.2 


ROY #, BROYC~_up 
Time 
12 
ns 
5.3 


ROY #, B.•.••• 
'i(';LI' 
old Time 
4 
ns 
5.3 


HOLD, ~b, 
BOFF # Setup Time 
12 
ns 
5.2 


HOLD, AHOlO, 
BOFF # Hold Time 
4 
ns 
5.2 


RESET,FlUSH#,A20M#,NMI,SMI#, 
14 
ns 
5.2 
STPClK#, 
SRESET, 
INTR,IGNNE# 
Setup Time 


RESET, FlUSH#, 
A20M#, 
NMI, SMI#, 
STPClK#, 
SRESET, 
INTR, IGNNE# 
Hold Time 


(Note 3) 


(Note 3) 


(Note 1) 


(Note 3) 


(Note 1) 


NOTES: 
1. Not 100% tested. Guaranteed by design characterization. 
2. All timing specifications assume CL = 50 pF. Charts 5.7.1-5.7.3 provides the charts that may be used to determine the 
delay due to derating, depending on the lumped capacitive loading, that must be added to these specification values. 
3. The minimum Intel OverDrive processor output valid delays are hold times provided to external circuitry. 
4. A reset pulse width of 15 ClK cycles is required for warm resets. Power-up resets require RESET to be asserted for at 
least 1 ms after Vcc and ClK are stable. 


00-031, 
OPO-3, A4-A31 
Read 
Setup Time 


00-031, 
OPO-3, A4-A31 
Read 
Hold Time 


• 


BS8., BS16., 
[ 
KEN. 


SOFF. (180), 
AHOlO, [ 


HOLD 


RESET, FLUSH., 
[ 
A201l., 
IGNNE., 
INTR, 
NIII 


ClK [ 


ROY., BROY. [ 


00-031 [ 
OPO-OP3 


EAOS. 
[ 


A4-A31 
[ 
(READ) 


ClK 
[ 


00-031 
[ 
OPO-OP3 


PCHK# 
[ 


ClK 
[ 


A2-A31, 
PWT, PCO, 
BEO-3#, M!IO#, 
O!C#,W!R#, 
AOS#, 
[ 
lOCK#, fERR#, BREQ, 
HlOA 


00-031, 
OPO-3, 
[ 
(WRITE) 


ClK 
[ 


A2-A31, 
PWT, pce, 
BEO-3#, M!IO#, 
o!C#,W!R#, 
AOS#, 
[ 
lOCK#, fERR#, BREQ, 
HlOA 


00-031, 
OPO-3, 
[ 
(WRITE) 


• 


5.7.1 
TYPICAL 
OUTPUT 
VALID 
DELAY 
VERSUS 
LOAD CAPACITANCE 
UNDER 
WORST 
CASE 
CONDITIONS 
FOR THE Intel OverDrive 
PROCESSOR 


50 
75 
100 
125 
ct. (picofarad.) 


NOTES: 
This graph 
will not be linear outside 
of the CL range shown. 


nom = nominal 
value 
given in A.C. Characteristics 
table. 


5.7.28 
TYPICAL 
LOADING 
DELAY 
VERSUS 
CAPACITIVE 
LOADING 
UNDER 
WORST-CASE 
CON· 
DITIONS 
FOR A HIGH TO LOW TRANSITION 
ON THE Intel OverDrive 
PROCESSOR 


Loading Delay 
2 
(n.) 


infel~ 


5.7.2b 
TYPICAL 
LOADING 
DELAY 
VERSUS 
CAPACITIVE 
LOADING 
UNDER 
WORST-CASE 
CON- 
DITIONS 
FOR A LOW TO HIGH TRANSITION 
ON THE Intel OverDrive 
PROCESSOR 


5.7.3 
TYPICAL 
OUTPUT 
RISE TIME 
VERSUS 
LOAD CAPACITANCE 
UNDER 
WORST-CASE 
CONDITIONS 


> 
6 
o 
NI 
5 
> 
coo 
• 


5 
75 
100 
125 
150 


C1. (picofarads) 


NOTE: 
This graph 
will not be linear outside 
of the CL range 
shown. 
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0, 
• 
s, 


1111.65 
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PIN C3 
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@@@ 
@ 
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@@@ 
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@@ 
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.l:1!REf 
1.52 
• 


45° 
CHAtolfER 
(INDEX CORNER) 


SWAGGED 
PIN 
(4 PL) 


SEATING 
PLANE 


lllB (ALL PINS) 
Lt 


SWAGGED 


PIN 
DETAIL 


Family:Ceramic Pin Grid Array Package 


Symbol 
Millimeter. 
Inche. 


Mln 
Max 
Note. 
Mln 
Max 
Note. 


A 
3.56 
4.57 
0.140 
0.180 


Al 
0.64 
1.14 
SOLID LID 
0.025 
0.045 
SOLID LID 


A2 
2.8 
3.5 
SOLID LID 
0.110 
0.140 
SOLID LID 


A3 
1.14 
1.40 
0.045 
0.055 


B 
0.43 
0.51 
0.017 
0.020 


D 
44.07 
44.83 
1.735 
1.765 


Dl 
40.51 
40.77 
1.595 
1.605 
- I 
81 
2.29 
2.79 
~.O90 
0.110 


L 
2.54 
3.30 
0.100 
0.130 


N 
168.169 
168.169 


SI 
1.52 
2.54 
0.060 I 0.100 


ISSUE 
IWS 
REVX 
7/15/88 


intel~ 


Letter or 
Description 
of Dimensions 
Symbol 


A 
Distance 
from seating plane to highest point of body 


A1 
Distance 
between 
seating plane and base plane (lid) 


j 


A2 
Distance 
from base plane to highest point of body 


A3 
Distance 
from seating plane to bottom of body 


B 
Diameter 
of terminal 
lead pin 


0 
Largest overall package 
dimension 
of length 


01 
A body length dimension, 
outer lead center to outer lead center 


e1 
Linear spacing 
between 
true lead position 
centerlines 


L 
Distance 
from seating plane to end of lead 


51 
Other body dimension, 
outer lead center to edge of body 


NOTES: 
1. Controlling dimension: millimeter. 
2. Dimension "e1" ("e") is non-cumulative. 
3. Seating plane (standoff) is defined by P.C. board hole size: 0.0415-0.0430 inch. 
4. Dimensions "B", "B1" and "C" are nominal. 
5. Details of Pin 1 identifier are optional. 


~O.015:1:0.010 


SECTION 
A-A 


• 


7.1 
The Intel OverDrive™ 
Processor 
with Attached 
Heat Sink 
The heat generated by the Intel OverDrive proces- 
sor requires that heat dissipation be managed care- 
fully. The 33 MHz and 25 MHz Intel OverDrive proc- 
essors are supplied with a heat sink attached with 
adhesive to the package. 33 MHz and 25 MHz sys- 
tem designs must, therefore. provide space for the 
heat sink on the Intel OverDrive processor. 


The heat sink for the Intel OverDrive processor is 
adhesively attached to the standard PGA package. 
Figure 7.1 shows a drawing of the Intel OverDrive 
processor with the heat sink (see Table 7.1 for di- 
mensions). 


The maximum and minimum dimensions for the PGA 
package with heat sink are shown in Table 7.1. 


Int.1 
OVERDRIVElM 
PROCESSOR 
II 


OIlNI-DIRECTIONAL 
HEAT 
SINK 
0 
G 


/ADHESIVE 
II 


Int.1 
OVERDRIVElM 
PROCESSOR 
PCA 
PACKAGE 
TT 
r 
I 
I 
! 
I 


Table 
7.1. Intel OverDrive™ 
Processor 
PGA Package 
Dimensions 
with 
Heat Sink Attached 


Dimension 
(Inches) 
Mln 
Max 


A. 
Heat Sink Width 
1.520 
1.550 


B. PGA Package Width 
1.735 
1.765 


C. Heat Sink Edge Gap 
0.065 
0.155 


D. Heat Sink Height 
0.212 
0.260 


E. Adhesive 
Thickness 
0.008 
0.012 


F. Package 
Height from Stand- 
0.140 
0.180 
Offs 


G. Total Height from Package 
0.360 
0.452 
Stand-Offs 
to Top of Heat 
Sink 


The standard 
product 
markings 
and logo for the Intel 
OverDrive 
processor 
with the attached 
heat sink will 
be included 
on a 1 in2 plate located 
on the top, cen- 


ter of the heat sink. The heat sink is omni-directional 
which 
allows 
the 
air to flow 
from 
any direction 
to 
achieve 
adequate 
cooling. 
The 
thermal 
resistance 
values 
for 
the 
Intel 
OverDrive 
processor 
with 
at- 
tached 
heat sink are shown 
in Table 7.2. 


Table 
7.2. Thermal 
Resistance 
for the Intel 
OverDrlve™ 
Processor 
with 
Attached 
Heat Sink 


9JS = 
2.S·C/W 
0 


NOTE: 
'The thermal resistance from junction to ambient (6JN 
in 
static air is actually a linear function of power dissipation. 
The value shown in the table (14.0'C/W) 
represents the 
worst case expected value which is derived from a power 
dissipation of 2.9W. The maximum expected power dissipa- 
lion of 6W yields a 6JA value of 13.1'C/W. 


7.2 Intel OverDrive™ Processor 
without Heat Sink 


The 20 MHz Intel OverDrive 
processor, 
for 16 MHz 
and 20 MHz i486 SX CPU systems, 
is supplied 
with- 
out a heat sink. Table 
7.3 contains 
the thermal 
re- 
sistance 
values 
for 
the 
Intel 
OverDrive 
processor 
without 
heat sink. 


Table 
7.3. Thermal 
Resistance 
for the Intel 
OverDrive™ 
Processor 
without 
Heat Sink 


9JC = 
2.0·C/W 
0 
800 


10.0 


NOTE: 
'The thermal resistance from junction to ambient (6JN 
in 
static air is actually a linear function of power dissipation. 
The value shown in the table (19.O'C/W) 
represents the 
worst case expected value which is derived from a power 
dissipation of 2.9W. The maximum expected power dissipa- 
tion of 4W yields a 6JA value of 18.S'C/W. 


The methodology 
for calculating 
the heat dissipation 
for the 20 MHz Intel OverDrive 
processor 
(without 
a 
heat 
sink) 
and 
the 
25 
MHz 
or 
33 
MHz 
Intel 
OverDrive 
processor 
(with 
a heat 
sink) 
is identical 
• 
except 
that 
the 
reference' 
point 
for 
measuring 
the 
product 
temperature 
is different. 
The 20 MHz Intel 
OverDrive 
processor 
specifies 
Tcase (the tempera- 
ture at the outside 
center 
of the PGA package, 
op- 
posite 
the pins-see 
Figure 7.2) and 9JC (the ther- 
mal resistance 
from the silicon junction 
to the pack- 
age 
case) 
but 
the 
25 
MHz 
and 
33 
MHz 
Intel 
OverDrive 
processor 
specifies 
Tsink (the 
tempera- 
ture at the outside 
center 
base of the heat sink, not 
on the heat sink marking plate or cooling 
posts-see 
Figure 7.3) and 9JS (the thermal 
resistance 
from the 
silicon junction 
to the heat sink base). The relation- 


ships between 
temperature, 
thermal 
resistance 
and 
power are shown 
in the following 
equations: 


Tcase = Tambient + (Pmax 
' 6CN and 


T sink = T ambient + (P max ' 6SN 


where, 


Tambient = Ambient Temperature 


Pmax = Power (Ice' Vce), 


6CA = 6JA - 
6JC. 


6SA = 6JA - 
6Js· 


Figure 7.2.Case Temperature Measurement without Heat Sink (0.005"Dia. Thermocouple 
on the Center of Package Top Surface with a 90· Angle Adhesive Bond) 


Figure 7.3.Heat Sink Measurement (0.005"Dia. Thermocouple 
on the Center of Heat Sink with a 
90· Angle Adhesive Bond through a Hole Drilled through the Center of the Name Plate) 


The maximum allowable ambient temperature as computed from these thermal equations is shown in 
Table 7.4. 


fCLK 
Airflow-Linear 
ftlmin 
(m/sec) 


(MHz) 
0 
200 
400 
600 
800 
(0) 
(1.01) 
(2.03) 
(3.04) 
(4.06) 


OverDrive Processor 
16 
42 
51 
62 
67 
70 
without Heat Sink(1) 
20 
29 
41 
54 
60 
64 


OverDrive Processor 
25 
30 
49 
61 
67 
70 
with Heat Sink(2) 
33 
16 
40 
55 
63 
66 


NOTES: 
1. The 20 MHz OverDrive 
processor 
does 
not have a heat sink. 
2. The 25 MHz and 33 MHz OverDrive 
processors 
have a heat sink attached. 
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APPENDIX 
A 
"END USER EASY" UPGRADABILITY* 


PC buyers value easy and safe upgrade 
installation. 
PC manufacturers 
can make the Intel OverDrive 
proces- 
sor installation 
in the Intel OverDrive 
processor 
socket 
simple and foolproof 
for the end user and reseller 
by 
implementing 
the suggestions 
listed in Table A-1. 


"End User Easy" 
Implementation 
Feature 


Visible OverDrive 
Processor 
Socket 
The Intel OverDrive 
processor 
socket should be easily visible when 


the PC's cover is removed. 
Label the Intel OverDrive 
processor 


socket and the location 
of pin 1 by silk screening 
this information 
on 


the PC board. 


Accessible 
OverDrive 
Processor 
Make the Intel OverDrive 
processor 
socket easily accessible 
to the 
Socket 
end user (i.e., do not place the Intel OverDrive 
processor 
socket 


under a disk drive). If a Low Insertion 
Force (L1F) or screw machine 


socket is used, position the Intel OverDrive 
processor 
socket on the 


PC board such that there is ample clearance 
around the socket. 


Foolproof 
Chip Orientation 
Intel packages 
all Intel OverDrive 
processors 
in a 169-pin, PGA 
package. 
The 169th pin is called the "key" 
pin and insures that the 


Intel OverDrive 
processor 
fits into a 169-pin socket 
in only the 


correct 
orientation. 
Supplying 
a 169-pin socket as the Intel 


OverDrive 
processor 
socket eliminates 
the possibility 
of end users 


or resellers 
damaging 
the PC board or Intel OverDrive 
processor 
by 


powering 
up the system with the Intel OverDrive 
processor 
incorrectly 
oriented. 


Zero Insertion 
Force 
The high pin count of the Intel OverDrive 
processor 
makes the 


Upgrade Socket 
insertion 
force required for installation 
in a screw machine 
PGA 


socket excessive 
for end users or resellers. 
Even most Low 


Insertion 
Force (L1F) sockets 
often require more than 60 Ibs. of 


insertion 
force.'A 
Zero Insertion 
Force (ZIF) socket 
insures that the 


chip insertion 
force does not damage the PC board. If the ZIF 


socket has a handle, be sure to allow enough clearance 
for the 


socket handle. If a L1For screw machine 
socket is used, additional 


PC board support 
is recommended. 


"Plug and Play" 
Jumper or switch changes 
should not be needed to electrically 


configure 
the system for the Intel OverDrive 
processor. 


Thorough 
Documentation 
Describe 
the Intel OverDrive 
processor's 
installation 
procedure 
in 


the PC's User's Manual. 


"This section applies to the 169-lead PGA versions of the OverDrive processor only. The "End-User Easy" criteria have 
been incorporated into the OverDrive Ready program. OverDrive Ready is a trademark of Intel Corp. 
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APPENDIX B 
ZIF and LIF SOCKET VENDORS 


The 
following 
lists 
provide 
examples 
of 
sockets 
which can be used as the Intel OverDrive 
socket 
for 
Intel486 
SX and Intel486 
OX CPU based systems. 


NOTE: 
This is not a comprehensive 
list. Intel has not test- 
ed the sockets 
listed 
below 
and cannot 
guarantee 
that these 
sockets 
will meet every 
PC manufactur- 
er's specific 
requirements. 


Zero Insertion 
Force 
Upgrade 
Sockets 
and 
Vendors: 


1. AMP Inc. 
P.O. Box 3608 
Harrisburg, 
PA 17105·3608 
Tel: (800) 522·6752 
Part Number: 
55287-3 
Contact: 
Rick 
Simonic, 
New 
Product 
Manager 
(717) 561·6143 


2. Augat 
Inc. 
425 John Dietsch 
Blvd. 
Attleboro 
Falls, MA 02763 
(508) 699-9800 


3. Aries Electronics 
P.O. Box 130 
Frenchtown, 
NJ 08825 
Tel: (908) 996-6841 
Part Number: 
169-PRS17012-10 
Contact: 
Frank 
Folmsbee, 
Marketing 
Manager 
(908) 996-6841 


4. JAE 
599 N. Mathilda 
Ave., Suite 8 
Sunnyvale, 
CA 94086 
Tel: (408) 733-0493 
Part Number: 
PCPS-169-002 
Contact: 
Bob Gerleman, 
Western 
Sales Manager 
(408) 733-0493 


5. Thomas 
and Betts 
200 Executive 
Center 
Drive 
P.O. Box 24901 
Greenville, 
SC 29616-2401 
Tel: (803) 676-2900 
Part Number: 
PGA 169A 17-S-1 AC 
Contact: 
Scott 
Roland, 
Product 
Marketing 
Manager 
(803) 676-2910 


6. Yamaichi 
Electronics 
1420 Koll Circle, Suite B 
San Jose, CA 95112 
Tel: (408) 452-0797 
Part Number: 
NP111-16911-G4 
Contact: 
Jim Bennett, 
Sales Manager 
(408) 452-0797 


1. AMP Inc. 


P.O. Box 3608 
Harrisburg, 
PA 17105·3608 
Tel: (800) 522-6752 
Part Number: 
(Premium 
Base Material) 
55589-5 
(Standard 
Base Material) 
916227-3 


2. Thomas 
and Betts 
200 Executive 
Center 
Drive 
P.O. Box 24901 
Greenville, 
SC 29616-2401 
Tel: (803) 676-2900 
Part Number: 
LPG 169A 17-S-1 AC 
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82091AA 
ADVANCED 
INTEGRATED 
PERIPHERAL (AlP) 
• Single-Chip PC Compatible I/O Solution 
• Floppy Disk Controller Features, 


for Notebook and Desktop Platforms: 
-100% 
Software Compatible with 
- 
82078 Floppy Disk Controller Core 
Industry Standard 82077SL and 
- 
Two 16550Compatible UARTs 
82078 
- 
One Multi-Function Parallel Port 
-Integrated 
Analog Data Separator 
-IDE 
Interface 
250K,300K,500K,1M 
-Integrated 
Back Power Protection 
- 
Programmable Powerdown 
-Integrated 
Game Port Chip Select 
Command 
- 
5V or 3.3V Supply Operation with 5V 
- 
Auto Powerdown and Wakeup 
Tolerant Drive Interface 
Modes 
- 
Full Power Management Support 
-Integrated 
Tape Drive Support 
- 
Supports Type F DMA Transfers for 
- 
Perpendicular Recording Support for 
Faster I/O Performance 
4 MB Drives 
- 
No Wait-State Host I/O Interface 
- 
Programmable Write 
- 
Programmable Interrupt Interfaces 
Pre-Compensation Delays 
- 
Single Crystal/Oscillator 
Clock 
- 
256 Track Direct Address, Unlimited 
(24 MHz) 
Track Support 
- 
Software Detectable Device 10 
-16 
Byte FIFO 
- 
Comprehensive Powerup 
- 
Supports 2 or 4 Drives 
Configuration 
• 16550Compatible UART Features • 
• The AlP Is 100% Compatible with EISA, 
- 
Two Independent Serial Ports 
ISA and AT 
- 
Software Compatible with 8250 and 
• Host Interface Features 
16450 UARTs 


- 
8-Blt Zero Wait-State ISA Bus 
- 
16 Byte FIFO per Serial Port 


Interface 
- 
Two UART Clock Sources, Supports 


- 
DMA with Type F Transfers 
MIDI Baud Rate 


- 
Five Programmable ISA Interrupt 
• IDE Interface Features 
Lines 
- 
Generates Chip Selects for IDE 
-Internal 
Address Decoder 
Drives 
• Parallel Port Features 
-Integrated 
Buffer Control Logic 


- 
All IEEE Standard 1284 Protocols 
- 
Dual IDE Interface Support 


Supported (Compatibility, Nibble, 
• Power Management Features 
Byte, EPP,and ECP) 
- 
Transparent to Operating Systems 
- 
Peak BI-Dlrectlonal Transfer Rate of 
and Applications Programs 
2 MB/sec 
-Independent 
Power Control for Each 
-16 
Byte FIFO for ECP 
Integrated Device 
• 100-Pln OFP Package 


See Packaging 
Spec. 
240600 


The 82091AA 
Advanced 
Integrated 
Peripheral 
(AlP) is an integrated 
I/O 
solution 
containing 
a floppy 
disk 
controller, 
2 serial ports, a multi-function 
parallel port, an IDE interface, 
and a game port on a single chip. The 


integration 
of these 
I/O 
devices 
results 
in a minimization 
of form factor, 
cost and power 
consumption. 
The 


floppy 
disk controller 
is the 82078 
core with a data rate up to 2 Mbs. The serial ports are 16550 compatible. 


The parallel port supports 
all of the IEEE Standard 
1284 protocols 
(ECP, EPP, Byte, Compatibility, 
and Nibble). 


The IDE interface 
supports 
8-bit or 16-bit programmed 
I/O and 16-bit DMA. The Host Interface 
is an 8-bit ISA 
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interface 
optimized 
for type "F" 
DMA and no wait-state 
I/O accesses. 
Improved 
throughput 
and performance, 
the AlP contains 
six 16-byte FIFOstwo 
for each serial port, one for the parallel port, and one for the floppy disk 
controller. 
The AlP also includes 
power management 
and 3.3V capability 
for power sensitive 
applications 
such 


as notebooks. 
The AlP supports 
both motherboard 
and add-in card configurations. 


Host 
Interface 


4 
IDE 
IDE 
Interface 


Floppy 
Drives 
16 
FDC 
82078 


2 
Clock 
Oscillator 


Serial 
Port 
16550 


Serial 
Port 
16550 


Parallel 
Port 


Parallel 
Port 
Buffer 
Direction 
or 
Game Port 
Chip Select 


PPDIR or 
GAME 
Chip Select 
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82078 CHMOS SINGLE-CHIP 
FLOPPY DISK CONTROLLER 
• Small Footprint and Low Height 
• Integrated Analog Data separator 


Packages 
- 
250 Kblts/sec 


• Supports Standard 5.0Y as Well as Low 
- 
300 Kblts/ sec 
- 
500 Kblts/ sec 
Yoltage 3.3Y Platforms 
-1 
Mblts/sec 
- 
selectable 3.3Y and 5.0Y 
Configuration 
- 
2 Mblts/sec 


- 
5.0Y Tolerant Drive Interface 
II Integrated Tape Drive Support 
• Enhanced Power Management 
-Standard 
1 Mbps/500 Kbps/ 


- 
Application Software Transparency 
250 Kbps Tape Drives 


- 
Programmable Powerdown 
- 
New 2 Mbps Tape Drive Mode 


Command 
• Perpendicular Recording Support for 


- 
save and Restore Commands for OY 
4 MB Drives 
Powerdown 
• Fully Decoded Drive select and Motor 
- 
Auto Powerdown and Wakeup 
Signals 
Modes 
- 
Two External Power Management 
• Programmable Write Precompensatlon 
Pins 
Delays 
- 
Consumes No Power While In 
• Addresses 256 Tracks Directly, 


Powerdown 
Supports Unlimited Tracks 


• Programmable Internal Oscillator 
• 16 Byte FIFO 
• Floppy Drive Support Features 
• Single-Chip Floppy Disk Controller • 


- 
Drive Specification Command 
Solution for Portables and Desktops 


- 
Media 10 Capability Provides Media 
-100% 
PC·AT- Compatible 
Recognition 
- 
100% PS/2- Compatible 


- 
Drive 10 Capability Allows the User 
- 
100% PS/2 Model 30 Compatible 
to Recognize the Type of Drive 
- 
Fully Compatible with Intel's 386SL 
- 
Selectable Boot Drive 
Microprocessor SuperSet 
- 
Standard IBM and ISO Format 
-Integrated 
Drive and Data Bus 


Features 
Buffers 


- 
Format with Write Command for 
Available In 64 Pin QFP and 44 Pin QFP 
High Performance In Mass Floppy 
• 
Duplication 
Package 
(See Package 
Specification 
Order 
Number 
240800, 
Package 
• Integrated Host/Disk Interface Drivers 
Type 
S) 


The 82078 
Product 
Family brings a set of enhanced 
floppy 
disk controllers. 
These 
include 
several 
features 
that allow for easy implementation 
in both the portable 
and desktop 
market. 
The current 
family includes 
a 64 
pin and a 44 pin part in the smaller form factor QFP package. 
The 3.3V version 
of the 64 pin part provides 
an 
ideal solution 
for the rapidly emerging 
3.3V platforms. 
It also allows for a 5.0V tolerant 
floppy 
drive interface 
that lets the users retain their normal 
5.0V drives. Another 
version 
of the 64 pin part provides 
support 
for 2 
Mbps data rate tape drives. 


3.3V 
5.QV 
2 Mbps Data Rate 


82078SL 
X 
X 


82078-1 
X 
X 


The 44 pin is targeted for platforms 
that are operated at 3.3V or 5.0V and do not require more than two drive 
support. The 82078-5 is designed for price sensitive 
5.0V designs which do not include 4 MB drive support. 


3.3V 
5.QV 
1 Mbps Data Rate 


82078 
X 
X 


82078-5 
, 
X 


82078-3 
X 
X 


Both parts can be operated at 1 Mbps/500 
Kbps/300 
Kbps/250 
Kbps. Additionally, 
one version of the 64 pin 
part provides 2 Mbps data rate operation 
specific for the new tape drives. 
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8207844 
PIN 
CHMOS SINGLE-CHIP FLOPPY DISK CONTROLLER 
• Small Footprint 
and Low Height 
• Integrated Tape Drive Support 
Package 
- 
Standard 1 Mbps/500 Kbps/ 
• Enhanced Power Management 
250 Kbps Tape Drives 


- 
Application 
Software Transparency 
• Perpendicular 
Recording Support for 
- 
Programmable 
Powerdown 
4 MB Drives 
Command 
• Integrated 
Host/Disk 
Interface 
Drivers 
- 
Save and Restore Commands for 
Zero-Volt Powerdown 
• Fully Decoded Drive Select and Motor 
- 
Auto Powerdown and Wakeup 
Signals 
Modes 
• Programmable Write Precompensatlon 
- 
Two External Power Management 
Delays 
Pins 
- 
Consumes No Power While In 
• Addresses 256 Tracks Directly, 


Powerdown 
Supports Unlimited Tracks 
• Integrated Analog Data Separator 
• 16 Byte FIFO 


-250 
Kbps 
• Single-Chip Floppy Disk Controller 
-300 
Kbps 
Solution for Portables and Desktops 
-500 
Kbps 
-100% 
PC/AT* Compatible 
-1 
Mbps 
- 
Fully Compatible with Intel386TM SL 
• Programmable 
Internal Oscillator 
-Integrated 
Drive and Data Bus 
Buffers 
• Floppy Drive Support Features 
Separate 5.0V and 3.3V Versions of the 
- 
Drive Specification 
Command 
• 
- 
Selectable Boot Drive 
44 Pin part are Available 


- 
Standard IBM and ISO Format 
• Available In a 44 Pin QFP Package 
Features 
- 
Format with Write Command for 
High Performance 
In Mass Floppy 
Duplication 


The 82078, a 24 MHz crystal. a resistor 
package, 
and a device chip select implements 
a complete 
solution. 
All 
programmable 
options 
default to 82078 
compatible 
values. 
The dual PLL data separator 
has better 
perform- 


ance 
than 
most 
board 
level/discre1e 
PLL implementations. 
The FIFO allows 
better 
system 
performance 
in 
multi-master 
(e.g., Microchannel, 
EISA). 


The 82078 
maintains 
complete 
software 
compatipility 
with the 82077SLl82077 
AA/8272A 
floppy disk control- 


lers. It contains 
programmable 
power manageme"tfeatures 
while integrating 
all of the logic required for floppy 


disk control. 
The power 
management 
features 
are transparent 
to any application 
software. 


The 82078 
is fabricated 
with Intel's 
advanced 
CHMOS 
III technology 
and is also available 
in a 54-lead 
QFP 
package. 
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8207864 
PIN 
CHMOS SINGLE-CHIP FLOPPY DISK CONTROLLER 
• Small Footprint 
and Low Height 
- 
Selectable 
Boot Drive 
Packages 
-Standard 
IBM and ISO Format 
• Supports 
Standard 
5.0V as well as Low 
Features 
- 
Format 
with Write Command 
for 
Voltage 
3.3V Platforms 
High Performance 
in Mass Floppy 
- 
Selectable 
3.3V and 5.0V 
Configuration 
Duplication 


- 
5.0V Tolerant 
Drive Interface 
• Integrated 
Tape Drive Support 
• Enhanced 
Power 
Management 
- 
Standard 
1 Mbps/500 
Kbps/ 


- 
Application 
Software 
Transparency 
250 Kbps Tape Drives 


- 
Programmable 
Powerdown 
- 
New 2 Mbps Tape Drive Mode 


Command 
• Perpendicular 
Recording 
Support 
for 
- 
Save and Restore 
Commands 
for 
4 MB Drives 
Zero-Volt 
Powerdown 
• Integrated 
Host/Disk 
Interface 
Drivers 
- 
Auto Powerdown 
and Wakeup 
Modes 
• Fully Decoded 
Drive Select 
and Motor 
- 
Two External 
Power 
Management 
Signals 


Pins 
• Programmable 
Write Precompensatlon 
- 
Consumes 
no Power when In 
Delays 
Powerdown 
• Addresses 
256 Tracks 
Directly, 
• Integrated 
Analog 
Data Separator 
Supports 
Unlimited 
Tracks 
•• 


-250 
Kbps 
• 16 Byte FIFO 
-300 
Kbps 
-500 
Kbps 
• Single-Chip 
Floppy 
Disk Controller 
-1 
Mbps 
Solution 
for Portables 
and Desktops 
-2 
Mbps 
-100% 
PC AT* Compatible 
• Programmable 
Internal 
Oscillator 
- 
100% PS/2* 
Compatible 
- 
100% PS/2 
Model 30 Compatible 
• Floppy 
Drive Support 
Features 
- 
Fully Compatible 
with Intel386TM SL 
- 
Drive Specification 
Command 
Microprocessor 
SuperSet 
- 
Media ID Capability 
Provides 
Media 
• Integrated 
Drive and Data Bus Buffers 
Recognition 
- 
Drive ID Capability 
Allows the User 
• Available 
In 64 Pin QFP Package 
to Recognize 
the Type of Drive 


The 82078, a 24 MHz crystal, a resistor 
package, 
and a device chip select implements 
a complete 
solution. 
All 
programmable 
options 
default 
to 82078 
compatible 
values. 
The dual PLL data separator 
has better 
perform- 
ance 
than 
most 
board 
level/discrete 
PLL implementations. 
The FIFO allows 
better 
system 
performance 
in 
multi-master 
(e.g., Microchannel, 
EISA). 


The 82078 
maintains 
complete 
software 
compatibility 
with the 82077SLl82077 
AA/8272A 
floppy disk control- 
lers. It contains 
programmable 
power management 
features 
while integrating 
all of the logic required for floppy 
disk control. 
The 
power 
management 
features 
are transparent 
to any application 
software. 
There 
are two 
versions 
of 82078 
floppy 
disk controllers, 
the 82Q78SL and 82078-1. 


The 82078 
is fabricated 
with Intel's 
advanced 
CHMOS 
III technology 
and is also available 
in a 44-lead 
QFP 
package. 


Refer 
to the 1994 Peripheral 
Components 
Handbook 
for the complete 
data sheet 
on this device. 
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CHMOS SINGLE-CHIP FLOPPY DISK CONTROLLER 


• 
Integrated Analog Data Separator 
- 
250 Kblts/sec 
- 
300 Kblts/sec 
- 
500 Kblts/sec 
- 
1 Mblts/sec 
• 
Programmable Crystal Oscillator for On 
or Off 
• 
Integrated Tape Drive Support 


• 
Perpendicular 
Recording Support 
• 
12 mA Host Interface Drivers, 40 mA 
Disk Drivers 
• 
Four Fully Decoded Drive Select and 
Motor Signals 
• 
Programmable Write Precompensation 
Delays 


• 
Addresses 256 Tracks Directly, 
Supports Unlimited Tracks 


• 
16 Byte FIFO 
• 
68-Pln PLCC 
(See Packaging Handbook Order Number # 240800, Package 


• 
High Speed Processor Interface 
Type 
N) 


The 82077SL, 
a 24 MHz crystal, 
a resistor 
package, 
and a device 
chip select 
implements 
a complete 
laptop 
solution, 
All programmable 
options 
default 
to 82077AA 
compatible 
values, 
The dual PLL data separator 
has 


better 
performance 
than 
most 
board 
level/discrete 
PLL implementations. 
The 
FIFO 
allows 
better 
system 


performance 
in multi-master 
systems 
(e.g., Microchannel, 
EISA), 


The 82077SL 
is a superset 
of 82077 AA, The 82077SL 
incorporates 
power 
management 
features 
while main- 


taining 
complete 
compatibility 
with the 
82077 AA/8272A 
floppy 
disk controllers. 
It contains 
programmable 
power 
management 
features 
while 
integrating 
all of the 
logic 
required 
for floppy 
disk control. 
The 
power 


management 
features 
are transparent 
to any application 
software, 
The 82077SL 
is available 
in three 
ver- 


sions-82077SL-5, 
82077SL 
and 82077SL-1. 
82077SL-1 
has all features 
listed in this data sheet. 
It supports 


both tape drives and 4 MB floppy drives. The 82077SL 
supports 
4 MB floppy drives and is capable 
of operation 


at all data rates through 
1 Mbps. The 82077SL-5 
supports 
500/300/250 
Kbps data rates for high and low 


density 
floppy 
drives. 


The 82077SL 
is fabricated 
with Intel's 
advanced 
CHMOS 
III technology 
and is available 
in a 58-lead 
PLCC 
(plastic) 
package. 


• 
Completely 
Compatible with Industry 
Standard 82077AA 


• 
Single-Chip Laptop Desktop Floppy 
Disk Controller 
Solution 
-100% 
PC AT- Compatible 
- 
100% PS/2- Compatible 
-100% 
PS/2 Model 30 Compatible 
- 
Fully Compatible with Intel's 386SL 
Microprocessor 
SuperSet 
-Integrated 
Drive and Data Bus 
Buffers 


• 
Power Management Features 
- 
Application 
Software Transparency 
- 
Programmable 
Powerdown 
Command 
- 
Auto Powerdown 
and Wakeup 
Modes 
- 
Two External Power Management 
Pins 
- 
Typical Power Consumption 
In 
Power Down: 10 IJ-A 
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Figure 
1. 82077SL 
Pinout 


·PS/2 and PC AT are trademarks of IBM. 


Refer 
to the 1994 Peripheral 
Components 
Handbook 
for the complete 
data sheet 
on this device. 


89C024FT 


V.42/42bis 
MODEM CHIP SET 


• 
CHMOS for Low Operating Power 


• 
Low Standby Power 
• 
Minimum Chip Count for Small Size 


• 
V.42 Compliant Error Correction 
(LAPM and *MNP4) 


• 
V.42bis and MNP5 Data Compression 
Increase Throughput 
up to 4 Times 


with DTE Rates of 9600 


• 
AT Command Set 
• 
V.22bis, V.22 AlB, V.21, Bell 212A, and 
Bell 103 Compatible 


• 
Automatically 
Detects Remote Modem 


Type and Data Rate 


• 
On-Chip Hybrid 
• 
DTMF and Pulse Dialing 


• 
On-Chip Serial Port and Handshake 
Signals for RS-232/V.24 Interface 


• 
Serial Interface to External NVRAM 


• 
Automatic 
Speed Matching in Reliable 


and Normal Modes 


• 
Hardware and Software Flow Control 


• 
Analog/Digital 
Loopback Diagnostics 


• 
Synchronous 
Modes 
• 
Easily Customized Command Set and 
Features 


• 
Intel's V.42/42bis and MNP Software 
Co-Developed with R. Scott 
Associates ** 


• 
Packaging: 
- 
For Packages 
QN89026FT SV901 68-Pin PLCC 
QP89027 28-Pln PDIP 
Order Kit #89C024FT SZ502 


- 
For Packages 
QN89026FT SV901 68-Pin PLCC 
QN89027 28-Pln PLCC 
Order Kit #89C024FT SZ503 


(See Packaging 
Spec. 
Order 
# 240800-001) 


QAIrlI/PSK 
RECEIVER 


SIGNAL 
RECOVERY 


DESCRAWBlER. 
DECODER 


TELEPHONE 


LINE 


INTERrACE 
(OU' 


This product is not licensed by Hayes Microcomputer Products, Inc. ("Hayes") or Interni1lional Business Machines ("IBM"). 
Specific applications of this product may be determined by Hayes or IBM to require a license. Intel Corporation dt-ds not 
assure any liability or provide patent identification based on such determination. 


"MNP is a registered trademark of Microcom, Inc. 
""R. Scott Associates, Inc., 5711 Six Forks Road, Suite 301, Raleigh, North Carolina 27609, (919) 846-7171 


For the complete 
data sheet on this product, 
refer to the 1994 Connectivity 
handbook. 
October 1993 
Order Number: 290398-005 


• 


intel~ 
89C024LT 
ERROR CORRECTING 
LAPTOP 
MODEM CHIP SET 
• CHMOS 
for Low Operating 
Power 
• Hardware 
and Software 
Flow Control 


• Low Standby 
Power 
Requirements 
• Analog/Digital 
Loopback 
Diagnostics 
• Minimum 
Chip Count for Small size 
• Telephone 
Line Audio Monitor 
Output 
• MNp· 
Operation 
through 
Class 4 for 
• Full Set of Control 
Signals for DAA 
Error Correction 
Interface 
• MNP Class 5 Data Compression 
for 
• International 
Call Progress 
Tone 
Increased 
Throughput 
Detection 
Capabilities 
• AT Command 
Set 
• Automatic 
Adaptive 
Equalization 
• V.22 bls, V.22 AlB, 
V.21, Bell 212A, and 
• Synchronous 
Modes 
Bell 103 Compatible 
• Easily Customized 
Command 
Set and 
• For Public Switched 
Telephone 
Features 
Network 
and Unconditioned 
Leased 
Intel's 
MNP Software 
Co-Developed 
Line Applications 
• with R. Scott Associates" 
• Automatically 
Detects 
Remote 
Modem 
• Packaging: 
Type and Data Rate 
- 
For Packages 
• On-Chip 
Hybrid 
QN89026L T SV782 68-Pln PLCC 
• DTMF and Pulse Dialing 
QN89027 
28-Pln PDIP 


On-Chip 
Serial Port and Handshake 
Order Kit # 89C024L T SZ504 
• 
- 
For Packages 
Signals for RS-232/V.24 
Interface 
QN89026L T SV782 68-Pln PLCC 
• Serial Interface 
to External 
NVRAM 
QN89027 
28-Pln PLCC 
• Automatic 
Speed 
Matching 
In MNP and 
Order Kit # 89C024L T SZ505 


Normal 
Modes 
(See Packaging 
Specifications 
Order 
Number 
240800..Q(1) 


89C026LT 
89027 


QAW/PSK 
RECEIVER 
SIGNAL 
RECOVERY 


DESCRAW8LER. 
DECODER 


TELEPHONE 
LINE 


INTERFACE 
(OM) 


Figure 1. 89C024L T System 
Block Diagram 
This product 
is not licensed 
by Hayes 
Microcomputer 
Producls, 
Inc. ("Hayes"). 
Specific 
applications 
of this product 
may be determined 
by Hayes 
to 
require 
a license. 
Intel Corporation 
does 
not assume 
any liability 
or provide 
patent 
indemnification 
based 
on such 
determination. 


°MNP 
is a registered 
trademark 
of Microcom, 
Inc. 


o0A. Scott 
Associates, 
Inc., 5711 Six Forks 
Road, Suite 301. Raleigh, 
North Carolina 
27609, 
(919) 846-7171 


For the complete 
data sheet on this product, 
refer to the 1994 ConnectiVity 
handbook. 


89C124FX 


DATA/FAX 
MODEM CHIP SET 


• 9600 bps Send and Receive 
FAX 
• On-Chip 
Hybrid 


• V.29 and V.27ter 
Compatible 
• DTMF and Pulse Dialing 


• Supports 
Communicating 
Applications 
• Automatic 
Speed 
Matching 
In Reliable 
Specification 
(CAS) 
and Normal 
Modes 


• EIA/TIA-578 
Compliant 
FAX Command 
• Serial Interface 
to External 
NVRAM 


Set (Service 
Class 1) 
• Hardware 
and Software 
Flow Control 
• Compatible 
with CCITT 
Group 3 FAX 
• Analog/Digital 
Loopback 
Diagnostics 
Machines 


2400 bps Data Modem 
• Automatically 
Detects 
Remote 
Modem 
• 
Type and Data Rate 
• V.22bls, 
V.22 A/B, 
V.21, Bell 212A, and 
• Easily Customized 
Command 
Set and 
Bell 103 Compatible 
Features 
• V.42 Compliant 
Error Correction 
(LAPM 
• Synchronous 
Modes 
and *MNP4) 


V.42bis and MNP5 Data Compression 
• On-Chip Serial Port Handshake 
Signals 
• 
for RS-232/V.24 
Interface 
• Easy Upgrade 
from Intel Modem 
Chip 
• Packaging 
Sets 
-89127: 
• AT Command 
Set 
28-Lead 
PDIP and PLCC, 64-Lead • 
• Minimum 
Chip Count for Small Size 
QFP Packages 
Package 
Type P, Nand 
S 
• CHMOS for Low Operating 
Power 
-89C126FX: 
• Low Standby 
Power 
68-Lead 
PLCC and 80-Lead 
QFP 
Package 
Type Nand 
SB 


(See 
Packaging 
Spec 
Order 
No. 240800) 


This product is not licensed by Hayes Microcomputer Products, Inc. ("Hayes") or International Business Machines ("IBM"). 
Specific applications of this product may be determined by Hayes or IBM to require a license. Intel Corporation does not 
assume any liability or provide patent indemnification based on such determination. 
·MNP is a registered trademark of Microcom, Inc. 
•• R. Scott Associates, Inc., 7701 Six Forks Rd, Suite 120, Raleigh, North Carolina 27615, (919) 846-7171 


For the complete data sheet on this product, refer to the 1994 Connectivity handbook. 
October 1992 
Order Number: 290415-002 


82~95 
ISA/PCMCIA 
HIGH INTEGRATION 
ETHERNET CONTROLLER 


• 
Optimal Integration for Lowest Cost 
Solution 
- 
Glueless 8-BIt/16-Bit ISA/PCMCIA 2.0 
Bus Interface 
- 
Provides Fully 802.3 Compliant AUI 
and TPE Serial Interface 
- 
Local DRAM Support up to 
64 Kbytes 
- 
FLASH/EPROM Boot Support 
- 
Hardware and Software Portable 
between Motherboard, Adapter, and 
PCMCIA 10 Card Solution 


• 
High Performance Networking 
Functions 
-16-Blt 
10 Accesses to Local DRAM 
with Zero Added Wait-States 
- 
Ring Buffer Structure for Continuous 
Frame Reception and Transmit 
Chaining 
- 
Automatic Retransmission on 
Collision 
- 
Automatically Corrects TPE Polarity 
Switching Problems 


• 
Low Power CHMOSIV Technology 


• 
Ease of Use 
- 
Design Time Reduced by High 
Integration 
- 
EEPROMInterface to Support 
Jumperless Design 
- 
Software Structures Optimized to 
Reduce Processing Steps 
- 
Automatically Maps into Unused PC 
10 Location to Help Eliminate LAN 
Setup Problems 
- 
All Software Structures Contained in 
One 16-Byte 10 Space 
- 
Automatic or Manual Switching 
between TPE and AUI Ports 


-JTAG 
Port for Reduced Board 
Testing Times 


• 
Power Manageme_nt__ 
- 
SL Compatible SMOUT Power Down 
Input 
- 
Software Power Down Command for 
non-SL Systems 


• 
144-Lead tQFP Package Provides 
Smallest Available Form Factor 
(See 
Packaging 
Spec., 
Order 
No. 240800) 


AO-9 


A14-19 
ISA/PCMCIA 
Bus 
Interface 


Local Memory 
Interface 
(DMA) 


TPE Serial 
Interface 


TPE 
Link IIF 


CSMA/CD 
Unit 


LED 
Control 


AUI Serial 
Interface 
AUI 
Link I/F 


82593 
CSMA/CD 
CORE LAN CONTROLLER 


• 
Supports 
Industry 
Standard 
LANs 
-IEEE 
10BASE5 
(Ethernet·) 
-IEEE 
10BASE2 
(Cheapernet) 
-IEEE 
10BASE-T 
(TPE) 


• 
Simple, 
High-Performance 
Control 
and 
Data Interface 
- 
Control 
and Status via RD, WR, and 
CS Lines 
- 
Data Transfers 
via DMA Interface 
- 
Two Clocks 
per DMA Transfer 
- 
Programmable 
Bus Throttle 
Timer 


• 
High-Performance 
Networking 
Functions 
- 
Automatic 
Retransmission 
from 
Internal 
FIFO 
- 
Back-to-Back 
Frame Reception 
with 
No CPU Intervention 
- 
Receive 
Ring Buffer 
Memory 
Structure 
- 
Transmit 
Frame Chaining 
- 
96-Byte 
Transmit 
FIFO and 96-Byte 
Receive 
FIFO 


Two 
Channel 
OW. 
Interface 


Logic 


• 
High Speed, 
5V CHMOS 
IV (P648.8) 
Technology 


• 
Serial Bit Rates up to 20 Mb/s 
(82593SX) 
- 
Direct Interface 
to Intel 82C501AD 
ESI or AMD 7992 SIA 
- 
Conforms 
to 802.3 CSMA/CD 
Standard 


• 
On-Board 
Diagnostics 
-Internal 
and External 
Loopback 
Operation 
-Internal 
Register 
Dump 
- 
TDR Functionality 


• 
44-Lead 
PLCC Package 
Type N 
(82593SX), 
44-Lead 
QFP Package 
Type S (82593SX) 
or 28-Pln PDIP 
- 
82593SX 
(8/16-Blt) 
System 
Clock up 
to 20 MHz 
- 
82593SX 
Package 
Pin Compatible 
with Intel 82592 PLCC 
• 


(See Packaging 
Spec., 
Order 
No. 240800-001 
Packaga 
Type 
N, S 
~ 
and P) 


CSWA/CO 
RXO 


RXC 


TXO 


WEOIUW 
TXC 
ACCESS 


CONTROL 


UNIT 
CRS 


COT 


RTS 


CTS 


Internal 
R.transmlt 
Control 


Configuration 
Registers 
.t IA 


For the complete data sheet on this product, refer to the 1994 Networking handbook. 
October 1992 
Order Number: 290411-ClO4 


82503 
DUAL SERIAL TRANSCEIVER 
(DST) 


• 
Single Component 
Ethernet· 
Interface 
to Both 802.3 10BASE-T 
and AUI 


• 
Automatic 
or Manual Port Selection 


• 
Manchester 
Encoder/Decoder 
and 
Clock Recovery 


• 
No Glue Interface 
to Industry-Standard 
LAN Controllers 
-Intel 
82586, 82590, 82593 and 82596 
- 
AMD 7990 (LANCE*) 
- 
National 
Semiconductor 
8390 and 
83932 (SONIC·) 
- 
Western 
Digital 83C690 
- 
Fujitsu 86950 (Etherstar·) 


• 
Diagnostic 
Loopback 


• 
Reset, 
Low Power 
Modes 


• 
Network 
Status 
Indicators 


• 
Defeatable 
Jabber 
Timer 


• 
User Test Modes 


• 
10 MHz Transmit 
Clock Generator 


• 
One Micron CHMOS·· 
IV (Px48) 
Technology 


• 
Single 5-V Supply 


TPE 
• 
Complies 
with 10BASE-T, 
IEEE Std. 
802.3i-1990 
for Twisted 
Pair Ethernet 


• 
Selectable 
Polarity 
Switching 


• 
Direct Interface 
to TPE Analog 
Filters 


• 
On-Chip TPE Squelch 


• 
Defeatable 
Link Integrity 
(L1) 


• 
Support 
of Cable Lengths> 
100m 


AUI 
• 
Complies 
with IEEE 802.3 AUI Standard 


• 
Direct Interface 
to AUI Transformers 


• 
On-Chip AUI Squelch 


A block diagram 
of a typical 
application 
is shown in Figure 1. The 82503 Dual Serial Transceiver 
is a high-inte- 


gration 
CMOS 
device 
designed 
to simplify 
interfacing 
industry 
standard 
Ethernet 
LAN Controllers 
to IEEE 
802.3 
local area network 
applications 
(10BASE5, 
10BASE2, 
and 10BASE-T). 
The component 
supports 
both 
an attachment 
unit interface 
(AUI) and a Twisted 
Pair Ethernet 
interface 
(TPE). It allows 
OEMs 
to design 
a 
state-of-the-art 
media 
interface 
that is jumperless 
and fully automatic. 
The 82503 
includes 
on-chip 
AUI and 
TPE drivers 
and receivers; 
it offers designers 
a cost-effective, 
integrated 
solution 
for interfacing 
LAN control- 


lers to the wire medium. 


""CHMOS 
is a patented 
process 
of Intel Corporation. 
"Ethernet 
is a registered 
trademark 
of Xerox Corporation. 
LANCE 
is a registered 
trademark 
of Advanced 
Micro 
Devices. 


Etherstar 
is a registered 
trademark 
of Fujitsu 
Electronics. 


Sonic 
is a registered 
trademark 
of National 
Semiconductor 
Corporation. 


For the complete data sheet on this product, refer to the 1994 Networking handbook. 
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Ethemet* LAN Card Product Brief 


Product - 
Complete 
plug and play 
H' hI' ht 
PCMCIA 
LAN solution. 
Comes 
Ig 
Ig 
S 
with all drivers, installation, 


card and socket services and 


card management 
software 
necessary 
for reliable operation 
. 


in a PCMCIA 
slot. 


_ 
Drivers for all major network 


operating 
systems. 


_ 
Industry.recognized 
Intel 


SoftSet installation 
software. 
Easy to operate and manage. 


_ 
Based on highly integrated 
Intel 


82595 Ethernet 
Controller 


_ 
Complies 
with PCMCIA 
2.0/ 


JEIDA 4.1 68-pin standard. 


_ 
5 mm·thick 
PCMCIA 
Type II 


card. 


_ 
Detachable 
line adapter module 


(LAM) for multiple media 
attachment. 


_ 
Ethernet 
IEEE 802.3 


compatibility 
(lOBASE· T/TPE, 
10BASE· 
2/BNC). 


intet 


The Intel PCMCIA Ethernet LAN Card brings high pelformance 
and ease of use to 
PCMCIA networking. 
It lets you plit neMorking 
capabilities 
into laptop computers 
without a lot of hassle, headache or expense. 


It's a plug and play solution, prol'iding PCMCIA neMork- 
readiness right out of 
the box. All the sofMare )'ouneed 
comes with the card: dri,'ers for the most popular 
neMork operating systems (Nol'ell NetWare* 2.2, 3.11. 4.0 and Lite 1.0; Microsoft* 
LAN Manager* 2.x; 18M* LAN Sen'er* 2.x; 8an)'an Vines* 4.x; and Microsoft 
Windows for Workgroup* 
3.1 J, PCMClA- 
compliam card and socket sen'ices from 
S)'stemSoft*, and Intel's own card manager and Softset awo-configuration, 
auto- 
installation sofMare that gets users on the neMorkfast. 


The Intel PCMCIA Ethernet LAN Card is based on the highly imegrated Imel 
825951SAIPCMCIA 
Ethernet Comroller, 
gil'ing )'ou 16-bit desktop pelformance 
in 
a PCMCIAformfactor. 
As we del'elop additional software functionality 
for our 
82595 line. you'll be able to leverage it across ),our entire 82595-based 
product 
line, from chips to NICs to PCMCIAformfactor 
products. 
You decrease ),our time 
to market b)' taking ad\'Ontage of Intel's product del'elopment 
efforts, while 
increasing the \'Olue of your products. 


The card is standard PCMCIA 2.0 68-pinformfactor 
and ani)' 5 mm thick. It' s 
passed Intel's extensi,'e quality and reliability testing to ensure that it stands up to 
the rigors of mobile users on the go. These include PCMClA mechanical 
qualification 
testing such as torque, bend, shock, vibration and enl'ironmental 
testing across extreme temperatllres 
and mltages. 
Our CMOS technology and 
highly imegrated 82595 are IWy power efficient, letting mobile users stay 
connected to the neMorkfor 
a long rime. Maximum power draw is 85 mA; in idle 


mode, power consumption 
drops to 20 mA. 


Finally, we'l'e made it easy to customi:e 
our card and its accessories 
to your own 
OEM marketing needs. Manuals, sofMare diskettes and the cards themsell'es can 
all be railored to reflect your company's 
look. 


• 


The Intel PCMCIA 
Ethernet LAN Card is the fastest, easiest way to deliver network- 
ready laptops to your customers. 
It snaps in and installs in minutes, not hours, thanks 
to a disk-full of software to make your job easier. Our industry-standard 
SoftSet 


installation 
utility automatically 
configures 
the card and sets up the software with a 
single command. 
Built-in card and socket services software provides.card 
recogni- 


tion and compatibility. 
Built-in card management 
software performs 
IRQ manage- 


ment and allows the card to be installed even when the system is running, so you 
don't have to reboot. There's 
even built-in driver support for popular network 


operating 
systems from Novell, Microsoft, 
IBM and Banyan. There are no jumpers 
or switches to set, no IRQ addresses to labor over. The card's installed and working 


in five minutes. Once installed, it's easy to operate too, improving customer 


satisfaction 
and decreasing 
the number of support calls you receive. 


The card is fully PCMCIA 2.0/JEIDA 
4.1 compliant 
with a standard 68-pin form 


factor. It's also fully compliant 
with Ethernet IEEE 802.3 standard for IOBASE-T 


and IOBASE-2 wiring. 


There's 
a detachable 
line adapter module (LAM) for attachment 
to multiple media 


and LEDs to indicate active and link integrity. 


Features 
Benefits 


- 
Installation, 
card manager, and card 
- 
A complete solution; no need for any other pieces 
and socket services software 
Easy to install, easy to configure, 
easy to use 


- 
Broad client driver support 
- 
Meets broad target market. Usable on industry- 
standard networks like Novell, 
LAN Manager and Banyan 


- 
Complies 
with PCMCIA 
2.0/JEIDA 
4.1 
- 
Nojumpers 
standards 
Easy to transport 
Small form factor 


- 
Intel-based 
82595 
- 
Great performance; 
8 & 16-bit data path 


Glueless 
interface to PCMCIA 
Bus 


- 
Activity and link integrity LEDs 
- 
Indicates card status, improves diagnostics 


- 
Supports both twisted pair Ethernet (I OBASE- T) 
- 
Flexibility 
to connect to multiple network media 
and ThinCoax 
(BNC IOBASE-2) 


Product Codes 


MBLA8110 


MBLA8l20US 


MBLA8120EU 


TPE all geographies 


BNC US 
BNC Europe 


Local Area Networking 
Family Product Brief 


EtherExpress 
Family Brochure 
TokenExpress 
Family Brochure 


Intel 82595 Data Sheet 
Intel 82595 User's Manual 


297085-002 


D413.01 


D4l4.02 


290458-003 
TBD 
• 


DataFax 14.4 Card Product Brief 


Product 
- 
LAM-less design (Integrated 
Highlights 
DAA) 


_ 
Complies with PCMCIA 
2.0 and 


JEIDA 4.1 standards 


_ 
PCMCIA 
Type II card- 


5.0 mm thick 


_ 
Fully compliant 
with CCnT 


T.30 and T.4 (Group 3 fax) 


_ 
Compliant 
with CCITT 
V.17 


(14.4 Kbps fax) 


_ 
Provides 
both Send and Receive 


fax capability 


_ 
8-bit I/O bus interface 


_ 
Includes 
Ring Detect notification 


to host computer 
in the power 
down mode 


_ 
FCC Class B, UL and DOCI 


UL (Canada) 


_ 
Compatible 
with EIAtrIA 
602 


(AT command 
set) 


_ 
Data modem complies with 


CCITT 
V.32bis, V.32, V.22bis, 
V.22, V.21, Bell· 212a, 103 


_ 
Supports 
CCITT 
V.42 error 


detection 
and V.42bis data 
compression 


_ 
Provides 
MNP/S· 
data compres· 


sion for backward 
compatibility 


intet 


With the design of the DataFax 14.4, Intel introduces a high speed fax card 
with no external line adapter module. Lightweight. 
Easy to carry. And less 
chance of damage or loss. Not only does the DataFax 14.4 feature inte- 
grated DAA, blll its high speed transmission helps your customers reduce 
the telephone expenses associated with faxes and modems. 


The DataFax 14.4 fax card is also Croup 3 compliant, assuring worldwide 
compatibility with most fax machines operating today. It sends and receil'es 
faxes and transfers files to orfrom notehook or hand-held computers over 
the puhlic telephone network. 


The hlfel DataFax 14.4 also conforms to PCMCIA 2.0 AND JElDA 4.1 
physical and electrical standards for pOl·tahie computers. 
The si:e of a 


credit card, the DataFax 14.4 slides into an external slot in the notebook 
complller, and connects easily into the public switched telephone network. 
The card feall/res four power management modes - 
on-line, actil'e, power 


sal'e, and power down-to 
ensure the lowest possihle power consumption. 


No external power is required. 


The DataFax 14.4 combines high speed with an integrated DAA design to provide 
optimal connectivity 
for notebooks and sub-notebooks. 
PCMCIA cards enhance the 


multiple functionality 
of the notebook computer. 
Intel's integrated DAA design 
incorporates 
the telephone interface circuitry directly onto the card. This includes a 


ring detector and telephone line coupling transformer. 
A six-foot cable connects the 


DataFax 14.4 card to a standard RJ-II 
modular telephone jack for connectivity 
(0 


the telephone network. 


The DataFax 
14.4 supports V.42 error correction 
which ensures that errors caused 


by the phone system are automatically 
corrected. 
The DataFax 
14.4 increases data 


throughput 
with V.42bis data compression. 
This detects redundant characters, 


character 
sequences, 
and uses fewer bits to send more frequently 
occurring 


sequences. 
DataFax 
14.4 users enjoy an effective 
throughput 
of up to 57,600 bits 


per second. 


Unlike other card manufacturers, 
Intel provides a full-service 
program for card 


labeling, custom kining with third party applications, 
and fulfillment. 
The Intel 
DataFax 
14.4 is the only fax card designed 
with recessed covers in order to accept 


adhesive labels, front and back, insuring quick turnaround 
for custom orders. 


Features 
Benefits 


- 
Integrated 
DAA 
- 
No external circuitry 


- 
Compact 
and lightweight 
- 
Easy to carry 
- 
Fits in briefcase with notebook computer 


- 
Exchangeable 
with other cards 
- 
Single slot serves multiple functions 


- 
CCIIT 
V.32bis (14.4 Kbps) 
- 
Ensures connectivity 
worldwide 


- 
Faster transmission 
means less costly phone bills 


- 
Supports V.42/V.42bis 
- 
Provides high throughput 
(57.6 Kbps) 


- 
MNP/IO 
- 
Enhanced 
data throughput 
with cellular connections 
(required phone adapter) 


- 
Hayes AutoSync 
- 
Provides synchronous 
communications 
capability 
(software required) 


Faxmodem 24/96 Card Product Brief 


Product - 
Integrated 
DAA on the card for 
Highlights 
U.SJCanada 


_ 
Complies 
with PCMCIA 
2.0 and 


JEIDA 4.1 standards 


_ 
PCMCIA 
Type II card 


• 
5mm thick 


_ 
Fully compliant 
with CCITT 


T.30 and T.4 (group 3 fax) 


_ 
Provides 
both Send and Receive 


fax capability 


_ 
Fully compliant 
with EIMIA 


578 (fax class 1 command 
set) 


_ 
Data modem complies with 


CCITT 
V.22bis, V.22, V.21, 


Bell* 212a and 103 


_ 
Supports 
CCITT 
V.42 error 


detection 
and V.42bis data 
compression 


_ 
Provides 
MNP5 data 


compression 
for backward 
compatibility 


_ 
Includes 
Ring Detect notification 


to host computer 
in the power 
down mode 


_ 
Multiple 
power conservation 
modes 


intet 


The Faxmodem 24/96 enables you to send or receive faxes and transferfiles 
to orfrom notebook computers over the public telephone network. It is the 
latest member of Intel's lIO card "plug and play" family. 


The Intel Faxmodem24/96 
also conforms to PCMCIA 2.0 AND JElDA 4.1 


physical and electrical standards for portable computers. The card's light 
weight and loll' power consumption combine to provide high-performance, 
low-cost connectivity for notebooks and sub-notebooks. 


Approximately 
the si:e of a credit card, the Faxmodem 24/96 slides into 
an external slot in the notebook computer, connecting easily illto the public 
switched telephone network. The card features four power management 
modes-on-line, 
active, power SOl'e,and power down -to 
ensure the 


lowest possible power consumption. No external power is required. 


The Faxmodem 24/96 card provides convenient, high-performance 
mobile fax and 


data communications 
capability for the notebook computer user. Adherence to 
accepted national and international 
standards provides the user with worldwide 
connectivity 
for both fax (approximately 
30 million machines in use worldwide) 
and data transfer (V.22bis is the most widely used communications 
standard in 


the world). Its exchangeability 
with other PCMClA cards enhances the multiple 


functionality 
of the notebook computer. 


The Faxmodem 
24/96 card contains the Intel 89Cl24FX 
integrated data-fax modem 
chipset, a UART, a microcontroller, 
an analog front-end and other supporting 
devices. The I6450-type 
UART provides an 8-bit data bus interface. This, together 
with the Class I AT command set support, provides compatibility 
with all the major 


fax application 
software. 


For U.S. and Canada, the DAA is integrated on the card. For rest of world, the 
detachable 
line adapter module incorporates 
country-specific 
telephone 
interface 
circuitry. This consists of the ring detector and telephone line coupling transformer. 
A standard RJ-II 
modular telephone jack provides connectivity 
to the telephone 
network. A short cable connects the line adapter module to the Faxmodem 
24/96 
card. 


The Faxmodem 
24/96 supports V.42 error correction 
which ensures that errors 
caused by the phone system are automatically 
corrected. The Faxmodem 
24/96 also 
supports V.42bis data compression. 
This increases data throughput 
by detecting 
redundant characters, 
character sequences, 
and using fewer bits to send more 
frequently occurring sequences. 
Throughput 
for file transfer operations 
is increased 
by as much as 400 percent, providing the user with an effective 9600 bits per second. 


Features 
Benefits 


- 
Integrated 
DAA 
- 
No external circuitry 
- 
Compact and lightweight 
- 
Easy to carry 
- 
Fits in briefcase with notebook computer 


- 
Exchangeable 
with other cards 
- 
Single slot serves mulliple functions 


- 
Group 3 fax 
- 
Compatible 
with worldwide 
installed base of 


fax machines 


- 
Class I fax command 
set 
- 
Compatible 
with standard communications 
packages 


- 
CCITT V.22bis, V.22, V.21, Bell 212 and 103 
- 
Ensures connectivity 
worldwide 
, 


- 
Supports V.42/Y.42bis 
- 
Faster; up to 4 to I data compression 
providing 
an 
effective throughput 
of 9600 bits per second for file 
transfers 


- 
M~em 
supports AT command set 
- 
Compatible 
with standard communications 
packages 


- 
Multiple power conservation 
modes 
- 
Prolong system battery life 


- 
Factory Configuration 
Option for Cellular Network 
- 
Ease of use 


Order Number: 297390.{)()1 
©Intel Co,!",ra,;on. 
1993 
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85C220/85C224-100, 
-80 AND -66 
FAST REGISTERED SPEED TSU, TSO 
8-MACROCELL 
PLDs 


These register optimized timing PLDs offer 
superior design features: 


• 
Low-Power, High-Performance 
Upgrade 
for SSI/MSI Logic and Bipolar PALs· 
High-Performance 
Systems 


• 
Replacement or Upgrade for 16V81 
20V8 PAL and GAL Architecture 


• 
8 Macrocells with Independently 
Programmable 
110 Architecture 


85C220-100 
AND 85C224-100 
• 
100 MHz Max Frequency (External 
Feedback); 5.5 ns (Max) Clock to 
Output; 4.5 ns (Min) Set-Up Time 


• 
Meets Critical Timing Requirements 
of 
Advanced Intel Microprocessor 
Systems 


• 
7.5 ns (Max) Propagation Delay 


• 
Typical lee = 90 mA 
• 
Available in 20-Pin and 28-Pln PLCC 
Packages 


• 
Low Power and Output Skew for Clock 
Device Applications 


• 
Up to 18 Inputs (10 Dedicated and 
8 110) and 8 Outputs 


• 
Programmable 
"Security 
Bit" Allows 
Total Protection 
of Proprietary 
Designs 


• 
100% Generally Tested Logic Array 


85C220-80 
AND 85C224-80 
• 
Quarter Power (lee = 40 mAl; 
Programmable Zero Power Mode 
(50 p-A Typical) 


• 
80 MHz Max Frequency (External 
Feedback); 5.5 ns (Max) Clock to 
Output; 7 ns (Mln) Set-Up Time 


• 
10 ns Propagation 
Delay 


• 
High-Speed Upgrade to EP320, EP330, 
and 5C032 


• 
Available In 300-mil 20-Pin and 24-Pin 
CerDIP/PDIP Packages, and 20-Pin and 
28-Pin PLCC Packages 


Inte1386™, 
i486™ 
and i860TM are trademarks 
of Intel Corporation. 


'PAL 
is a registered 
trademark 
of Advanced 
Micro 
Devices. 
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iPLD22V10L·5 
LOW POWER, ELECTRICALLY 
ERASABLE 
1~MACROCELLCMOSPLD 
• Low Power, High Speed Upgrade to 
• 10 Macrocells with Programmable 
I/O 
BICMOS/Blpolar 
22V10 and CMOS 
Architecture 
(Registered/ 
Equivalents 
Combinatorial) 
• Supply Voltage Range 4.75V to 5.25V 
• Sub-micron CHMOS III Electrical 
• tpD 5 ns, 142 MHz with Feedback 
Erasable Technology 
• Maximum IS8 = 5 mA 
• Programmable "Security 
Bit" Allows 
Total Protection 
of Proprietary 
Designs 
• Typical Ice = 15 mA @100MHz 
100% Generically Tested Logic Array 
• 
• 12 Dedicated Inputs and 10 I/O Pins 
Available In 28-Pln PLCC Packages 
• 


PROGRA 
••••••ABlE: 
A.ND ARRAY 
(44 
lC 132) 


Refer to the 1994 Programmable Logic Handbook for the complete data sheet on this device. 
November 1993 
Order Number: 290503-001 
4-23 


• 


LOW VOLTAGE, ELECTRICALLY 
ERASABLE 
1~MACROCELLCMOSPLD 
• Low Voltage, 
Low Current, 
High Speed 
• 10 Macrocells 
with Programmable 
I/O 
Upgrade 
to BICMOS/Bipolar 
22V10 and 
Architecture 
(Registered/ 
CMOS Equivalents 
Combinatorial) 
• Supply Voltage 
Range 3.0V to 3.6V. 
• Global Asynchronous 
Clear and 
Ideal for Battery 
Powered 
Systems 
Synchronous 
Preset 
P-terms 
• tpD 5 ns, 142 MHz with Feedback 
• Sub-micron 
CHMOS 
III Electrical 
• Maximum 
ISB = 100 IJ-A 
Erasable 
Technology 
• Typicallcc 
= 10 mA 
@ 100 MHz 
• Programmable 
"Security 
Bit" Allows 
Total Protection 
of Proprietary 
Designs 
• 12 Dedicated 
Inputs, 
10 I/O 
Pins 
Available 
In 28-Pln PLCC Packages 
• 
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82489DX 
ADVANCED PROGRAMMABLE 
INTERRUPT CONTROLLER 


82489DX 
FEATURES 
OVERVIEW 


• 
Advanced 
Interrupt 
Controller 
for 
32-Bit Operating 
Systems 


• 
Solution 
for MUltiprocessor 
Interrupt 
Management 


• 
Dynamic 
Interrupt 
Distribution 
for Load 
Balancing 
In MP Systems 


• 
Separate 
Nibble Bus (Interrupt 
Controller 
Communications 
(ICC) Bus) 
for Interrupt 
Messages 


• 
Inter-Processor 
Interrupts 


• 
Various 
Addressing 
Schemes- 


Broadcast, 
Fixed, Lowest 
Priority, 
etc. 


• 
Compatibility 
Mode with 8259A 


• 
32-Bit Internal 
Registers 


• 
Integrated 
Timer Support 


• 
33 MHz Operation 


• 
132·Lead 
PQFP Package, 
Package 
Type KU 
(See Packaging Specification. Order Number: 240600) 


RESET 


CLKIN 


AOS 


BGT 


OLE 
M/IO 
o/C 
W/R 
cs 
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ExtlNTA 


0[31:0] 
PNMI +- 


PINT 


'PRST +-, 
I 


... 
... 


Local 
Unit 
..• 


System 
I 
Decode 
« 
& 
>- 
I 
« 
Control 
c 


Unit 
wu« 
... 
... 


"- 
I/O 
Unit 
~ 
..• 
•. 
~ 
" 


>-a; 
I 
N 
'" 


JTAG 


Timer 
Int 
...---, 


Compatible 
... 
•.. 1 
L 
Boundary 
32 -Bit 
Timer I 
Scan 
TAP 
..• 
"I 
Controller 


LintO 


Lint 
1 


IntinO 


Intin 1 


Intin2 


TCK 


TRST 


TOI 


TMS 
TOO 


Refer to Application 
Note 
AP-388: 
82489DX 
User's 
Manual 
(Order 
Number 
292116) 
when 
evaluating 
your 


design 
needs. 
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The 82489DX Advanced Programmable Interrupt 
Controller provides multiprocessor interrupt man- 
agement, providing both static and dynamic sym- 
metrical Interrupt distribution across all processors. 


The main function of the 82489DX is to provide in- 
terrupt management across all processors. This dy- 
namic interrupt distribution includes routing of the in- 
terrupt 
to 
the 
lowest-priority 
processor. 
The 


82489DX works in systems with multiple I/O subsys- 
tems, where each subsystem can have its own set 
of interrupts. This chip also provides inter-processor 
interrupts, allowing any processor to interrupt any 
processor or set of processors. Each 82489DX I/O 
unit Interrupt Input pin is individually programmable 
by software as either edge or level triggered. The 
interrupt vector and interrupt steering information 


.. 
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can be specified per pin. A 32-bit wide timer is pro- 
vided that can be programmed to interrupt the local 
processor. The timer can be used as a counter to 
provide a time base to software running on the proc- 
essor, or to generate time slice interrupts locally to 
that processor. The 82489DX provides 32·bit soft- 
ware access to its internal registers. Since no 
82489DX register reads have any side effects, the 
82489DX registers can be aliased to a user read- 
only page for fast user access (e.g., performance 
monitoring timers). 


The 82489DX supports a generalized naming/ad- 
dressing scheme that can be tailored by software to 
fit a variety of system architectures and usage mod- 
els. It also supports 8259A compatibility by becom- 
ing virtually transparent with regard to an externally 
connected 8259A style controller, making the 8259A 
visible to software. 


"-Bit 
Open 


Drain 
Ice 


Bus 


82489DX contains one Local Unit, one I/O unit and 
a timer. The ICC bus is used to pass interrupt mes- 
sages. 


The ICC bus is a 5-wire synchronous bus connecting 
all 82489DXs (all I/O Untis and all Local Units). The 
Local Units and I/O Units communicate over this 
ICC bus. Four of these five wires are used for data 
transmissions and arbitration, and one wire is a 
clock. 


The Local Unit contains the necessary intelligence 
to determine whether or not its processor should ac- 
cept interrupt messages sent on the ICC bus by oth- 
er Local Units and I/O Units. The Local Unit also 
provides local pending of interrupts, nesting and 
masking of interrupts, and handles all interactions 
with its local processor such as the INT/INT AlEOI 
protocol. The Local Unit further provides inter-proc- 
essor interrupt functionality and a timer to its local 
processor. The interface of a processor to 
its 
82489DX Local Unit is identical for every processor. 


The I/O Unit provides the interrupt input pins on 
which I/O devices inject interrupts into the system in 


the form of an edge or a level. The I/O unit also 
contains a Redirection Table for the interrupt input 
pins. Each entry in the Redirection Table can be in- 
dividually programmed to indicate whether an inter- 
rupt on the pin is recognized as either an edge or a 
level; what vector and also what priority the interrupt 
has; and which of all possible processors should 
service the interrupt and how to select that proces- 
sor (statically or dynamically). The information in the 
table is used to send interrupt messages to all 
82489DX Units via the ICC bus. 


The 82489DX provides a 32-bit wide timer that can 
be programmed to interrupt the local processor. The 
timer can be used as a counter to provide a time- 
base to software running on the processor, or to 
generate time-slice interrupts local to that proces- 
sor. 


The 82489DX pin description is organized in a small 
number of functional groups. Pin definitions and pro- 
tocols have been designed to minimize interface is- 
sues. In particular, they support the notion of inde- 
pendently controlled address and data phases. The 
primary host interface is synchronous in nature. 


In the following pin definition table if the signal name 
has (_) over it, the signal is in its active state when it 
has a low level. The signal direction column identi- 
fies output only signals as a continuous drive (0), 
tristate (TIS), or open drain (O/D). All bi-directional 
(81-D)signals have tri-stating outputs. 
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Symbol 
Pin No. 
Type 
Function 


SYSTEM 
PINS 


RESET 
65 
I 
The RESET INPUT forces 82489DX 
to enter its initial state. The 82489DX 
Local Unit in turn asserts it PRST (Processor 
Reset) output. All tri-state 
outputs 
remain in high impedance 
until explicitly 
enabled. 


ExtlNTA 
41 
0 
The EXTERNAL 
INTERRUPT 
ACKNOWLEDGE 
output is asserted 
(high) 
when an external 
interrupt 
controller 
(e.g., 8259) is expected 
to respond 
to 
the current 
INTA cycle. If deasserted 
(low), 82489DX 
will respond, 
and the 
INTA cycle must not be delivered 
to the external 
controller. 


CLKIN 
57 
I 
CLOCK INPUT provides 
reference 
timing for most of the bus signals. 


TRST 
56 
I 
TEST RESET is the JT AG compatible 
boundary 
scan TAP controller 
reset 
pin. A weak pull-up keeps the pin high if not driven. 


TCK 
55 
I 
TEST CLOCK is the clock input for the JTAG compatible 
boundary 
scan 
controller 
and latches. 


TDI 
53 
I 
TEST DATA INPUT is the test data input pin for the JT AG compatible 
boundary 
scan chain and TAP controller. 
A weak pull-up keeps this pin high 
if not driven. 


TOO 
52 
0 
TEST DATA OUTPUT 
is the test data output for the JT AG compatible 
boundary 
scan chain. 


TMS 
54 
I 
TEST MODE SELECT 
is the test mode select pin for the JT AG boundary 
scan TAP controller. 
A weak pull-up keeps this pin high if not driven. 


TIMER 
PIN 


TMBASE 
59 
I 
The TIME BASE input provides 
a standard 
frequency 
that is only used by 
the 82489DX 
timer and that is independent 
of the system clock. 


INTERRUPT 
PINS 


INTIN[15:0] 
82-97 
I 
These 16 INTERRUPT 
INPUT pins accept edge or level sensitive 
interrupt 
requests 
from I/O or other devices. The pin numbers 
are specified 
respectively. 
INTIN15 corresponds 
to pin number 82, INTIN14 corresponds 
to pin number 83 etc., and INTI NO corresponds 
to pin number 97. These 
pins are active high. 


L1NTIN[1] 
80 
I 
Two LOCAL INTERRUPT 
INPUT pins accept edge or level sensitive 
L1NTIN[O] 
81 
I 
interrupt 
requests that can only be delivered 
to the connected 
processor. 
These pins are active high. 


REGISTER 
ACCESS 
PINS 


ADS 
64 
I 
ADDRESS 
STROBE 
signal indicating 
the start of a bus cycle. 82489DX 
does not commit to start the cycle internally 
until BUS GRANT 
is detected 
active. 


Symbol 
Pin No. 
Type 
Function 


REGISTER 
ACCESS 
PINS 
(Continued) 


M/iO, 
63 
I 
Bus cycle definition 
signals. Note that since the 824890X 
registers 
can be 
O/G, 
61 
I 
mapped in either memory or liD space, the MilO 
pin is not used for register 
W/R 
62 
1 
access cycles; it is only used to decode 
interrupt 
acknowledge 
cycles. 


824890X 
does not respond to code read cycles. 


BGT 
66 
I 
The BUS GRANT input is optional 
and is used to indicate the address 
phase of 


a bus cycle in configurations 
where address timing cannot 
be inferred from 
AOS. This signal is really used as an address 
latch enable, 
but is named as it is 


to indicate that it can normally 
be connected 
to the Intel Cache Controller 
generated 
signal of the same name. Must be tied low if not used. 


CS 
74 
1 
The CHIP SELECT input indicates 
that the 824890X 
registers 
are being 
addressed. 


A3 
31 
BI-O 
The address 
pins are used as inputs in addressing 
internal register space. 


A4 
29 
BI-O 
Output function 
is reserved. 
They are also used to latch local unit 10 on reset. 


A5 
28 
BI-O 
A6 
27 
BI-O 
A7 
26 
BI-O 
A8 
24 
BI-O 
A9 
22 
BI-O 
A10 
21 
BI-O 


OLE 
73 
1 
DATA LATCHIENABLE 
is optional 
and is used to indicate 
committing 
the data 


phase of a bus cycle in configurations 
where data timing cannot 
be inferred 
from other cycle timings. 
Must be tied low if not used. 


031 
105 
BI-O 
The OAT A BUS is for all register accesses 
and interrupt vectoring. 


030 
107 
BI-O 
029 
109 
BI-O 
028 
110 
BI-O 
027 
111 
BI-O 
026 
112 
BI-O 
025 
114 
BI-O 
024 
115 
BI-O 
023 
116 
BI-O 
022 
118 
BI-O 
021 
119 
BI-O 
020 
121 
BI-O 
019 
122 
BI-O 
018 
123 
BI-O 
017 
124 
BI-O 
016 
125 
BI-O 
015 
128 
BI-O 
014 
129 
BI-O 
013 
130 
BI-O 
012 
131 
BI-O 
011 
2 
BI-O 
. 


Symbol 
Pin No. 
Type 
Function 


REGISTER 
ACCESS 
PINS 
(Continued) 


010 
3 
81-0 
09 
4 
81-0 
08 
7 
81-0 
07 
8 
81-0 
06 
9 
BI-O 
05 
11 
BI-O 
04 
12 
BI-O 
03 
13 
BI-O 
02 
14 
BI-O 
01 
16 
BI-O 
DO 
18 
81-0 


OP3 
101 
BI-O 
One Data Parity pin for each byte on the data bus. EVEN parity is generated 
OP2 
102 
81-0 
any time the data bus is driven by the 824890X. 


OP1 
103 
BI-O 
OPO 
104 
BI-O 


ROY 
43 
0 
READY output indicates 
that the current bus cycle is complete. 
In the case of 
a read cycle, valid data and the return to inactive state after going active low 
may be delayed till OLE goes active. 


PROCESSOR 
PINS 


PINT 
35 
T/S 
The PROCESSOR 
INTERRUPT 
OUTPUT 
indicates 
to the processor 
that one 
or more maskable 
interrupts 
are pending. This pin is tn-stated 
at reset, and has 
an internal 
pull-down 
resistor to prevent false signaling 
to the processor 
until 
the 824890X 
local 
Unit is enabled 
and this pin is actively driven. 


PRST 
38 
0 
The PROCESSOR 
RESET OUTPUT 
is asserted/de-asserted 
upon 824890X 
reset, and also in response 
to ICC bus messages 
with "RESET" 
delivery 
mode. This pin should be used with care. 


PNMI 
37 
T/S 
The NON-MASKABLE 
INTERRUPT 
output is signaled 
in respone 
to ICC bus 
messages 
with "NMI" 
delivery 
mode. This pin is tn-stated 
at reset, and has an 
internal 
pull-down 
resistor to prevent false signaling 
to the processor 
until the 
local 
Unit is enabled 
and this pin is actively driven. 


ICC BUS PINS 


IClK 
60 
I 
The ICC BUS CLOCK input provides 
synchronous 
operation 
of the ICC bus. 


MBI[3:0] 
76-79 
I 
The four ICC BUS IN inputs are used for incoming 
ICC bus messages. 
In 
smaller configurations 
the ICC bus input and outputs may be tied directly 
together 
at the pins. Pin number for MBI3 is 76, M812 is 77, M811 is 78 and 
M810 is 79. 


M803 
45 
0/0 
The four ICC BUS OUT outputs are used for outgoing 
ICC bus messages. 
The 
MB02 
48 
current capacity 
is only 4 mA. So external 
bufferes 
will be needed. 


M801 
49 
MBOO 
51 


• 
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Symbol 
Pin No. 
Type 
Function 


RESERVED 
PINS 


Reserved 
34,42 
NC 
These pins MUST BE LEFT OPEN. 


Reserved 
70, 72, 75 
Reserved 
by Intel. These pins should be strapped 
to Vcc. 


Reserved 
71,19,20 
Reserved 
by Intel. These 
pins should be strapped 
to GND. 


POWER AND GROUND 
PINS 


Vcc 
1,32,69,98 
POWER 
Nominally + 5V. These pins along with Vss and VSSIshould be 
separately bypassed. 


Vccp 
6,15,25,100, 
POWER 
Nominally + 5V. These pins along with VSSPshould be separately 
108,117,126 
bypassed. 


Vccpo 
39,46 
POWER 
Nominally + 5V. These pins along with VSspo should be separately 
bypassed. 


Vss 
5,33,67, 
GND 
Nominally OV.These pins along with Vcc should be separately 


68,99 
bypassed. 


VSSP 
10,17,23,30, 
GND 
Nominally OV.These pins along with Vccp should be separately 


106,113,120, 
bypassed. 
127,132, 


VSspo 
36,40,44, 
GND 
Nominally OV.These pins along with Vccpo should be separately 


47,50 
bypassed. 


VSSI 
58 
GND 
Nominally OV.These pins along with Vcc should be separately 
bypassed. 


NOTE: 
Vcc, Vccp and Vccpo should 
be of same voltage. 
Vss, Vssp, Vsspo and VSSIshould 
be av. 


As far as interrupt management is concerned, the 
82489DX's interrupt control function spans over two 
functional units, the I/O Unit of which there is one 
per I/O subsystem, and the Local Unit of which 
there is one per processor. 82489DX has one I/O 
unit and one Local Unit in a single package. This 
section takes a detailed look at both local and I/O 
Units. 


The I/O Unit consists of a set of Interrupt Input pins, 
an Interrupt Redirection Table, and a message unit 
for sending and receiving messages from the ICC 
bus. The I/O Unit is where I/O devices inject their 
interrupts, the I/O Unit selects the corresponding 
entry in the Redirection Table and uses the informa- 
tion in that entry to format an interrupt request 
message. The message unit then broadcasts this 
message over the ICC bus. The content of the Redi- 
rection Table is under software control and is as- 
signed benign defaults upon reset. The masks in the 
Redirectional Table entries are set to 1 at hardware 
reset to disable the interrupts. 
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Redirection 
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I/o 
UNIT ID REG I 
11/0 
UNIT VERSION REG I 


Interrupt Management of the Local Unit is responsi- 
ble for local interrupt sources, interrupt acceptance, 
dispensing interrupts to the processor, and sending 
inter-processor interrupts. Depending on the delivery 


mode of the interrupt, zero, one or more units can 
accept an interrupt. A Local Unit accepts an inter- 
rupt only if it will deliver the interrupt to its processor. 
Accepting an interrupt is purely an inter-82489DX 
matter; dispensing an interrupt to the local proces- 
sor only involves a 82489DX and its local processor.• 


INTERRUPT 
ACK. CYCLE 


ExtiNTA 


IRR 
256-BIT 
VECTOR 
- - - - - - -- 
ARRAY 


5.0 
INTERRUPT 
CONTROL 
MECHANISM 


highest. Priority of interrupt A "is higher than" the 
priority of interrupt B if servicing A is more urgent 
than servicing B. An interrupt's priority is.implied by 
its vector; namely priority = vector/16. 
This section describes briefly the interrupt control 
mechanism in the 82489DX. 


With 256 vectors and 16 different priorities, this im- 
plies that 16 different interrupt vectors can share a 
single interrupt priority. 


The interrupt control function of all 82489DXs are 
collectively responsible for delivering interrupts from 
interrupt sources to interrupt destinations in the mul- 
tiprocossor system. When a processor accepts an 
interrupt, it uses the vector to locate the entry point 
of the handler in its interrupt table. The 82489DX 
architecture allows for 16 possible interrupt priori- 
ties; zero being the lowest priority and 15 being the 


Out of 256 vectors, interrupt vectors 0 to 15 should 
not be used in the 82489DX. Only 240 interrupt vec- 
tors (vectors from 16 to 255) are supported in the 
82489DX. 
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Interrupts are generated by a number of different in- 
terrupt sources in the system. 


Possible interrupt sources are: 
• Externally connected 
(110) 
devices. Interrupts 
from these external sources manifest themselves 
as edges or levels on interrupt input pins and can 
be redirected to any processor. 
• Locally connected devices. These originate as 
edges or levels on interrupt pins, but they are al 
ways directed to the local processor only. 


• 82489DX timer generated interrupts. Like locally 
connected devices, 82489DX timer can only in- 
terrupt its local processor. 
• Processors. A processor can interrupt any indi- 
vidual processor or sets of processors. This sup- 
ports software self-interrupts, preemptive sched- 
uling, TLB flushing, and interrupt forwarding. A 
processor generates interrupts by writing to the 
interrupt command register in its Local Unit. 


I/O Units can only source interrupts whereas Local 
Units can both source and accept interrupts, so 
whenever "interrupt destination" is discussed, it is 
implied that the Local Unit is the destination of the 
interrupt. In physical mode the destination processor 
is specified by a unique 8-bit 82489DX local 10.Only 
a single destination or a broadcast to all (LOCAL 10 
of all ones) can be specified in physical destination 
mode. 


In logical mode destinations are specified using a 
32-bit destination field. All Local Units contain a 
32-bit Logical Destination register against which the 
destination field of the interrupt is matched to deter- 
mine if the receiver is being targeted by the interrupt. 
An additional 32-bit Destination Format register in 
each Local Unit enables the logical mode address- 
ing. 


The description of interrupt delivery makes frequent 
use of the following terms: 


• Each processor has a processor priority that re- 
flects the relative importance of the code the 
processor is currently executing. This code can 
be part of a process or thread, or can be an inter- 
rupt handler. A processor's priority fluctuates as a 
processor switches threads, a thread or handler 
raises and lowers its priority level to mask out 
interrupt, and the processor enters an interrupt 
handler and returns from an interrupt handler to 
previously interrupted activity. 


• A processor is lowest priority within a given group 
of processors if its processor priority is the lowest 
of all processors in the group. Note that more 
than one processor can be the lowest priority in a 
given group. 


• A processor is the focus of an interrupt if it is 
currently servicing that interrupt, or if it currently 
has a request pending for the interrupt. 


Interrupt delivery begins with an interrupt source in- 
jecting its interrupt into the interrupt system at one of 
the 82489DX. Delivery is complete only when the 
servicing processor tells its 82489DX Local Unit it is 
complete by issuing an end-of-interrupt (EOI) com- 
mand to its 82489DX Local Unit. Only then has all 
(relevant) internal state regarding that occurrence of 
the interrupt been erased. The interrupt system 
guarantees exactly-once delivery semantics of inter- 
rupts to the specified destinations. Exactly-once 
guaranteed delivery implies a number of things: 


• The interrupt system never rejects interrupts; it 
never NAKs interrupt injection, interrupts are nev- 
er lost, and the same interrupt (occurrence) is 
never delivered more than once. 


Clearly a single edge interrupt or level interrupt 
counts as a single occurrence of an interrupt. In uni- 
processor systems, an occurrence of an interrupt 
that is already pending (IRR) cannot be distin- 
guished from the previous occurrence. All occur- 
rences are recorded in the same IRR bit. They are 
therefore treated as "the same" interrupt occur- 
rence. 


For lowest-priority delivery mode, by delivering an 
interrupt first to its focus processor (if it currently has 
one), the identical behavior can be achieved in a MP 
(Multiprocessor) system. If an interrupt has a focus 
processor then the interrupt will be delivered to the 
interrupt's focus processor independent of priority 
information. This means that even if there is a lower 
priority processor compared to the focus processor, 
the interrupt still gets delivered to the focus proces- 
sor. 


Each edge occurring on an edge triggered interrupt 
input pin is clearly a one-shot event; each occur- 
rence of an edge is delivered. An active level on a 
level triggered interrupt input pin represents more of 
a "continuous event". Repeatedly broadcasting an 
interrupt message while the level is active would 
cause flooding of the ICC bus, and in effect trans- 
mits very little useful information since the same 
processor (the focus) would have to be the target. 


Instead, for level triggered interrupts the 82489DX 
merely recreate the state of the interrupt input pin at 
the destination . The source 82489DX accomplishes 
this by tracking the state of the appropriate destina- 
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tion 82489DX's Interrupt Request Register (or pend- 
ing bit) and only sending inter-82489DX messages 
when the state of the interrupt input pin and the des- 
tination's interrupt request enter a disagreement. 
Unlike edge triggered interrupts, when a level inter- 
rupt goes into service, the interrupt request at the 
servicing 82489DX is not automatically removed. If 
the handler of a level sensitive interrupt executes an 
EOI then that interrupt will immediately be raised to 
the processor again, unless the processor has ex- 
plicitly raised its task priority, or the source of that 
interrupt has been removed. 


This section specifically talks about how a processor 
is picked during interrupt delivery. The 82489DX 
supports two modes for selecting the destination 
processor: Fixed and Lowest Priority. 


• 
Fixed Delivery 
Mode 
In fixed delivery mode, the interrupt is uncondi- 
tionally delivered to all local 82489DXs that 
match in the destination information suppliedwith 
the interrupt. Note that for I/O device interrupts 
typically only a single 82489DX would be listed in 
the destination. Priority and focus information are 
ignored. If the priority of a destination processor 
equal to or higher than the priority of the interrupt, 
then the interrupt is held pending locally in the 
destination processor's Local Unit, until the proc- 
essor priority becomes low enough at which time 
the interrupt is dispensed to the processor. More 
than one processor can be the destination in 
fixed-delivery mode. 


• 
Lowest 
Priority DelIvery 
Mode 
Under the lowest priority delivery mode, the proc- 
essor to handle the interrupt is the one in the 
specified destination with the lowest processor 
priority value. If more than one processor is at the 
lowest priority, then a unique arbitration 10 is 
used to break ties. For lowest priority dynamic 
delivery, the interrupt will always be taken by its 
focus processor if it has one. The lowest priority 
delivery method assures minimum interruption of 
high priority tasks. Since each Local Unit only 
knows its own processor priority, determining the 
lowest priority processor is done by arbitration on 
the ICC bus. Only one processor can be the des- 
tination in lowest-priority delivery mode. 


All I/O and Local Units communicate during interrupt 
delivery. Interrupt information is exchanged between 
different units on a dedicated five wire ICCbus in the 
form of broadcast messages. A 82489DX Unit's 8-bit 
10 is used as its name for the purpose of using the 
ICC bus, and all 82489DX units using one ICC bus 
should be assigned a different 10. The Arbitration 


10 of the Local Units used to resolve ties during low- 
est priority arbitration is also derived from the Local 
Unit's 10. 


6.0 
82489DX 
LOCAL UNIT 
REGISTERS 
DESCRIPTION 


6.1 
Local Unit 10 Register 


Each 82489DX Local Unit has a register that con- 
tains the Local Unit's 8-bit 10. The Local Unit 10 
serves as a physical name of the 82489DX Local 
Unit. It can be used in specifying destination infor- 
mation and is also used for accessing the ICC bus. 
Eight address lines A[10]-A[3] 
are sampled on ev- 
ery clock edge while RESET is asserted. The last 
sample remains in the Local Unit 10 register after 
reset. Alternatively, the 10 can be loaded with a reg- 
ister write as part of software initialization. The Local 
Unit 10 is read-write by software. 


824890X 
Local Unit 10 Register 


Bits [31..24] 
Local Unit 
10: The 
Local 
Unit 
10 
serves as the physical "name" of the 
Local Unit used for addressing the 
82489DX in physical destination mode 
and for the ICC bus usage. In a sys- 
tem with say four 82489DX, there are 
4 Local Units and 4 I/O Units. All the 8 
units should be assigned different IDs. 
For future compatibility use only IDs 
from 0 to 14. 
Bits [23..0] 
Bits 
[23..0] 
are 
Reserved. 
They 
should be written with O. 


Interrupt Destination can be either addressed physi- 
cally or logically. When the interrupt message ad- 
dresses the destination physically, each 82489DX in 
the ICC bus compares the address with its own unit 
10. If the message is a broadcast type then every 
Local Unit accepts the interrupt. 


When the message addresses the destination using 
logical addressing scheme each Local Unit in the 
ICC bus compares the logical address in the inter- 
rupt message with its own Logical Destination Reg- 
ister. If there is a bit match (Le., if at least one of the 
corresponding pair of bits match) this local unit is 
selected for delivery. 


• 


82489DX connected in the ICC bus should be writ- 
ten with "1" to enable the addressing scheme. 


Destination Format Register 
I 
Bns [31 :01 
I 


Logical Destination Register 
I 
Bits [31:0] 
I 


For future compatibility, use only bits 31-24 of Logi- 
cal Destination Register. For binary compatibility, it 
is strongly recommended that 82489DX software 
use only 8 MSB of Logical Destination Register. 


6.3 Local Interrupt Vector Table 
Registers 


The Redirection Table serves to steer interrupts that 
originate in the I/O subsystems to the processors. 
The Local Vector Table is its equivalent for inter- 
rupts that are restricted to only the local processor. 
The Local Vector Table contains three 32-bit regis- 
ters. Register 0 corresponds to the timer, registers 1 
and 2 correspond to 
local interrupt input pins, 
L1NTINOand L1NTIN1. 


The format of both the Local 0 and Local 1 interrupt 
vector tables are identical. The following register de- 
scription talks about both Local Interrupts 0,1 vec- 
tors. 


Local Interrupts 0, 1 Interrupt Vectors 


Vector: [Bits 7-01 
This is the vector to use when gener- 
ating an interrupt for this entry. 


Delivery Mode: [Bits 10-81 
000: Fixed 
001: <reserved> 
010: <reserved> 
011: <reserved> 
100: NMI 
101: <reserved> 


111: ExtlNT 
000: (fixed) means deliver the signal 


on the INT pin of the local proc- 
essor. Trigger mode for "fixed" 
Delivery Mode can be edge or 
level. 


100: (NMI) means deliver the signal 


on the NMI pin of the local proc- 
essor. Vector information is ig- 
nored. A Delivery Mode equal to 
"NMI" 
requires a "level" 
trig- 


gered mode. 


111: (ExtINTA) means deliver the sig- 


nal to the INT pin of the local 
processor as an interrupt that 
originated in an externally con- 
nected (8259A compatible) in- 
terrupt controller. ExtlNTA pin is 
as- serted also. The INTA cycle 
that corresponds to this ExtIN- 
TA delivery, should be routed to 
the external controller that is ex- 
pected to supply the vector. A 
delivery 
mode 
of 
ExtlNT 
re- 


quires an edge trigger mode. 
(See the section on compatiblity 
for more details.) 


Bit 11 is Reserved. It should be writ- 
ten O. 


Delivery Status: [Bit 121 
This field is software-read only. Soft- 
ware writes to this field (as part of a 
32-bit word) have no effect on this 
bit. Delivery status is a 1-bit field that 
contains the current status of the de- 
livery of this interrupt. Two states are 
defined. 
0: (idle) means that there is currently 


no activity for this interrupt. 


1: (send pending) indicates that the 


interrupt has been injected, but its 
delivery is temporarily held up by 
the recently injected interrupts that 
are in the process of being deliv- 
ered. 


Local INTOVector Table 
I 
Bns [31:17] 
I 
Bn 16 I 
Bn 15 I 
Bn 14 I 
Bit 13 I 
Bit 12 I 
Bn 11 I 
Bits [10:8] 
I 
Bits [7:0] 
I 


Bit 13 is Reserved. 
It should be written 
O. 


Remote 
IRA: [Bit 14] 


This bit is used for level triggered 
local 
interrupts; 
its meaning 
is undefined 
for 
edge triggered 
interrupts. 
Remote 
IRR 
mirrors 
the 
interrupt's 
IRR bit of this 
local 
unit. 
Remote 
IRR 
is 
software 
read-only; 
software 
writes to this bit do 
not affect 
it. 


Trigger 
Mode: 
[Bit 15] 


The 
Trigger 
Mode 
field 
indicates 
the, 


type of signal on the local interrupt 
pin 
that triggers 
an interrupt. 


0: indicates 
edge sensitive, 


1: indicates 
level sensitive. 


Only 
the 
local 
interrupt 
pins 
can 
be 
programmed 
as 
edge 
or 
level 
trig- 
gered. 
Timer 
interrupts 
are 
always 
treated 
as edges. 


MASK: 
[Bit 16] 


0: enables 
injection 
of the interrupt, 


1: masks 
injection 
of the interrupt. 


Bits [31:17] 
Bits 
[31:17] 
are Reserved. 
Should 
be 
written 
O. 


6.4 
Inter-Processor 
Interrupt 
Registers 


A processor 
generates 
inter-processor 
interrupts 
by 
writing 
to 
the 
Interrupt 
Command 
Register 
in its 
82489DX 
Local 
Unit. 
Conceptually, 
this 
can 
be 
thought 
of as the processor 
providing 
the interrupt's 
Redirection 
Table 
Entry on the fly. Not surprisingly, 
the 
layout 
of the 
Interrupt 
Command 
Register 
re- 
sembles 
that 
of an entry 
in the Redirection 
Table. 
Note that the format 
of this register 
allows a proces- 
sor to generate 
any interrupt. 
A processor 
may use 
this to forward 
device 
interrupts 
originally 
accepted 
by it to other 
processors. 


All 
fields 
of the 
Interrupt 
Command 
Register 
are 
read-write 
by software 
with the exception 
of the De- 
livery Status 
field which 
is read-only. 
Writing 
to the 
32-bit word that contains 
the interrupt 
vector 
causes 


the interrupt 
message 
to be sent. 


Vector: 
[Bits 7-0] 


The vector 
identifies 
the interrupt 
being 
sent. 
If the Delivery 
Mode 
is "Remote 
Read", 
then 
the 
Vector 
field 
contains 
the address 
of the register 
to be read in 
the remote 
82489DX's 
Local 
Unit. The 
addresses 
are listed in the section 
dis- 
cussing 
82489DX 
Local 
Unit 
register 
summary. 
For example, 
for ID register, 
remote 
read 
address 
of 02 should 
be 
specified 
in vector 
field. 


Delivery 
Mode: 
[Bits 10-8] 


The 
Delivery 
Mode 
is a 3-bit field 
that 
specifies 
how the 82489DX 
listed in the 
destination 
field 
(bits 63:32) 
should 
act 
upon reception 
of this signal. 
Note that 
certain 
Delivery 
Modes will only operate 
as intended 
when 
used 
in conjunction 
with a specific 
Trigger 
Mode. These 
re- 
strictions 
are indicated 
for each Delivery 
Mode. 


000: (Fixed) means deliver the signal on 
the INT pin of all processors 
listed 
in the 
destination. 
Trigger 
Mode 
for "fixed" 
Delivery 
Mode 
can be 
•• 
edge or level. 
~ 


001: (Lowest 
Priority) means deliver the 
signal 
on the INT pin of the proc- 


essor that is executing 
at the lower 
priority 
among 
all the 
processors 
listed 
in the specified 
destination; 
Trigger 
Mode 
for "lowest 
priority" 
Delivery 
Mode can be edge or lev- 
el. 


010: Intel 
Reserved. 
Should 
not 
be 
used. 


011: (Remote 
Read) 
is a request 
to a 
remote 
82489DX 
Local 
Unit 
to 
send the value of one of its regis- 
ters over the ICC bus. The register 
is selected 
by providing 
its address 
in the Vector field. The register val- 
ue 
is 
latched 
by 
the 
requesting 
82489DX 
and 
stored 
in the 
Re- 
mote 
Register 
where 
it 
can 
be 
read by the local processor. 
A De- 
livery Mode of "Remote 
Read" 
re- 
quires an "edge" 
Trigger 
Mode. 


100: (NMI) means deliver the signal on 
the NMI pin of all processors list- 
ed in the destination, vector infor- 
mation 
is 
ignored. A 
Delivery 
Mode equal to "NMI" requires a 
"level" Trigger Mode. 


101: (Reset) means deliver the signal 
to all local units listed in the desti- 
nation. The destination local unit 
will assert/deassert its PRST out- 
put pin. All addressed 82489DX 
Local Units will assume their re- 
set state but preserve their 10. 
One side effect of an ICC bus 
message 
with 
Delivery 
Mode 
equal to "Reset" that results in a 
deassert of reset is that all Local 
Units (whether listed in the desti- 
nation or not) will reset their low- 
est-priority tie breaker arbitration 
10 to their Local Unit 10 (see the 
section on the ICC bus for de- 
tails). A Delivery Mode of "Re- 
set" 
requires a "level" 
Trigger 
Mode. "RESET" 
should not be 
used with "self" or "all incl self" 
Shorthand 
mode 
since 
it 
will 
leave the system in non-recover- 
able reset state. If "RESET" is 
used with "all excl self" 
mode 
software should make sure that 
only one CPU executes this in- 
struction in a MP system. 


110: Intel Reserved. Should not be 
used. 


111: Intel Reserved. Should not be 
used 


Destination Mode: [Bit 11] 
This field determines the interpretation 
of the Destination field. 
0: (Physical Mode): in Physical Mode, 


a destination 82489DX is identified 
by its Local Unit 10. Bits 56 through 
63 (8 MSB of the destination field) 
specify the 8-bit 82489DX Local 
Unit 10. 


1: (Logical Mode): in Logical Mode, 
destinations are identified by match- 
ing on Logical Destination under the 
control of the Destination Format 
Register in each Local 82489DX. 
The 32-bit Destination field is the 
logical destination. 


Delivery Status: [Bit 12] 
Delivery Status is a 1-bit field that con- 
tains the current status of the delivery 
of this interrupt. Two states are de- 
fined: 
0: (Idle) means that there is currently 


no activity for this interrupt; 


1: (Send Pending) indicates that the 


interrupt has been injected, but its 
delivery is temporarily held up by 
other 
recently 
injected 
interrupts 


that are in the process of being de- 
livered; 
Delivery Status is software read-only; 
software writes to this field (as part of 
a 32-bit word) do not affect this bit. 
Software can read to find out if the 
current interrupt has been sent, and 
the 
Interrupt Command Register is 


available to send the next interrupt. If 
the Interrupt Command Register is ov- 
erwritten before the Delivery Status is 
"Idle", then the destiny of that inter- 
rupt is undefined; i.e., the interrupt may 
have been lost. 


Bit 13: 
Bit 13 is Reserved . Should be written 
O. 


Level: [Bit 14] 
Software can use this bit in conjunc- 
tion with the Trigger Mode bit when is- 
suing an inter-processor interrupt to 
simulate assertion/deassertion of lev- 
el sensitive interrupts. 
To assert: Trigger mode = 1 and Lev- 
el = 1. 
To deassert: Trigger mode = 
1 and 


Level = O. 
For example, a message with Delivery 
Mode of "Reset", a Trigger Mode of 
"Level", and Level bit of 0 deasserts 
Reset to the processor of the ad- 
dressed 82489 OX Local Unites).As a 
side effect, this will also cause all 
82489DX to reset their Arbitration 10 
to their unit 10. (The Arb 10 is used for 
tie breaking in lowest priority arbitra- 
tion.) 


Trigger Mode: [Bit 15] 
Software can use this in conjunction 
with Level Assert/Deassert to gener- 
ate interrupts that behave as edges or 
levels. 
0: Edge 
1: Level 


intelQP 


Remote Read Status: [Bits 17,16] 
This field indicates the status of the 
data contained in the Remote Read 
register. This field is read-only to soft- 
ware. Whenever software writes to the 
Interrupt Command Register using De- 
livery Mode "Remote Read" the Re- 
mote Read status becomes "in-prog- 
ress" (waiting for the remote data to ar- 
rive). The remote 82489DX Local Unit is 
expected to respond in a fixed amount 
of 
ICC bus 
cycles. 
If 
the 
remote 
82489DX Local Unit is unable to do so, 
then the Remote Read status becomes 
"Invalid". 
If successful, the 
Remote 
Read status resolves to "Valid". Soft- 
ware should poll this field to determine 
completion and success of the Remote 
Read command. 
00: (invalid): the content of the Remote 
Read Register is invalid. This is the 
case after a Remote Read com- 
mand 
issued 
and 
the 
remote 
82489DX Local Unit was unable to 
deliver the Register content in time. 
01: (in progress): a Remote Read com- 
mand has been issued and this 
82489DX is waiting for the data to 
arrive from the remote 82489DX Lo- 
cal Unit. 


10: (valid): the 
most recent 
Remote 
Read command has completed and 
the remote register content in the 
Remote Read Register is valid. 
11: reserved. 


Destination Shorthand: [Bits 19,18] 
This field indicates whether a shorthand 
notation is used to specify the destina- 
tion of the interrupt and if so, which 
shorthand is used. Destination Short- 
hands do no use the 32-bit Destination 
field, and can be sent by software with 
a single 32-bit write to the 82489DX's 
Interrupt 
Command 
Register. Short- 
hands are defined for the following 
common cases: software self interrupt, 
interrupt to all processors in the system 
including the sender, interrupts to all 
processors in the system excluding the 
sender. 
00: (dest field): means that no short- 
hand is used. The destination is 
specified in the 32-bit Destination 
field in the second word (bits 32 to 
63) of the Interrupt Command Reg- 
ister. 


01: (self): 
means 
that 
the 
current 
82489DX Local Unit is the single 
destination of the interrupt. This is 
useful for software interrupts. The 
Destination field 
in the 
Interrupt 
Command Register is ignored. RE- 
SET Delivery mode should not be 
used with 
self 
destination. 
Only 
FIXED delivery mode should 
be 
used with SELF. 


10: (all incl self): means that the inter- 
rupt is to be sent to "all" 
proces- 
sors in the system including the 
processor 
sending 
the 
interrupt. 


The 82489DX will broadcast a mes- 
sage with destination unit ID field 
set to all ones. RESET assert Deliv- 
ery mode should not be used with 
"all incl self" destination. 


11: (all excl self): means that the inter- 
rupt is to be sent to "all" 
proces- 
sors in the system with the excep- 
tion of the processor sending the 
interrupt. The 82489DX will broad- 
cast a message with destination 
unit ID field set to all ones. AII-excl- 
self is useful during selection of a 
boot processor (init) and also for 
TLB flush where "self" 
is flushed 
using the processor flush instruc- 
tion. Only one CPU in a MP system 
should execute "all excl self" desti- 
nation if used with RESET Delivery 
mode. 
Bits [31:20] Bits [31:20] are Reserved. They should 
be written O. 


Interrupt Command Register [63:32] 
I 
Bits [03:32] 
I 


Destination: [Bits 63-32] 
This field is only used when the Desti- 
nation Shorthand is set to "Dest Field". 
If Destination Mode is Physical Mode, 
then the 8 MSB contain a Destination 
unit 10. If Logical Mode, the full 32-bit 
Destination field contains the logical ad- 
dress. The enabling is done by Destina- 
tion Format Register. 


All 82489DX Local Units listen to all messages sent 
over the ICC bus. For each message, the local unit 
first checks if it belongs to the destination in the 
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message. It does this by matching the 32-bit Desti- 
nation field in the message against its logical Desti- 
nation Register, if the message addresses in logical 
mode, and against its physical ID, if the message 
addresses in physical mode. All 82489DX Local 
Units that match are said to "belong to the group". 


Each 82489DX Local Unit contains three 256-bit 
registers that playa role in the acceptance of inter- 
rupts and in dispensing accepted interrupts to the 
local processor. Each of these registers is a bit array 
where bit position i tracks information about the in- 
terrupt with vector L These bits track information 
about the (PINT) maskable interrupts only. They are 
not relevant for NMI, RESET or ExtlNT type of inter- 
rupts. The Interrupt Request Register (IRR) contains 
the interrupts accepted by this 82489DX Local Unit 
but not yet dispensed to the processor. The In Serv- 
ice Register (ISR) contains the interrupts that are 
currently in service by the processor, Le., the inter- 
rupts that have been dispensed to the processor but 
for which the processor has not yet signaled the 
End-Of-Interrupt. 


Note that the 82489DX's IRR and ISR registers have 
the same meaning and operation as in the 8259A in 
fully nested/non-specific EOI mode. Note also that 
these registers play no role in providing 8259A com- 
patibility. Compatibility is handled by making an ex- 


ternal 8259A-style controller directly visible to the 
processor and having the 82489DX become trans- 
parent. 


Each interrupt has a vector associated with it, which 
determines the bit position, and hence the priority for 
the interrupt. When an interrupt is being serviced, all 
equal or lower priority interrupts are automatically 
masked by the 82489DX Local Unit. 


The Trigger Mode Register (TMR) indicates for each 
interrupt whether the interrupt is edge or level. This 
information is transmitted with each 82489DX inter- 
rupt request message and reflects the Trigger Mode 
bit in the interrupt's Redirection Table entry. If an 
interrupt goes in service and the TMR bit is 0 (edge), 
then the interrupt's IRR bit is cleared at the same 
time the ISR bit is set. If the TMR bit is 1 (level), then 
the IRR bit is not cleared when the interrupt goes in 
service. In the latter case, the IRR bit mirrors the 
state of the interrupt's input pin. 


The following diagram shows 82489DX operation 
with devices A and B sharing a level triggered inter- 
rupt input. The diagram illustrates how Remote IRR, 
and the IRR bit at the destination 82489DX track the 
state of INTIN. It also illustrates how an EOI is fol- 
lowed immediately be re-raising the interrupt as long 
as the INTIN is still asserted by some device. 


--fi\c 
XOR of 
INTIN 
and 
Remote 
IRR 0\c 


send "level assert" 
send "level deassert" 
message 
message 


Remote 
IRR 
1 
.\ 
_ 


IRR at 
de.t 
Unit 
1 
.\ 
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ISR, IRR, and TMR are read-only by software. Each 
of these 256-bit registers is accessed as four sepa- 
rate 32-bit registers. Note that there is no general 
Interrupt Mask Register (IMR) as in the 8259A. The 
processor masks interrupts temporarily by writing to 
the Icoal unit's Task Priority Register (described 
shortly). 


ISR [Interrupt Status Register] 


I 
Bits [255:01 


TMR [Trigger Mode Register] 
I 
Bits [255:0] 


Figure 7. ISR, IRR, and TMR 


TMR (Trigger Mode Register): 


If° [edge triggered] the corresponding 
IRR bit is automatically cleared when 
interrupt service starts. If 1[level trig- 
gered] this is not the case; instead, the 
source 82489DX must explicitly request 
the IRR bit be cleared (upon deassert 
of the interrupt input pin or upon send- 
ing an appropriate interprocessor inter- 
rupt). Upon acceptance of an interrupt, 
the TMR bit is cleared for edge trig- 
gered interrupts and set for level trig- 
gered interrupts. This information is car- 
ried in the accepted interrupt message. 
The source 82489DX I/O 
unit also 
tracks the state of the destination unit's 
IRR bit (Remote IRR bit in the Redirec- 
tion Table). When a level triggered in- 
terrupt input is deasserted, the source 
82489DX I/O unit detects the discrep- 
ancy between the input pin state and 
the 
Remote 
IRR, and automatically 
sends a message telling the destination 
82489DX to clear IRR for the interrupt. 


IRR (Interrupt Request Register): 
It contains the active interrupt requests 
that have been accepted, but not yet 
dispensed by this 82489DX Local Unit. 
A bit in IRR is set when the 82489DX 
Local Unit accepts the interrupt. When 
TMR is 0, it is cleared when the inter- 
rupt is serviced; when TMR is 1, it is 
cleared when the 82489DX Local Unit 
receives a message to clear it. 


ISR (In Service Register): 
It marks the interrupts that have been 
delivered to the processor, but that 
have not been fully serviced in that an 
End-Of-Interrupt has not yet been reo 
ceived. The ISR register reflects the 
current state of the processor's inter- 
rupt stack. 


Interrupt acceptance proceeds as follows. If the de- 
livery mode is Fixed, then each unit in the destina- 
tion group unconditionally accepts the interrupt mes- 
sage and sets the interrupt's IRR bit. If the delivery •• 
mode is Lowest Priority, then each processor in the 
~ 
group first checks if it is currently the focus of the 
interrupt by checking its ISR and IRR. If an 82489DX 
finds one of these bits set for the incoming interrupt, 
~henthat 82489DX Local Unit accepts the interrupt 
Independent of priority, and "signals" 
the other 
82489DX Local Units to abort the priority arbitration. 
This avoids multiple delivery of a same interrupt oc- 
currence to different processors, consistent with in- 
terrupt delivery semantics in uniprocessor systems 
as described above. If a message is to be delivered 
for NMI or Reset, then all 82489DX Local Units list· 
ed in the destination unconditionally assert/deassert 
the corresponding output pin. ISR, IRR, etc. are by- 
passed for NMI or reset and vector information is 
undefined. 


6.6 Tracking 
Processor 
Priority 


Each 82489DX Local Unit should be programmed 
with task priority so that it can mask interrupts 
that are less priority than that of the processor tem- 
porarily. 


Task switching and task priority changes are the re- 
sult of explicit software action. The operating system 
may define a number of task scheduling classes. Ex- 
amples are an idle class, a background class, a fore- 
ground class, and a time critical class. Alternatively, 
different classes can be assigned to user code 
versus system code. If tasks in different classes are 
executing when an interrupt comes in, then it may be 
advantageous to interrupt the processor currently 
running the task in the least important class. Clearly, 
if one processor is idle while others are doing work, 
the idle processor would be the obvious target for 
servicing the interrupt. This implies that there is use 
in defining priority levels below all interrupt levels 
that can participate in lowest priority delivery selec- 
tion. 


At times, the operating system may need to block 
out interrupts from being serviced. For example, to 
synchronize access to a shared data structure be- 
tween a device driver and its interrupt handler the 
driver raises it priority to equal or higher than the 
interrupt's priority. 


The local 82489DX supports this via its Task Priority 
Register (the 8259A supports this via the interrupt 
mask register (IMR).) Software that wants to make 
use of this is required to inform its 82489DX Local 
Unit of the prioity change by updating the Task Prior- 
ity Register. The Task Priority field is 8 bits providing 
up to 256 distinct priorities. The 4 MSB of this regis- 
ter correspond to the 16 interrupt priorities while the 
4 LSB provide more precision. Priorities are best 
noted as x:y, where x is the value of the 4 MSB and y 
is the value of the 4 LSB. For example, Task Priority 
Register values O:ywith 0 < Y < 15 (and 0 in the 4 
MSB) can be used to represent the priorities of the 
task scheduling classes described above (y = 0 for 
idle; y = 1 for background; etc.). Except for inter- 
rupts with vectors 0 through 15 (which are often pre- 
defined by the processor) which all have priority 0:0, 
the priorities of all other interrupts and their handlers 
is x:O with 1 < x < 15 and is above the base task 
priorities O:y. 


For example, interrupt vector 123 has priority .7:0 
(123/16 = 7) and can be masked by any task that 
raises its priority to a value equal or higher than 7:0. 


82489DX uses Task priority register for the purpose 
'of masking the interrupts. The task priority register 
should be programmed with a priority value to speci- 
fiy the priority of task the processor is executing. 
82489DX masks any interrupts of lower or equal pri- 
ority when compared with task priority. 


When task priority register is programmed with the 
priority 15, all the interrupts are masked. When task 
priority register is programmed with priority level X, 
by definition, all the interrupts of priority X and below 
X will be masked. When task priority register is pro- 
grammed with the priority 0 then all the interrupts 
above priority 0 are allowed to interrupt the proces- 
sor. This means that when task priority register is 
programmed even with the lowest value, i.e., 0, inter- 
rupts of priority 0 will be masked. So only 240 inter- 
rupt vectors should be used in 82489DX. Interrupt 
vecotrs from 0 to 15 should not be used. 


The first priority value computed is the maximum of: 


• Task Priority (4msb : 4lsb) and 
• the priority of the highest order ISR bit set ((vec- 
tor/16) :0). 


The value is used to determine whether or not a 
pending interrupt can be dispensed to the proces- 
sor. 


The second priority value computed is the maximum 
of:' 
• Task Priority (4msb : 4lsb), and 
• the priority of the highest order ISR bit set ((vec- 
tor/16) :0), and 
• the priority of the hi'ghestorder IRR bit set ((vec- 


tor/16) :0). 


This value is used during arbitration as part of low- 
est-priority delivery. 


Task Priority 
Register 


From the information in the Task Priority Register 
and the priority information derived from the ISR and 
IRR register, the 82489DX Local Unit computes two 
additional priority values: 
Bits [31:8] Bits [31:8] are Reserved. They should be 
written O. 


Bits [7:0] Task Priority 
Bits [7:0] are used to specify the task pri- 
ority. 


• 


6.7 
Dispensing 
Interrupts 


DISPENSING 
INTERRUPTS 
TO THE LOCAL 
PROCESSOR 


Once a 82489DX Local Unit accepts an interrupt, it 
guarantees delivery of the interrupt to its local proc- 
essor. (This part of the 82489DX functions similarly 
to an 8~59A.) Dispensing a maskable interrupt to the 
local processor begins when the Local Unit asserts 
the INT pin of its processor. If the processor has 
interrupts enabled, it will respond by issuing an INTA 
cycle. This causes the Local Unit to freeze its inter- 
nal priority state and release the 8-bit vector of the 
highest priority interrupt on the data bus where it is 
read by the processor and used to find the handler's 
entry point. The INT/INTA protocol also causes the 
interrupt's ISR bit to be set. The corresponding bit in 
the IRR register is only cleared if the TMR register 
indicates it should do so (edge triggered interrupts), 
otherwise (level triggered interrupts), IRR is only 
cleared when the Interrupt Input Pin is deasserted. 


Note that it can happen that a level-triggered inter- 
rupt is deasserted right before its INTA cycle. In that 
case, all IRR bits may be clear and the prioritizer 
may not find a vector to give to the processor. To 
satisfy the processor's demand for a vector, instead, 
the 82489DX will return a spurious interrupt vector 
instead. 


A similar situation may occur when the processor 
raises its Task Priority at or above the level of the 
interrupt for which the Processor INT pin is currently 
being asserted. When the INTA cycle is issued, the 
interrupt that was to be dispensed has become 
masked (masked but remembered). 


Dispensing the spurious interrupt vector does not af- 
fect the ISR register, so the handler for this vector 
should just return without EOI. If the vector is shared 
with a valid interrupt, then the handler can read the 
vector's bit in the ISR register to check if it is in- 
voked for the valid interrupt (ISR bit set) or not (ISR 
bit clear). Given the range of 240 vectors, overload- 
ing the spurious interrupt with a valid interrupt is not 
expected to be common practice. The spurious in- 
terrupt vector to be used by a Local Unit is program- 
mable via the Spurious Interrupt Vector Register. 


It is possible that Local Units exist in the system that 
do not have a processor to which to dispense inter- 
rupts. The only danger this represents in the system 


is that if any interrupt is broadcast to all processors 
using lowest priority delivery mode when all proces- 
sors are at the lowest priority, there is a chance that 
a Local Unit without the processor may accept the 
interrupt if this Local Unit happens to have the low- 
est Arb ID at the time. To prevent this from happen- 
ing, all Local Units initialize in the disabled state and 
must be explicitly enabled before they can either 
start accepting or transmitting messages from the 
ICC bus. A disabled 82489DX Local Unit only re- 
sponds to messages with Delivery Modes set to 
"Reset". Reset deassert messages should be sent 
in Physical Destination mode using the target's Lo- 
cal Unit ID since the logical destination information 
in the Icoal units is undefined (all zeroes) when the 
82489DX comes out of Reset. 


Bits [31:9] 
Bit 8 
Bits [7:0] 


Figure 9. Spurious 
Interrupt 
Vector 
Register 


Bits [31 .. 9] Reserved Bits. Should be written O. 


Unit Enable:[Bit 8] 
0: When a 0 is written to this bit, this 
Local Unit gets disabled with regard 
to responding to messages sent as 
well as transmitting on the ICCbus. It 
only responds to messages with De- 
livery Mode set to "Reset". Reading 
a 0 at this bit indicates that the unit is 
disabled. 


1: When a 1 is written to this bit, the 
current Local Unit is enabled for both 
transmitting and receiving unit mes- 
sages. Reading a 1 at this bit indi- 
cates that the unit is enabled. 


Spurious Vector: [Bits 7-0] 
For 
future 
compatibility, 
bits 
[3-0] 
should be 1111. 


Before returning from the interrupt handler, software 
must issue an End-Of-Interrupt (EOI) command to 
the 82489DX Local Unit. The data written to EOI 
register is don't care. This tells the 82489DX to clear 
the highest priority bit in the ISR register since the 
interrupt is no longer in service. Upon EOI, 82489DX 
goes through prioritization returning to the next high- 
est priority activity. This can be a previously inter- 
rupted handler (from ISR), a pending interrupt re- 
quest (from IRR), or an interrupted task (from Task 
Priority). 


Bits [31:0] 


Figure 10. EOI Register 


Bits [31:0]: 
are don't care. 


6.10 
Remote 
Read Register 


Since all 82489DX local Units would typically occu- 
py the same address range, an 82489DX local unit's 
registers can only be accessed by the local proces- 
sor. From a system debugging point of view, this 
would mean that a large amount of state would be- 
come inaccessible if its corresponding processor 
hangs for whatever reason. To assist in the debug- 
ging of MP systems, the 82489DX support a mecha- 
nism that provides read-only access to any register 
in any other 82489DX Icoal unit in the system. 


To read any register in a "remote" 82489DX local 
Unit, the processor writes to the Interrupt Command 
Register specifying a Delivery Mode equal to "Re- 
mote Read". The remote 82489DX is specified in 
the Destination field of the Interrupt Command Reg- 
ister in the usual fashion. Debug software would 
make sure that this selects a single 82489DX only- 
for example by using the target's 82489DX local 
Unit 10 in physical destination mode. Since no vector 
is associated with remote register access, the Vec- 
tor field in the Interrupt Command Register is used 
to select the individual remote 32-bit register to be 
read. The selector value corresponds to the address 
(offset) of the register in the local 82489DX's ad- 
dress space. Sending a "Remote Read" command 
results in sending a message on the ICC bus. The 
destination 82489DX responds by placing the 32-bit 
content of the selected register on the ICC bus. This 
value is read by sending the 82489DX and place it in 
the Remote Register where software can get at it 
using regualr register access to its 82489DX local 
Unit. The Remote Register is software read-only. 
The contents of the Remote Register is valid when 
the Delivery Status in the Interrupt Command Regis- 
ter has become "Idle" again. 


Bits [31:0J 


Figure 11. Remote 
Register 


Bits [31:0) 
Bits [31:0) contain the contents of Re- 
mote Read Register. 


Each 82489DX local Unit contains a hardware Ver- 
sion Register that identifies this 82489DX local Unit 
version. This register is read only. 


Figure 12. Local Version 
Register 


Version: [Bits 7-0) 
This is a version number that identifies 
this version. This field is hardwired and 
is read-only. Will be read as "1" 
for 


82489DX. 
Bits [31:8) 
Bits 31:8 are reserved. 


82489DX local Unit contains one 32-bit wide pro- 
grammable binary timer for use by the local proces- 
sor. The timer can select its clock base from one of 
three possible clock inputs. A timer mode can be 
programmed to operate in either one-shot mode or 
periodic mode. The timer can be configured to inter- 
rupt the local processor with a vector. 


The 82489DX has two independent clock input pins: 
1. The ClK pin provides the clock signal that drives 
the 82489DX's internal operation. 


2. The TMBASE pin allows an independent clock 
signal to be connected to the 82489DX for use by 
the timer functions. 


Signals from both ClK and TMBASE can be used as 
clock inputs that feed the timer. In addition, the 
82489DX contains a divider that can be configured 
to divide either input clock signal. The divider can be 
programmed to divide the selected input clock by 2, 
4, 8, or 16. ClK, TMBASE, and the output of the 
divider together provide three time bases: Base 0, 
Base 1, and Base 2. Base 0 is always equal to ClK: 
Base 1 is always equal to TMBASE; and base 2 is 
one of; ClK/2, ClK/4, ClK/8, ClK/16, TMBASE/2, 
.TMBASE/4, TMBASE/8, or TMBASE/16. The timer 
can independently select one of these three time 
bases as its clock input as depicted in the following 
diagram. 


elK -_+--------_~Bas. 0 


~ 


'--O-IV-IO-E-BY-~ 
B 
2 
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Figure 14. Divider Configuration 
Register 


Bits [31:3] 
Bits 31 to 3 are reserved. They should 
be written O. 


Divider Input: 
[Bit 2] Selects whether divider's input 
connects to the 82489DX Local Unit's 
CLK pin or TMBASE pin. 
0: means the divider takes its input sig- 
nal from CLK, 
1: means use TMBASE. 


Divide By: [Bits 1,0] 
This field selects by how much the divid- 
er divides. 
00: divide by 2 
01: divide by 4 
10: divide by 8 
11: divide by 16 


Software starts a timer going by programming its Ini- 
tial Count Register. The timer copies this value into 
the Current Count Register and starts counting down 
at the rate of one count for each time base pulse. 
The time is one of Base 0, Base 1, or Base 2. 


The timer has a programmable mode which can be 
One-Shot or Periodic. After the timer reaches zero in 
One-Shot mode, the timer simply stays at zero until it 
is reprogrammed. In Periodic mode, the timer auto- 
matically reloads its Current Count from the Initial 
Count and starts counting down again. 


For the timer, interrupt generation can be disabled or 
enabled, and an arbitrary interrupt vector can be 
specified. When enabled and the timer reaches 
zero, an interrupt is generated at the 82489DX Local 
Unit. Timer generated interrupts are always treated 
as edges. They can only generate maskable inter- 
rupts to the local processor. 


A timer set up with its interrupt masked is useful as a 
time base that can be sampled by the local proces- 
sor by reading the Current Count Register, for the 
purpose of measuring the intervals. By mapping the 
82489DX's register space into a read-only user 
page, safe and efficient performance monitoring of 
user programs can be supported. 


If necessary, software may want to ensure that peri- 
odic timer interrupts on the different 82489DX Local 
Units are staggered such that the 82489DXs don't 
all deliver their interrupt (e.g., a timer slice interrupt) 
to their local processor at the same time. This stag- 
gering avoids bursts of contention for shared re- 
sources (bus, cache lines, dispatch queue, locks). 
Randomness occurring "naturally" may be sufficient 
to ensure staggering. 


Figure 15. Initial Count and 
Current Count Registers 
Initial Count: 
Software writes to this register to set 
the initial count for timer. This regis- 
ter can be written at any time. When 
written, its value is copied to the Cur- 
rent Count Register and countdown 
starts or continues from there. The 
Initial Count Register is read-write by 
software. 
Current Count: This is the current count of timer. It is 
read-only by software and can be 
read at any time. 


The timer is configured via its Local Vector Table 
entry shown below (see also Interrupt Control in this 
section). 


Vector: [Bits 7-0] 
This is the 8-bit interrupt vector to be 
used when timer generates an inter- 
rupt. 


Delivery Status: [Bit 12] 
Delivery Status is a 1-bit field that con- 
tains the current status of the delivery 
of this interrupt. Two states are de- 
fined: 
0: (Idle) means that there is currently 
no activity for this interrupt; 
1: (Send Pending) indicates that the in- 
terrupt has been injected, but its de- 
livery is temporarily held up by other 
recently 'injected interrupts that are 
in the process of being delivered; 
Delivery Status is software read- 
only; software writes to this field (as 
part of a 32-bit word) do not affect 
this bit. 


Bits 15-13: 
Reserved. Should be written O. 


MASK: [Bit 16] 
This bit serves to mask timer interrupt 
generation. 
0: means 
not 
masked, 
when 
timer 
reaches 0, it generates an interrupt 
with vector at the 82489DX Local 
Unit 
1: means masked, and no interrupt is 
generated. 


Timer Mode: [Bits 17] 
This field indicates the operation mode 
of timer. 
0: (One-Shot): the Current Count Reg- 
ister remains at zero after the timer 
reaches zero, and software needs to 
reassign the timer's 
Initial Count 
Register to rearm the timer. 


1: (Periodic): when the timer reaches 
zero, the Current Count Register is 
automatically reloaded with the val- 
ue in the Initial Count Register, and 
the timer counts down again. 
Timer Base: [Bits 19,18] 
This field selects the time base input to 
be used by timer. 
00: (Base 0) uses "CLKIN" as input; 
01: (Base 1) uses "TMBASE"; 
10: (Base 2) uses the output of the di- 
vider (Base 2). 
Bits [31:20] 
Bits [31:20] are Reserved. Should be 
written O. 


The I/O Unit indirect addressing scheme uses two 
registers directly mapped into the processor's ad- 
dress space: the I/O Register Select register and 
the I/O Window register. The I/O register select reg- 
ister selects which I/O unit Register appears in the 
I/O Window register where it can be manipulated by 
software. 


Figure 17.1/0 Register Select Register 


Bits [31:8]: Reserved. Should be written O. 
Bits [7:0]: 
I/O REGISTER SELECT: This register. 
selects an 82489DX I/O unit register. 
The contents of the selected 32-bit reg- 
ister can be manipulated via the I/O 
Window Register. The I/O Register Se- • 
lect register is read-write by software. 
~ 


Figure 18.1/0 Window Register 


Bits [31:0] I/O WINDOW REGISTER: This register 
is mapped onto the I/O Unit's register 
selected by the I/O Register Select reg- 
ister. Readability/writability by software 
is determined by the I/O unit register 
that is currently selected. 
The addresses (offsets to a platform-de- 
fined base address) of all registers are 
listed in the register summary section. 
Note that register offsets are aligned on 
128-bit boundaries; in other words, regis- 
ters are located only at every fourth 
32-bit address. This eliminates the need 
for lane-steering glue logic when con· 
necting the 82489DX's 32-bit data bus to 
a wider (64-bit and 128-bit) bus. 


I/O 
Unit 10 Register 


Each 82489DX I/O Unit has a register that contains 
the I/O Unit's 8-bit ID. The I/O unit ID serves as a 
physical name of the 82489DX I/O Unit. It is used in 
arbitrating for ICC bus ownership when the I/O unit 
wants to access the ICC bus for sending any inter- 
rupt message. Unlike the local unit ID, the I/O unit ID 
is not latched-in from the address bus during hard- 
ware reset. The I/O unit ID is set to 0 during reset. 
The software has to write different ID into the I/O 
Units before starting interrupt messages on the ICC 
bus. 


Bits [31:24] I/O Unit ID: 


The I/O unit ID serves as the physical 
"name" of the 82489DX unit used for 
arbitration purposes for the ICC bus us- 
age. 
In 
a 
system 
with, 
say, 
four 
82489DX, there are 4 Local Units and 4 
I/O Units. All the 8 units should be as- 
signed different ID. The IDs should start 
with 0 and each unit should have differ- 
ent ID. 


Bits [23:0] 
Bits 23..0 are reserved. Should be writ- 
ten O. 


I/O 
Unit Version 
Register 


Each 82489DX I/O Unit contains a hardware Ver- 
sion Register that identifies this 82489DX I/O unit 
version. This register is read only. 


I/O Unit Version Register 
I 
Bits [31:24] 
I 
Bits [23:16] 
I 
Bits [15:8] 
I 
Bits [7:0] 
I 


Version: [Bits 7-0] 
This is a version number that identifies 
this version. This field is hardwired and 
is read-only. Will be read as "1" 
for 
82489DX. 
Bits [15:8] 
Bits [15:8] are reserved. 


Max Redir Entry: [Bits 23-16] 
This is the entry number (0 being the 
lowest entry) of the highest entry in the 
Redirection Table. It is equal to the 
number of Interrupt Input Pins minus 
one of this I/O Unit. This field is hard- 
wired and is read-only. 
In the 82489DX I/O unit this is read as 
15. 


Bits [31:24] Bits [31:24] are reserved. 


Redirection 
Tables 


The Redirection Table has a dedicated entry for 
each interrupt input pin. Unlike IRQ pins of the 
8259A, the notion of interrupt priority is completely 
unrelated to the position of the physical interrupt in- 
put pin on the 82489DX. Instead, software can de- 
cide for each pin individually what it wants the vector 
(and therefore the priority) of the corresponding in- 
terrupt to be. For each individual pin, the operating 
system can also specify whether the interrupt is sig- 
naled as edges or levels, as well as the destination 
and delivery mode of the interrupt. The information 
in the Redirection Table is used to translate the in- 
terrupt manifestation on the corresponding interrupt 
pin into an inter-82489DX message. 


In order for a signal on an edge-sensitive Interrupt 
Input pin to be recognized as a valid edge ( and not 
a glitch) the input level on the pin must remain as- 
serted until the time 82489DX I/O Unit sends the 
corresponding message over the ICC bus. Only then 
will the source 82489DX be able to recognize a new 
edge on that Interrupt Input pin. That new edge will 
only result in a new invocation of the handler if its 
acceptance by the destination 82489DX causes the 
Interrupt Request Register bit to go from 0 to 1. (In 
other words, if the interrupt wasn't already pending 
at the destination.) 


82489DX I/O unit has 16 Redirection Table entries. 
The layout of an entry in the Redirection Table is as 
follows: 


Vector (Bits [7:0] 
Interrupt vector for this interrupt 


Mask (Bit 16) 
0: Not Masked 
1: Masked 
Bits [31:17] 
Reserved. Should be written O. 


000: (Fixed) means deliver the signal 
on the INT pin of all processors 
listed in the destination. Trigger 
Mode for "fixed" 
Delivery Mode 
can be edge or level. 


001: (Lowest Priority) means deliver 
the signal on the INT pin of the 
processor that is executing at the 
lower priority among all the proc- 
essors listed in the specified des- 
tination; Trigger Mode for "lowest 
priority" 
Delivery Mode can be 
edge or level. 


100: (NMI) means deliver the signal on 
the NMI pin of all processors list- 
ed in the destination, vector infor- 
mation 
is 
ignored. 
A 
Delivery 
Mode equal to "NMI" requires a 
"level" Trigger Mode. 


101: (Reset) means deliver the signal 
to all processors listed in the des- 
tination by asserting/deasserting 
the 82489DX's Reset output pin. 
All addressed 82489DXs' Local 
Units will assume their reset state 
but preserve their unit 10. one. 
side effect of a unit message with 
~ 
Delivery Mode equal to "Reset" 
that results in a deassert of reset 
is that all 82489DXs' Local Units 
(whether listed in the destination 
or not) will reset their lowest-prior- 
ity tie breaker arbitration ID to 
their unit ID (see the section on 
the ICC bus for details). A Deliv- 
ery Mode of "Reset" 
requires a 
"level" Trigger Mode. 


111: (ExtINT) means deliver the signal 
to the INT pin of all processors 
listed in the destination as an in- 
terrupt that originated in an exter- 
nally connected (8259A-compati- 
ble) interrupt controller. The Local 
Unit receiving this interrupt will ac- 
tivate ExtlNTA in response to this 
interrupt 
message. 
A 
Delivery 
Mode of 
"ExtINT" 
requires an 
"edge" 
Trigger Mode. (See the 
section on Compatibility for de- 
tails.) 


Delivery Mode (Bits [10:8]) 


000: Fixed 
001: Lowest Priority 
010: <reserved> 
011: <reserved> 
,100: NMI 


101: Reset 
110: < reserved> 
111: ExtlNT 


Destination Mode (Bit 11) 
0: Physical 
1: Logical 


Delivery Status (Bit 12) 
0: Idle 
1: Send Pending 
Bit 13 
Bit 13: Reserved. Should be written O. 
Remote IRR (Bit 14) 
Reflects the Remote IRR bit 
0: Remote IRR bit is clear. 
1: Remote IRR bit is set. 


Trigger Mode (Bit 15) 
0: Edge 
1: Level 


DESCRIPTIONS 


Vector: [Bits 7-0] 


The vector field is an 8-bit field con- 
taining the interrupt for this interrupt. 


Delivery Mode: [Bits 10-8] 
The Delivery Mode is a 3-bit field that 
specifies how the 82489DXs listed in 
the destination field should act upon 
reception 
of this signal. Note that 
remote read is not supported for I/O 
device interrupts. Note that certain 
Delivery Modes will only operate as in- 
tended when used in conjuction with a 
specific Trigger Mode. These restric- 
tions are indicated for each Delivery 
Mode. 


Destination Mode [Bit 11] 
This field determines the interpretation 
of the Destination field. 
0: (Physical Mode): in Physical Mode, 
a destination 82489DX Local Unit is 
identified by its unit 10. Bits 56 
through 63 (8 MSB of the destina- 
tion field) specify the 8-bit unit 10. 
1: (Logical Mode): in Logical Mode, 
destinations 
are 
identified 
by 
matching on Logical Destination un- 
der the control of the Destination 
Format Register in each 82489DX 
Local Unit. The 32-bit Destination 
field is the logical destination. 


Delivery Status: [Bit 12] 
Delivery Status is a 1-bit field that con- 
tains the current status of the delivery 
of this interrupt. Two states are de- 
fined: 
0: (Idle) means that there is currently 
no activity for this interrupt; 
1: (Send Pending) indicates that the 
interrupt has been injected, but its 
delivery is temporarily held up by 
other 
recently 
injected 
interrupts 
that are in the process of being de- 
livered; Delivery Status is software 
read-only; software writes to this 
field (as part of a 32-bit word) do not 
affect this bit. 


Bit 13: 
Bit 13 is Reserved. Should be written O. 


Remote IRR: [Bit 14] 
This bit is used for level triggered inter- 
rupts; its meaning is undefined for 
edge-triggered interrupts. Remote IRR 
mirrors the interrupt's IRR bit of the 
destination 82489DX Local Unit. When 
the value of the bit disagrees with the 
state of the Interrupt Input line, a unit 
message is automatically sent to make 
the destination's IRR both reflect the 
new state of the Interrupt Input line, 
and then the Remote IRR bit is updat- 
ed to track its associated IRR bit. Re- 
mote IRR is software read-only; soft- 
ware writes to this bit do not affect it. 


Trigger Mode: [Bit 15] 
The Trigger Mode field indicates the 
type of signal on the interrupt pin that 
triggers an interrupt. 


0: indicates edge sensitive, 
1: indicates level sensitive. 


Mask: [Bit 16] 
Use this bit to mask injection of this in- 
terrupt. 


0: indicates that injection of this inter- 
rupt is not masked. An edge or level 
on 
an 
interrupt 
pin that 
is 
not 
masked results in the delivery of the 
interrupt to the destination. 
1: indicates that injection of this inter- 
rupt is masked. Edge-sensitive in- 
terrupts signaled on a masked inter- 
rupt Input pin are simply ignored 
(I.e., it is not delivered and is not 
held 
pending). 
Level-asserts 
or 
deasserts occurring on a masked 
level-sensitive pin are also ignored 
and have no side effects. As ex- 
pected, changing the mask bit from 
unmasked to masked while the lev- 
el remains asserted has the side ef- 
fect of deasserting the level. It is 
software's responsibility to deal with 
the case where the Mask bit is set 
after the 
interrupt 
message 
has 
been sent but before the interrupt is 
dispensed to the processor. 
Bits [31:17] 
Bits [31:17] are reserved. Should be 
written O. 


Destination: If the Destination Mode of this entry is 
"Physical Mode", then the 8 MSB [bits 
56 through 63] contain an 82489DX 
Local Unit 10. If Logical Mode, then the 
Destination field potentially defines a 
set of processors. The interpretation of 
the 32-bit destination field is further en- 
abled by the Destination Format Regis- 
ter in the 82489DX Local Units. 
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Physical Characteristics 


The lee 
bus is a 5-wire synchronous 
bus connecting 


all 82489DXs 
(all I/O Units and all Local Units). Four 


of these 
five wires 
are used for data transmissions 


and arbitration, 
and one wire is a clock. The descrip- 


tion refers to the logical state of the lee 
bus. Electri- 


cal levels 
are just 
inverse 
of the 
logical 
state 
de- 


scribed. 
For example, 
the section 
describes 
that the 


lee 
bus is 0000 when not transmitting 
any message. 


This refers to logical 
state. 
Electrically, 
the lee 
bus 


is 1111 when not transmitting 
any message. 


The bus is electrically 
an open-drain 
connection 
pro- 


Viding for both bus use arbitration 
and arbitration 
for 


lowest 
priority. 
Being open-drain, 
the bus is run at a 


"comfortable" 
speed 
such 
that 
design-specific 
ter- 


mination 
tuning 
is not required. 
Furthermore, 
each 


82489DX 
receiving 
a message 
or participating 
in an 


arbitration 
must be given enough time in a single bus 


cycle to latch the bus and perform 
some simple logic 


operations 
on the latched 
information 
in order to de- 
termine 
whether 
the next drive cycle must be inhibit- 


ed. 


Note 
that 
it is likely 
in MP systems 
that 
additional 


processors 
be located 
on plug-in 
boards. 
Since the 


lee 
bus 
would 
be 
part 
of 
the 
connector, 
the 


82489DX 
to lee 
bus connection 
is defined 
so that it 


can 
be electrically 
isolated 
using 
external 
drivers. 


The 82489DX 
has separate 
lee 
bus input and out- 


put 
pins 
that 
can 
be connected 
externally 
to 
the 


82489DX 
to either provide 
or not provide 
isolation. 


The isolation 
can also be used to provide 
a hierar- 


chical 
connection 
of lee 
buses electrically 
support- 


ing large 
numbers 
of processors. 
The 
number 
of 


82489DXs 
supported 
using the hierarchical 
connec- 


tion is limited 
only by lee 
bus bandwidth. 
It should 


be 
noted 
that 
lee 
bus 
output 
low 
current 
is just 


4 mA. 


Bus Arbitration 


Arbitration 
(both for use of the bus and for determin- 


ing the .lowest 
priority 
82489DX) 
depends 
on 
all 


82489DX 
message 
units 
operating 
synchronously. 


To deal with the event 
where 
multiple 
agents 
start 


transmitting 
simultaneously, 
a distributed 
arbitration 


approach 
is used. Bus arbitration 
uses a small num- 


ber 
of 
arbitration 
cycles 
in the 
lee 
bus. 
During• 


these cycles, arbitration losers progressively drop 
off the bus until only the winner remains transmitting. 
The winner then transmits its actual inter-unit mes- 
sage. Once the sending of a message (including bus 
arbitration) has started, any possible contender must 
suppress transmission until enough cycles have 
elapsed for the message to be fully sent. The num- 
ber of message cycles depends on the type of mes- 
sage being sent. 


A bus arbitration cycle starts by the agent driving its 
unit ID on the ICC bus. High-order ID bits are driven 
first, successive cycles proceeding to the low bits of 
the ID. All losers in a given cycle drop off the bus, 
using every subsequent cycle as a tie breaker for the 
previous cycle. By the time all arbitration cycles are 
completed, there will be only a single agent left driv- 
ing the bus. 


The 8-bit unit ID (1716151413121110)is chopped up 
in successive groups of 2 bits (17 16)(1514)(1312) 
(1110).Each of these tuples is first decoded before 
driving them on the bus. The Osand 1 indicate logi- 
cal levels and not signal levels. The ICC bus is 0000 
when not transmitting any message. The decoding 
used is: 


IDTupie 
ICC8us 
---+ 
(Hi + 1] 
IUl) 
83 
82 
81 
80 


0 
0 
---+ 
0 
0 
0 
1 


0 
1 
---+ 
0 
0 
1 
0 


1 
0 
---+ 
0 
1 
0 
0 


1 
1 
---+ 
1 
0 
0 
0 


Note that the pattern generated on the ICC bus by 
tuple (13 12)will be represented as i32 i32 i32 i32. 
The lower case signifies this encoding. 


Each tuple of the ID only contributes to a single wire, 
making it possible for an agent to determine with 
certainty whether to "drop off" or to continue arbi- 
trating in the next cycle for the following two bits of 
the unit ID simply by checking whether the bus line 
the agent is driving is also the highest order 1 on the 
bus. Each ICC bus cycle therefore arbitrates 2 bits. 
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Lowest-Priority Arbitration 


Arbitration is also used to find the 82489DX Local 
Unit with the lowest processor priority. Lowest-priori- 
ty arbitration uses the value of the 82489DX's Proc- 
essor Priority value appended with an 8-bit Arbitra- 
tion ID (Arb ID) to break ties in case there are multi- 
ple units executing at the lowest priority. 


Using the constant 8-bit unit ID as the Arb ID has a 
tendency to skew symmetry since it WQuid favor 
82489DXs with low ID values. An 82489DX Local 
Unit's Arb ID is therefore not the unit ID itself but is 
derived from it. At reset, an 82489DX Local Unit's 
Arb ID is equal to its unit ID. Each time a message is 
broadcast over the ICC bus in lowest priority mode, 
all 82489DX Local Units increment their Arb ID by 
one, which gives them a different Arb ID value for 
the next arbitration. The Arb ID is then endian-re- 
versed (LSB becomes MSB, etc.) to ensure better 
rotation of which 82489DX gets to have the lowest 
Arb ID next time around. The reversed Arb ID is then 
decoded to generate arbitration signals on the ICC 
bus as described above. 


To support hot insertion of processor boards in a 
running MP system, a mechanism is provided to al- 
low the 82489DX of the added processor to syn- 
chronize its Arb ID with the existing 82489DXs. This 
is accomplished by broadcasting a message with 
Delivery Mode equal to "Reset", Trigger Mode equal 
to "Level", 
and Level equal to o. This message 
must 
be 
broadcast 
before 
the 
newly 
added 
82489DX is allowed to participate in a lowest-priority 
arbitration. Depending on the exact sequence under 
which the newly inserted board is powered-up and 
initialized, this Arb ID synchronization may occur nat- 
urally if a Reset-deassert to the new 82489DX is part 
of that sequence. If not, the local processor can al- 
ways send this as an inter processor interrupt (with a 
null destination), causing only Ihe side effect of re- 
setting all 82489DX Arb IDs. 


The short message format is described first. Note 
that the first 19 cycles of both short and long mes- 
sage formats have the same interpretation. 


ICCbus arbitration 
1: i76 i76 i76 i76 ICCbus arbitration 
1: 
i76 
i76 
i76 
i76 
2: i54 i54 i54 i54 
2: 
i54 
i54 
i54 
i54 
3: i32 i32 i32 i32 
3: 
i32 
i32 
i32 
i32 
4: i10 i10 i10 i10 
4: 
i10 
i10 
i10 
i10 
<message body> 
5: 
DM 
M2 
M1 
MO 


6: 
HO" 
"0" 
L 
TM 
7: 
V7 
V6 
V5 
V4 
8: 
V3 
V2 
V1 
VO 


destination mode and 
delivery mode 
control bits 
vector 
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9: 
D31 
D30 
D29 
D28 
destination 
10: 
D27 
D26 
D25 
D24 
11: 
D23 
D22 
D21 
D20 
12: 
D19 
D18 
D17 
D16 
13: 
D15 
D14 
D13 
D12 
14: 
D11 
D10 
D09 
D08 
15: 
D07 
D06 
D05 
D04 
16: 
D03 
D02 
D01 
DOO 
17: 
C 
C 
C 
C 
checksum 
for 
cycle 5 through 
16 
18: 
"1" 
"1" 
"1" 
"1" 
post amble 
19: 
A 
A 
A 
A 
accept 
(1000 if 
OK, 1110 if 
preempt, 
else 
error) 
20: 
"0" 
"0" 
"0" 
"0" 
idle 1 
21: 
"0" 
"0" 
"0" 
ICO" 
idle 2 


Cycles 
1 through 
4 are bus arbitr~tion 
as descri~ed 
earlier. 
Cycle 
5 (DM M2 M1 MO) IS the Destination 
Mode which is 0 for Physical 
mode and 1 for Logical 
Mode, and the Delivery 
Mode of the message. 
The 
encoding 
used for the Delivery Mode in the message 
is identical 
to the 
encoding 
used 
for the 
Delivery 
Mode in the Redirection 
Table, 
Local Vector 
Table, 
and Interrupt 
Command 
Register. 


M2 
M1 
MO 
Delivery 
Mode 


0 
0 
0 
Fixed 


0 
0 
1 
Lowest 
Priority 


0 
1 
0 
<reserved> 


0 
1 
1 
Remote 
Read 


1 
0 
0 
NMI 


1 
0 
1 
Reset 


1 
1 
1 
ExtlNT 


Cycle 
6 contains 
the Control 
Bits of the 
message. 


The control 
bits are: 


• 
TM (Trigger 
Mode): 
indicates 
whether 
this mes- 
sage corresponds 
to an edge or level; 


• 
L (Level): indicates 
whether 
this is an Assert 
or a 
Deassert 
of a "level" 
signal. L is undefined 
when 
TM is edge. 


6: "0" 
"0" 
L TM 
Control 
Bits 
TM = Trigger 
Mode (0 = edge, 
1 = level) 
L = Level (0 = deassert, 
1 = assert) 


The length of the message 
is derived from the Deliv- 


ery Mode, the Control 
Bits, and the Accept 
cycle of 
the message. 


Edge 
Level = Assert 
Level = Deassert 


Fixed 
Short 
Short 
Short 


Lowest Priority 
Short (1110) 
Short (1110) 
Short (1110) 


Long (1000) 
Long (1000) 
Short 


Remote 
Read 
Long 
Long 
Short 


NMI 
Short 
Short 
Short 


Reset 
Short 
Short 
Short 


ExtlNTA 
Short 
Short 
Short 


Cycles 
7 and 8 are the 8-bit 
interrupt 
vector. 
The 
vector 
is only defined 
for Delivery 
Modes 
Fixed, and 
Lowest-priority. 
For 
Delivery 
Mode 
of 
"Remote 
Read", 
the vector 
field contains 
the address 
of the 
register 
to be read remotely. 


If DM is 0 (physical 
mode), then cycles 9 and 10 are 
the unit ID and cycles 
11 through 
16 are zero. If DM 
is 1 (logical 
mode), then cycles 9 through 
16 are the 
32-bit Destination 
field. The interpretation 
of the logi- 
cal mode 32-bit Destination 
field is performed 
by the 
Local 
Units 
using the Destination 
Format 
Register. 
The sending 82489DX 
knows whether 
it should (incl) 
or should 
not (excl) respond 
to its own message. 


Cycle 
17 is a checksum 
over the data 
in cycles 
5 
through 
16. The checksum 
is computed 
by adding all 
4-bit quantities 
of cycles 
5 through 
16, feeding 
carry 
out of the MSB back into the LSB. This protects 
the 
data in these cycles against transmission 
errors. The 
(single) 
82489DX 
driving the message 
provides 
this 
checksum 
in cycle 
17. 


Cycle 18 is a post amble cycle driven as 1111 by the 
sending 
82489DX 
allowing 
all 82489DXs 
to perform 
various 
internal 
computations 
based on the informa- 
tion contained 
in the receiv~d 
message. 
One of the 
computations 
takes the computed 
checksum 
of the 
data received 
in cycles 
5 through 
16 and compares 
it against the value in cycle 17. If any 82489DX 
com- 
putes 
different 
checksum 
than 
the 
one 
passed 
in 
cycle 
17, then that 82489DX 
will signal an error on 
the ICC bus in cycle 
19 by driving it as 1111. If this 
happens, 
all 82489DXs 
will 
assume 
the 
message 
was never sent and the sender 
must try sending the 
message 
again, which 
includes 
re-arbitrating 
for the 
ICC bus. In lowest priority delivery when the interrupt 
has a focus 
processor, 
the focus 
82489DX 
will sig- 
nal this by driving 
1110 during cycle 
19. This tells all 
the other 
82489DXs 
that the interrupt 
has been ac- 
cepted, 
the 82489DXs 
is preempted, 
and short mes- 
sage format 
is used. All (non-focus) 
82489DXs 
will 
drive 
1000 in cycle 
19. Under 
lowest 
priority 
mode, 
1000 implies that the interrupt 
currently 
has no focus 
processor 
and that priority 
arbitration 
is required 
to 
complete 
the delivery. 
In that 
case, 
long 
message 
format 
is used. 
If cycle 
19 is 1000 for non Lowest 
Priority mode, then the message 
has been accepted 
and is considered 
sent. 


19:EEEE 


1000 
OK 
1110 
preempt 


<others> 
error (drive error as 1111) 


When an 82489DX 
detects 
and reports an error dur- 
ing the error cycle, that 82489DX 
will simply listen to 
the 
bus 
until 
it encounters 
two 
consecutive 
idle 
(0000) 
cycles. 
These 
two 
idle cycles 
indicate 
that 
the message 
has passed 
and a new message 
may 
be started 
by anyone. 
This allows 
an 82489DX 
that 
got itself out of cycle on the ICC bus to get back in 
sync with the other 82489DXs. 


Long Message 
Format 


Cycles 
1 through 
19 of the long message 
format 
are 
identical 
to cycles 
1 through 
19 of the short 
mes- 
sage format. 
As mentioned, 
long message 
format 
is 
used in two cases: 


(1) 
Lowest 
Priority delivery 
when the interrupt 
does 
not have a focus. Cycles 20 through 
27 are eight 
arbitration 
cycles 
where 
the 
destination 
82489DXs 
determine 
the 
one 
82489DX 
with 
lowest 
processor 
priority/ARB 
ID value. 


(2) Remote 
Read messages. 
Cycles 
20 through 
27 
are the 32-bit content 
of the remotely 
read regis- 
ter. This information 
is driven on the bus by the 
remote 
82489DX. 


Cycle 28 is an Accept 
cycle. 
In lowest 
priority 
deliv- 
ery, all 82489DXs 
that did not win the arbitration 
(in- 
cluding 
those 
that did not participate 
in the arbitra- 
tion) drive cycle 28 with 1000 (co accept), 
while the 
winner 82489DX 
drives 1111. If cycle 28 reads 1111, 
then all 82489DXs 
know that the interrupt 
has been 
accepted 
and the message 
is considered 
delivered. 
If cycle 28 reads 1100 (or anything 
but 1111 for that 
matter), 
then 
all 82489DXs 
assume 
the 
message 
was unaccepted 
or an error occurred 
during arbitra- 
tion. 
The 
message 
is considered 
undelivered, 
and 
the sending 
82489DX 
will try delivering 
the message 
again. 


For Remote 
Read messages, 
cycle 
28 is driven 
as 
1100 
by all 8 2489DXs 
except 
the 
responding 
re- 
mote 82489DX, 
who drives the bus as 1111 in case 
it was able to successfully 
supply the requested 
data 
in cycles 
20 through 
27. If cycle 
28 reads 
1111 the 
data in cycles 20 through 
27 is considered 
valid; oth- 
erwise, 
the 
data 
is considered 
invalid. 
The 
source 
82489DX 
that 
issued the Remote 
Read uses cycle 
28 
to 
determine 
the 
state 
of 
the 
Remote 
Read 
Status field in the Interrupt 
Command 
Register 
(valid 
or invalid). 
In any case, a Remote 
Read request 
is 
always successful 
(although 
the data may be valid or 
invalid) 
in that a Remote 
Read is never retried. The 
reason for this is that Remote 
Read is a debug fea- 
ture, and a "hung" 
remote 
82489DX 
that is unable 
to respond 
should 
not cause the debugger 
to hang. 


Cycles 29 and 30 are two idle cycles. The ICC bus is 
available 
for sending 
the next message 
at cycle 31. 


The two idle cycles at the end of both short and long 
messages, 
together 
with non zero (Le., non idle) en- 
coding for certain 
other bus cycles allow an ICC bus 
agent that happens 
to be out of phase by one cycle 
to sync back 
up in one message 
simply 
by waiting 
for 
two 
consecutive 
idle 
cycles 
after 
reporting 
its 
checksum 
error. This makes use of the fact that val- 
id arbitration 
cycles 
are never 0000. 


1: 
i76 
2: 
i54 
3: 
i32 
4: 
i10 
5: 
DM 
6: 
"0" 


7: 
V7 
8: 
V3 
9: 
D31 
10: 
D27 
11: 
D23 
12: 
D19 
13: 
D15 
14: 
D11 
15: 
D07 
16: 
D03 
17: 
C 


18: 
"1" 


19: 
A 


i76 
i76 
i76 
i54 
i54 
i54 
i32 
i32 
i32 
i10 
i10 
i10 
M2 
M1 
MO 
"0" 
L 
TM 
V6 
V5 
V4 
V2 
V1 
VO 
D30 
D29 
D28 
D26 
D25 
D24 
D22 
D21 
D20 
D18 
D17 
D16 
D14 
D13 
D12 
D10 
D09 
D08 
D06 
D05 
D04 
D02 
D01 
DOO 
C 
C 
C 


21: 
p54 
22: 
p32 
23: 
p10 
24: 
a76 
25: 
a54 
26: 
a32 
27: 
a10 
28: 
A 
29: "00" 
30: 
"0" 


p54 
p54 
p32 
p32 
p10 
p10 
a76 
a76 
a54 
a54 
a32 
a32 
a10 
a10 
A 
A 
"0" 
"0" 
"0" 
"0" 


delivery mode 
control 
bits 
vector 


p54 
p32 
p10 
a76 
a54 
a32 
a10 
A 


checksum 
for cycles 5 
through 
16 
postamble 
accept 
(1000 if OK, 
1110 if preempt, 
else error) 
lowest priority 
arbitration 
or 32 bits 
of remote register 
processor 
priority 


accept 
idle1 
idle2 


This section 
covers 
the following: 


- 
Timing 
Diagram 
Notation 


- 
82489DX 
Register 
Access 
Timing 
Diagrams 
with 
Descriptions 


A block 
diagram 
of the 
configuration 
of the 
CPU 
module 
of a MP system 
is shown. 
This in no way is 
intended 
to 
be 
a 
complete 
representation 
of 
486/1ntel 
Cache/Intel 
Cache 
Controller 
connec- 
tions. It is intended 
to show all the 82489DX 
connec- 
tions, 
and how they connect 
to other 
components 
on and off the module. 
This module 
has arbitrarily 
been drawn with a 64-bit data bus to show how the 
expanded 
address 
space 
architecture 
fits. The unit 
can be similarly 
attached 
to either a 32-bit or 128-bit 
data bus, with total transparency 
to shrink-wrap 
soft- 
ware. 


In this 
configuration, 
the 
82489DX 
uses 
the 
same 
clock source as the processor 
and cache. 
However, 


it is quite possible 
to consider 
82489DX 
as a memo- 
ry bus device 
and hence 
supply 
82489DX 
with the 
memory 
bus clock, 
which 
can 
be slower 
than 
the 
CPU module 
clock frequency. 


In the configuration 
shown, the processor's 
INT and 
NMI pins could be supplied 
by other source 
to allow 
for the possibility 
that 
82489DX 
can be totally 
by- 
passed 
if desired, 
by allowing 
those 
signals 
to be 
• 
driven from off the module while the 82489DX 
is dis- 
, 
abled. 
The reset signal 
generated 
by the 82489DX 
goes to the MBC (memory 
bus controller) 
which 
is 
required 
to drive 
configuration 
lines 
at reset 
time. 
This would 
probably 
be configured 
as a "warm" 
re- 
set by the MBC. 


A future 
version 
of cache 
controller 
may generate 
the chip select for 82489DX 
at a fixed memory 
loca- 
tion of hexFEEOOOOO.By having the cache controller 
to provide 
the chip select signal, it would encourage 
a standard 
mapping 
for 82489DX 
address 
space. 
In 
some MBC designs, 
this signal should be connected 
to the MBC since 82489DX 
cycles 
limit bus pipelin- 
ing by constraining 
how soon the next bus cycle can 
come. 
The 82489DX 
chip select 
can be generated 
by the MBC completely. 


The address, 
data and most of the bus control 
sig- 
nals share the respective 
bus with cache and cache 
controller. 
The block diagram 
shows 
attachment 
for 
only 6 address 
lines: A4-A9. 
A10 should 
be O. This 
is all the 82489DX 
needs 
for operation, 
however, 
if 
the address 
lines are used to initialize 
82489DX 
lo- 
cal ID at reset 
time, 
8 address 
lines 
are required, 
A3-A1O. 


The 82489DX 
has separate 
ICC bus input and out- 
put pins to facilitate 
using external 
drivers. 
The ICC 
bus 
input 
pins 
(MBI0-3) 
are TTL-Ievel 
compatible 
CMDS 
inputs. The output 
pins (MBDO-3) 
are open- 
drain 
pins 
which 
required 
external 
pull-ups. 
The 
open-drain 
output 
buffers 
are small buffers 
with: 


Sink current of < 4 mA. Special consideration 
must be exercised 
when driving large capaci- 
tive loads or long transmission 
lines. The pull- 
up resistor 
and the capacitive 
load constitute 
RC time 
constant 
that 
will affect 
the output 
transition 
times. This in turn will limit the oper- 
ating frequency 
of the ICC bus. 


When designing 
in the ICC bus, one needs to 
consider 
the loads that each 82489DX 
will be 
driving 
and 
whether 
external 
drivers 
should 
be used. In most situations, 
the ICC bus driv- 
en high (MBD pins pulled high by the external 
pull-up 
resistors) 
poses 
the 
most 
challenge. 


Simulating 
the target 
design 
on an electrical 
simulator 
(such as SPICE) will help greatly. 


as shown 
in the following 
examples. 


Figure 
21 a and 21 b are first order 
input buffer 
and 
output 
buffer 
models 
of the MBI and MBD pins. The 
open-drain 
of the 
MBD 
is modeled 
as a switch 
as 
the primary interest 
here is the MBD pins going high. 
These 
models 
can be used on SPICE simulations 
to 


obtain 
first 
order 
behaviors. 
The 
parameters 
for 


these 
models 
are as follows: 


Cp (package 
capacitance) 
= 3 pF 
Lp (package 
inductance) 
= 15 nH 
Rb (bond wire resistance) 
= 0.080 
Ci (input buffer capacitance) 
= 3 pF 
Co (output 
buffer capacitance) 
= 6 pF 
Ro (output 
buffer 
impedance) 
= 300-800 


MBO Pull-up Resistor 


To minimize the RC time constant, 
one would like to 
use the smallest 
pull-up resistor 
value possible. 
The 
MBD 
pins has a worst 
case 
lol-spec 
of 4 mA and 
Vcc = 4.75V. This translates 
to a minimum 
pull-up 
of 
about 
1 KO. 
Where 
stronger 
drive 
is needed 
(smaller 
pull-up 
resistors), 
external 
drivers 
must be 
used. 


Driving 
Lumped 
Capacitance 


In systems 
where 
external 
drivers 
are not used, the 
MBI pins will be tied to the MBD pins. Figure 21d is a 
SPICE 
simulation 
of the 
MBD 
output 
with 
a 1 KO 
pull-up 
driving 
lumped 
capacitive 
loads from 
10 pF 
to 150 pF. 


At a load of 50 pF, it takes about 30 ns to charge 
up 
to 2V. At 100 pF, it takes an additional 
25 ns. Figure 
21d can be used to estimate 
the 
loading 
delay 
at 
different 
lumped 
capacitive 
loads. 


~~ 
~dt 


intel~ 


In real systems, the loads are made up of lumped 
capacitance and transmission lines. More accurate 
results can be obtained using transmission line mod- 
els. 


Driving Transmission 
Lines 


Two device 
model 


In this example the ICC bus is a signal line on an 
FR-4 printed circuit board. The line width is 6 mils. 
Line length of 12 inches and 18 inches are modeled. 
The FR-4 PC board has the following characteris- 
tics: 
resistivity = 
0.6 mfi/sq. 
(0.1fi/inch 
for 6 mil 
width) 
inductance = 
60 pH/sq. (10 nH/inch for 6 mil 
width) 
capacitance = 0.55 nF/sq. in. (3.3 pF/inch for 6 
mil width) 


The ICC bus is shared by two 82489DXs, one at 
each end. The ICC bus is modeled as a transmission 
line. For the simulation, only one of the 82489DX is 
driving. A pull-up resistor of 2 Kfi is used at each 
end (1 Kfi equivalent value) as shown in Figure 21e. 
Figure 21f shows the signals at each end of the 12 
inch transmission line. Trace 1 is the wave form at 
the driven end and trace 2 is the signal at the receiv- 
ing end of the line. The 2 ns delay between the two 
signals is the propagation delay (or flight time) 
through the 12 inch transmission line. It takes about 
35 ns for the voltage to charge up to 2V. 


Figure 21g shows the received signal with different 
line length and with additional lumped capacitance. 
Trace 1 is for 12 inch only. Trace 2 is for 12 inch with 
additional 20 pF lumped capacitance to represent 
interconnect socket capacitance. Trace 3 is for 18 
inch plus 20 pF. The presence of the 20 pF at each 
end of the 12-inch transmission line increases the 
delay time by 20 ns at 2V. 
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One way to improve the low to high transition time is 
to use a stronger pull-up (smaller resistor value) 
which is possible using external line drivers with their 
larger current drive capabilities. 


In this example (Figure 21h), the ICC bus is a 
12-inch transmission line with four 82489DXs con- 
nected at 4 inch intervals. The loading at each junc- 
tion consisted of the MBI and MBO buffers and a 
20 pF lumped capacitance. 2 Kn pull-ups are at 
each end of the transmission line. 


Figure 21j shows the difference in output when the 
model is used with 300.11pull-ups at each end of the 
transmission line. 


As shown in Figure 21i, it takes more than 90 ns for 
the signal level at both ends to reach 2V. 


4.0 


V 
0 
3.0 


l 


T 


2.0 


1.0 


External 
Drivers/Buffered 
ICC bus 


The 82489DX has separate ICC Bus input (MBI) and 
output (MBO) pins that can be connected to external 
line drivers in systems that has appreciable loading 
on the ICC Bus or where modularity of the bus is 
needed. 


Figure 21k is a typical implementation using external 
drivers with tri-state outputs. Drivers such as 74F125 
or its equivalent can be used. The drivers should be 
placed as close to the MBO pins as possible. The 
input buffer on MBI is optional depending on the us- 


4.0 


V 
0 
3.0 
L 
T 
12" with 2k 


L 
~ 
I 
2.0 


N 


1.0 


ers ICC Bus scheme. The total delay through the 
drivers,.buffers, transmission line, clock skews etc. 
must be calculated to ensure that all the ICC bus 
timing requirements are met. 


A hierarchical bus connection can also be used in 
applications that cannot afford driver/buffer per unit 
and where bus loading are localized in cluster 
groups. Figure 211shows such a connection where 
each cluster group is connected directly and drivers 
are used to connect to other clusters. Each cluster 
group is assumed to be close together physically 
with small loading on the local ICC bus. 
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Transmission 
Line Termination 


As with all high speed designs, one has to consider 
transmission line effects on signals, especially clock 
signals. Even though the ICC bus clock, ICLK is usu- 
ally operated in the 10 MHz range, one has to con- 
sider proper transmission line termination also for 
short rise times. Figure 21m shows the ICLK wave 
form at the end of a 12 inch T-line when driven by a 
clock generator with and without series matched ter- 
mination. 


Series termination should not be used for the ICC 
bus data lines (MBO). The combination of the pull-up 
resistor and series resistor would degrade the output 
low voltage, Vol. For example, with a pull-up of 3000 
and a series termination of 500 at each end, the Vol 
voltage at the receiving end would be at 1.55V if the 
driving end is at OAV (see Figure 21n). 


ICC BUS Operating 
Frequency 


The 82489DX ICC BUS has a design target of oper- 
ating up to 16 MHz (62 ns period). As shown in the 
examples above, the MBO low-to-high transition 
times are strongly dictated by the loads and the pull- 
ups used. This will in turn affect the maximum oper- 
ating frequency of ICLK. 


In general, the minimum period is the larger of 62 ns 
or MBO-to-MBI low data time or MBO-to-MBI high 
data time. 


MBO-to-MBI low time = 
(ICLK skew + MBO valid low delay + T-Iine 
prop.delay + ext. buffer delay + MBI setup 
time) 


Note: 
Input buffer is optional 


MBO-to-MBI high time = 
(ICLK skew + MBO Hi-Z delay + pUll-high 
time + T-Iine prop.delay + ext. buffer delay 
+ MBI setup time) 
I 


Maximum MBO valid low delay = 50 ns 
Maximum MBO H-Z delay = 15 ns 
MBI minimum setup time = 8 ns 


In the example shown earlier where two 82489DXs 
are at each end of a 12-inch T-line with no other 
loads, the pull-high time to 2V is 35 ns (trace 1 in 
Figure 21g). If the ICLK skew is 2 ns, then this con- 
figuration can operate to 62 ns period or 16 MHz. 


If the same configuration has additional 20 pF loads 
at each end, then the pull-high time is 55 ns (trace 2 
in Figure 21g). The maximum frequency decreases 
to 12 MHz (82 ns period). 


In the four device model discussed earlier, where 
the ICC Bus is unbuffered, the pull-high time is 90 ns 
(Figure 21j). The operating frequency will be less 
than 8 MHz (117 ns period). If external buffers are 
used (whereby allowing use of 3000 pull-up) and 
assuming the external buffers have delays of 10 ns, 
the operating frequency is limited by the MBO-to- 
MBIIow time of 72 ns or 14 MHz. 


NOTE: 
Each application is unique in its configuration and 
loading on the ICC Bus. The above examples high- 
lighted some of the factors that need to be consid- 
ered. It is important to do electrical simulation to 
ascertain if the propose implementation is viable 
before committing to the printed circuit board. • 
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This section provides descriptions of the four basic 
cycle timings for the 82489DX, which are: 
Register Write 
Register Read 
Interrupt Acknowledge 
Reset 


In addition, Inter-Unit (nibble) bus timings are pre- 
sented. 


Register access occurs in three distinct phases: 
1. Control Phase, 
2. Address Phase and 
3. Data Phase. 


They always occur in this order, although in some 
cases address and data phases can .occur in the 
same clock cycle, as will be seen in the diagrams. 


NOTE: 
As mentioned previously, the clock signal in all 
these timing diagrams is assumed to be the proces- 
sor clock. 


The 82489DX bus (register access) interface is syn- 
chronous. Unless otherwise noted, setup and hold 
times for inputs, and delay times for outputs are 
measured with respect to a rising clock edge. There- 
fore the timing diagrams contain very few construc- 
tion lines to identify timing parameters and ones that 
exist are explicitly discussed. High and low level for 
signals are obvious; dashed lines at the middle of 
the signal range indicate a tristate output buffer in 
high impedance mode; hashed or cross-hatched 
lines indicate a "don't care" state for inputs. 


Cycle expansion marks-short 
curved lines breaking 
the waveform-appear 
throughout. These appear in 
sets or groups which are identified by construction 
arrows and/or vertical column alignment. Conditions 


resulting in cycle expansion are listed at the bottom 
of each diagram, and the associated set of expan- 
sion 
marks 
indicates 
which 
signals 
must 
be 
stretched for that condition. Signals not so indicated 
are not affected by that condition, and continue with- 
out any cycle stretching. For example, the data 
phase of a transaction may be delayed, stretching all 
data related signals, while address related signals 
can continue to the next cycle. 


Sample points for input signals are marked with 
bold, downward pointing arrows. Since sample tim- 
ing for address, data and cycle definition signals is 
dependent on the timing of related control signals, a 
bar is used on top of the arrow to indicate the signal 
with the independent timing. Each signal group has 
exactly one independent signal. In general, indepen- 
dent signals are sampled on every clock, and there- 
fore must meet setup and hold times on every clock 
edge. Signals having dependent timing, (indicated 
by the arrow with no bar), are only sampled when 
the associated independent signal is active, and 
therefore setup and hold times for dependent sig- 
nals need only be met at the indicated sample 
points. 


For discussion of this bus cycle, refer to Figure 23, 
82489DX Timing Diagram 1. This shows the relation- 
ship between the three phases of the bus cycle. 


The control phase is independently timed by the 
ADS signal. The cycle definition signals [MilO, DIG, 
are dependently sampled with ADS as indicated by 
the bold sample point arrows labeled "C". The cycle 
definition signals will be sampled in the first clock 
when ADS is active (low). The control signal should 
remain stable until ADS goes inactive. For any valid 
82489DX cycle, the memory bus controller should 
ensure that the ADS pulse for a subsequent bus cy- 
cle is NOT presented until after the 82489DX as- 
serts its ROY pin (low) as shown. 


The address phase is independently timed by the 
(BGT) signal, as indicated by the bold sample point 
arrows labeled "A". This signal is actually used an 
address latch enable, however, its name is intended 
to imply that in most cases it can be directly driven 
by the Intel cache controller signal of the same 
name. The BGT pulse may be delayed until the ad- 
dress bus is available, in which case all address and 
data phase signals will be stretched. Note that OLE 
must not occur before BGT. 82489DX does not start 
the internal cycle until BGT is recognized with the 
appropriate chip selec~nal, 
If multiple ADS has 
been issued without BGT and a valid chip se- 


intelQP 


lect, the internal cycle starts with the most recent 
ADS cycle definition preceding the BGT with valid 
chip select. 


NOTE C: 
Address information, including chip select (CS), is 
sampled in the first clock when BGT is active (low), 
and they must remain stable until BGT goes inac- 
tive, OR until ROY is asserted (low), whichever oc- 
curs first. CS should be stable when ADS is active. 


In some configurations, BGT may not be provided, 
and can be permanently tied low. In this case, the 
independent address timing will occur exactly one 
clock after the ADS signal is first sampled low, and 
the dependent address information (address and 
chip select) will be assumed stable at this time. 
82489DX recognize that independent BGT timing is 
not provided by sampling a low state of BGT at the 
time ADS is first sampled low. 


The data phase is independently timed by the OLE 
signal, as indicated by the bold sample point arrows 
labeled "0". In the case of register writes, this signal 
workJQ9!cally like a synchronous data latch enable. 
The OLE pulse may be delayed until the data bus is 
available, in which case data and ROY will be 
stretched. 


NOTE 0: 
Write data are sampled the first clock when OLE is 
active (low), and should remain stable until OLE 
goes inactive, OR until ROY is asserted (low), 
whichever comes first. 


In some configurations, OLE may not be provided, 
and can be permanently tied low. In this case, the 
independent data timing will occur exactly one clock 
after the ADS signal is first sampled low, OR on the 
same clock as BGT is first sampled low, whichever 
occurs later. The data bus will be assumed stable at 
this time. 82489DX recognize that independent OLE 
timing is not provided by sampling a low state of OLE 
at the time ADS is first sampled low. 


Cycle completion is signaled by the ROY signal. Its 
relative positioning on any of the timing diagrams 
does NOT imply the number of clock cycles required 
for an access. ROY is delayed as needed in order for 
the 82489DX to complete the cycle. It is then assert· 
ed (low) for one clock cycle and then deasserted. 
Again, the next ADS cannot start until after ROY has 
been driven low. ADS must return to an inactive high 
state before the next cycle can be issued. It is highly 
recommended not to have, ADS more than one 
clock wide. 
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For discussion of this bus cycle, refer to Figure 24, 
824890X timing Diagram 2. It shows the relationship 
of the three phases of the bus cycle, however, the 
dependent control and address signals are not 
shown here, since they behave exactly as in the 
case of a register write. See the previous section for 
the description of control and address phases of the 
bus cycle. 


In the case of a read when OLEis used (Figure 24A), 
it works logically like an asynchronous, output data 
enable. The 824890X drives the data bus within 
time delay "A" after OLE is asserted, which must not 
occur before BGT. Note that even though the bus is 
being driven, the data only becomes valid during the 
clock cycle in which ROY is asserted, after that 
~t, 
valid data is maintained on the bus as long as 
OLE remains asserted, after which the data bus re- 
turns to high impedance state within time delay "B" 
of 
OLE deassertion. 
If 
OLE is 
asserted 
late, 


824890X could complete its internal read cycle and 
return ROY early. In that case, ROY active low state 
will be maintained until OLE is asserted. 


In the case of a read when OLE is NOT used (Figure 
24B) the data is driven for exactly one clock cycle, 
coincident with ROY being asserted. 


OLE is sampled with the control signals to determine 
whether it is being used. If sampled in the asserted 
state when ADS is active, the OLE will be consid- 
ered not used, and its state during the remainder of 
the cycle doesn't matter. This is consistent with the 
notion of permanently tying this signal low when not 
used, as described in the previous section. 


Indication of the end of the bus cycle is dependent 
on the use of OLE. When it is not used, (Le., perma- 
nently tied to ground) ROY indicates the end of the 
bus cycle, as it does in the case of write ~ccess. 
When it is used, OLE deassertion indicates the end 
of the cycle, since the 824890X could be driving the 
bus well after ROY is deasserted. In either case, the 
824890X can accept the next ADS pulse anytime 
after ROY has been asserted. However, note that if 
OLE is being used, the next ADS should be delayed 
until OLEcan be safely sampled inactive. Note these 
two options for earliest next cycle in the timing dia- 
gram. 


For discussion of this bus cycle, refer to Figure 25 
824890X timing diagram 3, This cycle is the result of 
the 824890X posting an interrupt to the processor 
by asserting PINT. After PINT is asserted, other bus 
cycles may occur before the interrupt acknowledge 
cycle. 


PINT can be asserted for any external (8259) inter- 
rupt as shown in Figure 25A, or for an 824890X gen- 
erated interrupt as shown in Figure 25B. ExtlNTA 
indicates whether PINT was asserted in response to 
an 8259 request or an 824890X request. This signal 
is used by external control logic to either allow or 
preclude the 8259 from responding to the subse- 
quent interrupt acknowledge cycle. It should be not- 
ed that ExtlNTA pin gets deactivated at third clock 
after the ADS of the second INTA cycle. 


When ExtlNTA is high, the 824890X will not respond 
to the acknowledge cycle other than to deassert 
PINT signal, and clear the pendi~xternal 
interrupt 


two full clock cycles after the ADS of the second 
INTA cycle, as shown in Figure 25A. Once PINT is 
asserted in response to an external interrupt, it can 
only be deasserted by an INTA cycle. An INTA cycle 
is recognized by 824890X as soon as the bus cycle 
definition is sampled with ADS in a low state. BGT is 
not needed in this case. 


When ExtlNTA is low, the 824890X will respond to 
the acknowledge cycle, as shown in Figure 25B. In 
this case, external logic (e.g., the memory bus con- 
troller) is expected to prevent any attached 8259 
from seeing the acknowledge cycle. When PINT is • 
asserted in response to an 824890X internal inter- 
~ 


rupt, it can be deasserted by an INTA cycle. The 
PINT signal will be deasserted 5 full clocks after 
BGT of the second INTA cycle. 


Note that ExtlNTA is stable at all times while PINT is 
asserted. That means that even if new interrupts ar- 
rive between the time an interrupt is posted to the 
processor, 
and 
the 
acknowledge 
occurs, 
the 


824890X will not change its commitment for an ex- 
ternal (8259) or internal (824890X) acknowledge cy- 
cle, regardless of priority. This also means that PINT 
may be raised for a high priority internal interrupt 
right after responding to the external Interrupt. In any 
event, PINT will be kept low for a minimum of two 
clocks before reasserting itself. 


The interrupt acknowledge cycle is indicated by the 
bus cycle definition signals all being low, and looks 
like two consecutive read cycles, except that there is 
no explicit address information. The actual content 
of the address pins during this cycle is processor 
dependent, and therefore there is no chip select ei- 
ther. Chip select is implied by a combination of the 
bus cycle definition signals (all low) and BGT. 


Note that there is a "dummy" data phase in the first 
interrupt acknowledge cycle. This allows parity to be 
generated on the bus for processors like i860XP. 
During this cycle, 824890X drives random data on 
the bus with the appropriate parity. The interrupt Re- 
quest Register is "frozen" and the highest priority 


pending interrupt vector is returned to the processor 
in the second acknowledge cycle. 


The second acknowledge cycle has a complete data 
phase with timings identical to those of an ordinary 
register read. The data returned is the vector of the 
highest priority internally pending 82489DX interrupt, 
or the spurious interrupt vector, if there is no inter- 
rupt pending higher than the current processor prior- 
ity. 


Note that the timing diagram shows OLE being used 
(sampled high during ADS). However, just as in a 
normal read cycle, the option exists not to use OLE 
(Le., permanently tied to ground). 


RESET AND~MISCELLANEOUS TIMING 


For discussion of this bus cycle, refer to Figure 26 
82489DX Timing Diagram 4. It shows the 82489DX 
reset cycle, the timing of some related signals, and 
the ICC bus. The RESET input has a setup and hold 
time to the system clock edge, CLKIN, as do other 
independently timed signals, The RESET signal will 
reset the two asynchronous system on the chip, 
namely the ICC bus unit running synchronously to 
the ICLK and all the other unit running synchronous 
to the system clock, CLKIN. RESET must meet the 
minimum reset time with respect to both clocks and 
there should be at least one ICLK rising edge during 
reset. The TAP controller should also be initialized. 


During reset, an eight-bit 82489DX Local Unit 10 can 
be optionally initialized. Eight address lines, A10-A3 
are sampled on every clock edge while RESET is 
asserted. The last sample remains in the 82489DX 
Local Unit 10 register after reset. Alternatively, the 
82489DX Local Unit 10 can be loaded with a register 
write 
as 
part 
of 
software 
initialization, 
before 
82489DX operation is started. In any event, the reg- 
ister must be initialized before the 82489DX can 
communicate on the ICC bus, including sending/re- 
ceiving 
RESET 
messages. 
All 
valid 
signal 
to 
82489DX should wait at least two full clocks after 
RESET is deasserted. 


The PRST signal (reset output) is asserted both with 
RESET input, or under software control. Its on-off 
delay times are relative to the rising clock edge. The 
duration of PRST under software control is defined 
by the software itself. Also note that the PNMI pin 
has the same timing as does PRST when the latter 
is software controlled. 


The ICC bus signals are both input and output on 
each cycle. Setup, hold and delay times are all mea- 
sured with respect to the ICC bus Clock ICLK which 
has no relationship to the Processor clock on which 
the remainder of the 82489DX runs. This means 


that the ICC bus is independently sampled on each 
ICLK edge, as shown. It also implies that largest 
possible hold time will not exceed the minimum de- 
lay time. 


After reset, all 82489DX registers are reset to "0" 
state. The mask bits in the local vector table and the 
redirection table are reset to "1" state to mask out 
all interrupts. All reserved bits are all wired to "0" 
state permanently on chip. 


10.0 
BOUNDARY SCAN 
DESCRIPTION 


The 82489DX is equipped with the JTAG boundary 
scan standard. This feature allows the user to test 
the interconnections between 82489DX and the ex- 
ternal hardware once they have been assembled 
onto a printed circuit board or other substrate. In 
addition to the JTAG mandatory instruction set, 
82489DX also 
provides the 
INTEST instruction 
which allows static testing of the on-chip logic. 


The detailed information related to the IEEE Std 
1149.1-1990 (the JTAG standard) can be obtained 
from the reference document IEEE Standard Test 
Access Port and Boundary Scan Architecture (IEEE 
Std 1149.1-1990). 


10.1 Boundary Scan Architecture 


The boundary scan logic contains the following ele- 
ments: 
- 
Five Test Access 
Ports (TAP): They are la- 
beled as trst, tck, tdi, tdo and tms. All ports are 
input pins except tdo, which is a tri-state output 
pin. 


- 
A TAP Controller: 
The logic is used to control 
the boundary scan activity. 
- 
82489DX Device 10 Register: This is a 32-bit 
read-only register. The DID can be shifted out in 
ascending order to the tdo pin. 


- 
JTAG Instruction 
Register (IR): This is a 4-bit 
register which accepts instruction code shifted in 
from the tdi pin. The opcode stored in the IR reg- 
ister is used to control operation. 


- 
Boundary Scan Register: This is a 137 stages 
scan path which connects almost all 82489DX 
signal pins for boundary scan purposes. 
- 
Bypass Register: This register simply allows the 
data which goes into tdi pin to be shifted out di- 
rectly from tdo. 


The following block diagram illustrates the imple- 
mentation of the JTAG architecture in the 82489DX 
design. 
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Test Access Ports 


trst 
TAP controller master reset pin. When trst is 
low, the TAP controller's state machine will be 
reset to "test-logic-reset" state asynchronous- 
ly. This pin is tied to a weak internal pull-up for 
keeping to be a logical 1 when not driven. 


tck 
This is the test logic clock. The test logic will 
change state on the rising edge of the tck. 
tdi 
Test data input. Data is shifted into the tdi pin 
on the rising edge of tck. This pin is tied to a 
weak internal pull-up for keeping it to be a logi- 
cal 1 when not driven. 
tms 
Test mode select. This pin is used to select the 
state of the TAP controller. This pin is synchro- 
nous to the rising edge of the tck. This pin is 
tied to a weak internal pull-up for keeping it to 
be a logical 1 when not driven. 


tdo 
Test data output. This is a tri-state pin which 
allows the data to be shifted out. 


The TAP controller in 82489DX is implemented to 
conform the IEEE1149.1 standard. The TAP control- 
ler is a single phase clock, synchronous finite state 
machine. It controls the sequence of the operation 
of the test logic. 


The value of the test mode state (tms) pin at a rising 
edge of tck controls the sequence of the state 
changes. The state diagram for the TAP controller is 
shown in Figure 28. Test designers must consider 
the operation of the state machine in order to have 
the correct sequence of value to drive on tms. 


The behavior of the TAP controller and other test 
logic in each of the controller states is briefly de- 
scribed as follows. 


The test logic is disabled so that normal operation of 
the on-chip system logic (Le., in response to stimuli 
received through the system pins only) can continue 
unhindered. This is achieved by initializing the in- 
struction register to contain the IOCOOEinstruction. 
No matter what the original state of the controller, it 
will enter Test-Logie-Reset when tms is held high for 
at least five rising edges of tck. The controller re- 
mains in this state while tms is high. 


If the controller should leave the Test-Logie-Reset 
controller state as a result of an erroneous low sig- 
nal on the tms line at the time of the rising edge on 
tck (for example, a glitch due to external interfer- 


ence), it will return to the Test-Logie-Reset state fol- 
lowing three rising edges of tck with the tms line at 
the intended high logic level. The operation of the 
test logic is such that no disturbance is caused to 
on-chip system logic operation as the results of such 
an error. On leaving the Test-Logie-Reset controller 
state, the controller moves into the Run-Test/Idle 
controller state where no action will occur because 
the current instruction has been set to select opera- 
tion of the device identification register. The test log- 
ic is also inactive in the Select-OR-Scan and Select- 
IR-Scan controller states. 


Note that the TAP controller will also be forced to 
the Test-Logie-Reset controller state asynchronous- 
ly by applying a low logic level at trst. 


A controller state between scan operations. Once 
entered, the controller will remain in the Run-Test/ 
Idle state as long as tms is held low. When tms is 
high and a rising edge is applied at tck, the controller 
moves to the Select-OR-Scan state. 


In the Run-Test/Idle controller state, activity in se- 
lected test logic occurs only when certain instruc- 
tions are present such as RUNBIST. Since 824890X 
does not have RUNBIST instruction, this state is act- 
ing like an idle state. 


The instruction does not change while the TAP con- 
troller is in this state. 


This is a temporary controller state in which all test 
data register (824890X has one test data register 
which is the boundary scan shift registers path) se- 
lected by the current instruction retain their previous 
state. 


If tms is held low and a rising edge is applied to tck 
when the controller is in this state, then the control- 
ler moves into the Capture-DR state and a scan se- 
quence for the selected test data register is initiated. 
If tms is held high and a rising edge is applied to tck, 
the controller moves on to the Select-IR-Scan state. 


The instruction does not change while the TAP con- 
troller is in this state. 


This is a temporary controller state in which all test 
data registers selected by the current instructing re- 
tain their previous state. 


If tms is held low and a rising edge is applied to tck 
when the controller is in this state, then the control- 
ler moves into the Capture-IR state and a scan se- 
quence for the instruction register is initiated. If tms 
is held high and a rising edge is applied to tck, the 
controller returns to the Test-Logic-Reset state. The 
Instruction 
does not change 
while the TAP con- 
troller 
Is In this state 


In this controller state data may be parallel-loaded 
into test data registers selected by the current in- 
struction on the rising edge of tck. If a test data reg- 
ister selected by the current instruction does not 
have a parallel input, or if capturing is not required 
for the selected test, then the register retains its pre- 
vious state unchanged. The Instruction does not 
change while the TAP controller Is In this state. 


When the TAP controller is in this state and a rising 
edge is applied to tck, the controller enters either the 
Exit 1-0R state if tms is held at 1 or the Shift-DR 
state if tms is held at O. 


In this controller state, the test data register con- 
nected between tdi and tdo as a result of the current 
instruction shifts data one stage towards its serial 
output on each rising edge of tck. Test data registers 
that are selected by the current instruction, but are 
not placed in the serial path, retain their previous 
state 
unchanged. 
The 
Instruction 
does 
not 
change while the TAP controller Is In this state. 


When the TAP controller is in this state and a rising 
edge is applied to tck, the controller enters either the 
Exit 1-0R state if tms is held at 1 or remains in the 
Shift-DR state if tms is held at O. 
. 


This is a temporary controller state, if tms is held 
high, a rising edge applied to tck while in this state 
causes the controller to enter the Update-DR state, 4 
which terminates the scanning process. If tms is 
held low and a rising edge is applied to tck, the con· 
troller enters the Pause-DR state. All test date reg- 
isters selected 
by the current 
Instructions 
retain 
their previous 
state 
unchanged. 


The instruction does not change while the TAP con- 
troller is in this state. 


This controller state allows shifting of the test data 
register in the serial path between tdi and tdo to be 
temporarily halted. All test data registers selected by 
the current instruction retain their previous state un- 
changed. 


The controller remains in this state while tms is low. 
When tms goes high and a rising edge is applied to 
tck, the controller moves on to the Exit 2-0R state. 
The instruction does not change while the TAP con- 
troller is in this state. 


This is a temporary controller state. If tms is held 
high and a rising edge is applied to tck while in this 
state, the scanning process terminates and the TAP 
controller enters the Update-DR controller state. If 
tms is held low and a rising edge is applied to tck, 
the controller enters the Shift-DR state. 


All test data registers selected by the current in- 
struction retain their previous state unchanged. The 
instruction does not change while the TAP controller 
is in this state. 


Some test date registers may be provided with a 
latched parallel output to prevent changes at the 
parallel output while data is shifted in the associated 
shift-register path in response to certain instructions 
(e.g., EXTENT, INTEST, and RUNBIST). Data is 
latched onto the parallel output of these test data 
registers from the shift-register path on the falling 
edge of tck in the Update-DR controller state. The 
data held at the latched parallel output should not 
change other than in this controller state unless op- 
eration during the execution of a self test is required 
(e.g., during the Run-Test/Idle controller state in re- 
sponse to a design-specific public instruction). 


All shift-register stages in test data registers select- 
ed by the current instruction retain their previous 
state 
unchanged. 
The 
Instruction 
does 
not 
change while the TAP controller 
Is In this state. 


When the TAP controller is in this state and a rising 
edge is applied to tck, the controller enters either the 
Select-DR-Scan state if tms is held at 1 or the Run- 
Test/Idle state if tms is held at O. 


In this controller state the shift-register contained in 
the instruction register loads a pattern of fixed logic 
values on the rising edge of tck. In addition, design- 
specific data may be loaded into shift-register stages 
that are not required to be set to fixed values. 


Test data registers selected by the current instruc- 
tion retain their previous state. The instruction does 
not change while the TAP controller is in this state. 


When the TAP controller is in this state and rising 
edge is applied to tck, the controller enters either the 
Exit 1-IR state tms is held at 1 or the Shift-IR state if 
tms is held at O. 


In this controller state the shift-register contained in 
the instruction register is connected between tdi and 
tdo and shifts data one stage towards its serial out- 
put on each rising edge of tck. 
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Test data registers selected by the current instruc- 
tion retain their previous state. The instruction does 
not change while the TAP controller is in this state. 
When the TAP controller is in this state and a rising 
edge is applied to tck, the controller enters either the 
Exit1-IR state if tms is held at 1 or remains in Shift-IR 
state if tms is held at O. 


This is a temporary controller state. If tms is held 
high, a rising edge applied to tck while in this state 
causes the controller to enter the Update-IR state, 
which terminates the scanning process. If tms is 
held low and a rising edge is applied to tck, the con- 
troller enters the Pause-IR state. 


Test data registers selected by the current instruc- 
tion retain their previous state. The instruction does 
not changes while the TAP controller is in this state 
and the instruction register retains its state. 


This controller state allows shifting of the instruction 
register to be halted temporarily. 


Test data registers selected by the current instruc- 
tion retain their previous state. The instruction does 
not change while the TAP controller is in this state 
and the instruction register retains its state. 


The controller remains in this state while tms is low. 
When tms goes high and a rising edge is applied to 
tck, the controller moves on to the Exit 2-IR state. 


This is a temporary controller state. If tms is held 
high and a rising edge is applied to tck while in this 
state, termination of the scanning process results, 
and the TAP controller enters the Update-IR control- 
ler state. If tms is held low and a rising edge is ap- 
plied to tck, the controller enters the Shift-IR state. 


Test data registers selected by the current instruc- 
tion retain their previous state. The instruction does 
not change while the TAP controller is in this state 
and the instruction register retains its state. 


The instruction shifted into the instruction register is 
latched onto the parallel output from the Shift-regis- 
ter path on the falling edge of tck in this controller 
state. Once the new instruction has been latched, it 


becomes the current instruction. Test 
data 
regIs- 
ters 
selected 
by the 
current 
Instruction 
retain 
their previous 
state. 


When the TAP controller is in this state and a rising 
edge is applied to tck, the controller enters the Se- 
lect-OR-Scan states if tms is held at 1 or the Run- 
Test/Idle state if tms is held at O. 


The function of the instruction register is to select 
the operating mode of the test logic. For instance, 
read the 10 register, or capture the 824890X output 
signals. 824890X has implemented 4 instructions. 


Instruction 
Mandatory IOptional 
Opcode 


bypass 
m 
1 1 1 1 


extest 
m 
0000 


sample/preload 
m 
0001 


idcode 
m 
0010 


reserved 
0 
1001 


Bypass 
Instruction 


The bypass instruction selects the bypass register to 
be connected to tdi and tdo, effectively bypassing 
the test logic on the 824890X boundary scan path 
and reducing the shift length to be on one bit. Note 
that an open circuit fault in the board level test data 
path will cause the bypass register to be selected 
following an instruction scan cycle due to the inter- 
nal pull-up on the tdi pin. This has been done to 
prevent any unwanted interference with the proper 
operation of the system logic. 


The extest instruction allows testing of circuitry ex- 
ternal to the component package, typically board in- 
terconnects. It does so by driving the values loaded 
into the 824890X's boundary scan register out on 
the output pins corresponding to each boundary 
scan cell and capturing the values on 824890X's 
input pins to be loaded into their corresponding 
boundary scan register locations. I/O pins are se- 
lected as input or output depending on the value 
located into the output control cell. Values shifted 
into input latch in the boundary scan register are 
never used by the internal logic of the 824890X. 


NOTE: 
824890X must be reset after extest instruction has 
been executed. 


Sample/Preload 
Instruction 


The sample/preload instruction has two functions 
that it can perform. When the TAP controller is in the 
CAPTURE-DR state, the sample/preload instruction 
allows a snap-shot of the normal operation of the 
824890X without interfering with that normal opera- 
tion. The instruction causes boundary scan register 
cells associated with outputs to sample the value 
being driven into the 824890X. On both outputs and 
inputs the sampling occurs on the rising edge of tck. 
When preloads data into the 824890X pins to be 
driven to the board by executing the extest instruc- 
tion. Data is preloaded to the pins from the boundary 
scan register on the falling edge of tck. 


The idcode instruction selects the device identifica- 
tion register to be connected to tdi and tdo, allowing 
the device 10code to be shifted out of the device on 
tdo. Note that the bit stream shifted into tdi will ap- • 
pear on tdo after all 32 bits of the DID has been 
• 
shifted out. 


The device identification is a 32 bits number which 
can be read by the external hardware by using the 
idcode instruction. The 824890X device 10 is as- 
signed to 1489A013 (hex). This is subject to change. 
The upper 4 bits of DID may be changed for different 
version. The 16-bit number (bit 27-bit 
12) 489A 
(hex) is the part 10.The lower 12 bits are the manu- 
facturer 10for Intel which must be 013 (hex). 


824890X has only one test data register, I.e., the 
boundary scan register. The boundary scan register 
is a single shift register path containing the boundary 
scan cells that are connected to all signal input and 
output pins of the 824890X. There are three generic 
type of boundary scan cells-input, 
output, and bi-di- 
rectional. For each input only cell, one stage of shift 
register is added to the boundary scan path. 


All 
output 
pins will 
become 
tri-stateable 
when 
boundary scan is activated, regardless whether they 
are tri-stateable or not in the normal operation. To 
explain further, the user will enable/disable an out· 
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put driver with a specific tri-state control cell in the 
scan path. The user must shift in a proper control 
signal for these tri-state control cells in the scan 
path. 


82489DX 


INTERNAL 


LOGIC 


I 
i/o 
pin 
I 
II 
II 
II 
II 
I 


I 
output 
pin 
I 
I 
I 


BOUNDARY 
SCAN CELL 
NAMES 
IN ORDER 
FROM tdi TO tdo 


The following 
table is a list of the boundary 
scan cell 
names in the order from tdi to tdo. The type informa- 
tion indicates 
the purpose 
of the cells. 


I = input only cell 


B = bi-directional 
cell 


T = tri-state 
output 
cell 


e = tri-state 
control. 
Note that the signal 
name 
enclosed 
within 
the 
parenthesis 
is 
con- 
trolled 
by this cell. 


Cell Number 
Type 
Name 
Pin # 


1 
I 
eLKIN 
57 


2 
I 
TMBASE 
59 


3 
I 
leLK 
60 


4 
I 
DC 
61 


5 
I 
WR 
62 


6 
I 
MilO 
63 


7 
I 
ADS 
64 


8 
I 
RESET 
65 


9 
I 
BGT 
66 


10 
I 
reserved 
70 


11 
I 
reserved 
71 


12 
I 
reserved 
72 


13 
I 
OLE 
73 


14 
I 
es 
74 


15 
I 
reserved 
75 


16 
I 
MBI3 
76 


17 
I 
MBI2 
77 


18 
I 
MBI1 
78 


19 
I 
MBIO 
79 


20 
I 
L1NTIN1 
80 


21 
I 
L1NTINO 
81 


22 
T 
reserved 


23 
e 
(reserved) 


24 
I 
INTIN15 
82 


25 
I 
INTIN14 
83 


26 
T 
reserved 


27 
e 
(reserved) 


CellNumber 
Type 
Name 
Pin # 


28 
I 
INTIN13 
84 


29 
I 
INTIN12 
85 


30 
T 
reserved 


31 
e 
(reserved) 


32 
I 
INTIN11 
86 


33 
I 
INTIN10 
87 


34 
T 
reserved 


35 
e 
(reserved) 


36 
I 
INTIN9 
88 


37 
I 
INTIN8 
89 


38 
T 
reserved 


39 
e 
(reserved) 


40 
I 
INTIN7 
90 


41 
I 
INTIN6 
91 


42 
T 
reserved 


43 
e 
(reserved) 


44 
I 
INTIN5 
92 


45 
I 
INTIN4 
93 


46 
T 
reserved 


47 
e 
(reserved) 


48 
I 
INTIN3 
94 


49 
I 
INTIN2 
95 


50 
T 
reserved 


51 
e 
(reserved) 


52 
I 
INTIN1 
96 


53 
I 
INTINO 
97 


54 
I 
reserved 


55 
B 
OP3 
101 


56 
e 
(OP[3:0]) 


57 
B 
OP2 
102 


58 
B 
OP1 
103 


59 
B 
OPO 
104 


60 
T 
reserved 


61 
e 
(reserved) 


62 
B 
031 
105 


63 
e 
(0[31 :0]) 


64 
B 
030 
107 


Cell Number 
Type 
Name 
Pin # 


65 
B 
029 
109 


66 
B 
028 
110 


67 
T 
reserved 


66 
C 
(reserved) 


69 
B 
027 
111 


70 
B 
026 
112 


71 
T 
reserved 


72 
C 
(reserved) 


73 
B 
025 
114 


74 
B 
024 
115 


75 
B 
023 
116 


76 
T 
reserved 


77 
C 
(reserved) 


78 
B 
022 
118 


79 
B 
021 
119 


80 
B 
020 
121 


81 
B 
019 
122 


82 
B 
018 
123 


83 
B 
017 
124 


84 
B 
016 
125 


85 
B 
015 
128 


86 
B 
014 
129 


87 
B 
013 
130 


88 
B 
012 
131 


89 
B 
011 
2 


90 
T 
reserved 


91 
C 
(reserved) 


92 
B 
010 
3 


93 
B 
09 
4 


94 
T 
re$erved 


95 
C 
(reserved) 


96 
B 
08 
7 


97 
B 
07 
8 


98 
B 
06 
9 


99 
B 
05 
11 


100 
B 
04 
12 


101 
B 
03 
13 


- 


Cell Number 
Type 
Name 
Pin # 


102 
B 
02 
14 


103 
B 
01 
)6 


104 
B 
00 
18 


105 
I 
reserved 
19 


- B 
reserved 
20 
106 


107 
C 
(ce1l106, A[10:31) 


108 
B 
A10 
21 


109 
B 
A9 
22 


110 
B 
A8 
24 


111 
B 
A7 
26 


112 
B 
A6 
27 


113 
B 
A5 
28 


114 
B 
A4 
29 


115 
B 
A3 
31 


116 
T 
reserved 
34 


117 
C 
reserved 


118 
T 
PINT 
35 


119 
C 
(PINT) 


120 
T 
PNMI 
37 


121 
C 
(PNMI) 


122 
T 
PRST 
38 


123 
C 
(PRST) 


124 
T 
ExtlNTA 
41 


125 
C 
(reserved) 


126 
T 
reserved 
42 


127 
C 
(reserved) 


128 
T 
ROY 
43 


129 
C 
(ROY) 


130 
T 
MB03 
45 


131 
C 
(MB03) 


132 
T 
MB02 
48 


133 
C 
(MB02) 


134 
T 
MB01 
49 


135 
C 
(MB01) 


136 
T 
MBOO 
51 


137 
C 
(MBOO) 


The bypass register is simply a 1-bit shift register 
which connects between the tdi and tdo. When se- 
lected by using the bypass instruction, the data shift- 
ed into tdi will be shifted out from tdo one tck clock 
later. 


The TAP controller must be reset to test-logic-reset 
state when 82489DX is first powered up. There are 
two ways to reset the TAP controller: 


1. Assert trst to be 0, it will reset the TAP controller 
asynchronously. 


2. Assert tms to be 1, and clock the TAP controller 


at least five times, the TAP controller will be reset 
after the fifth rising edge of the tck. 


After reset, the idcode instruction is loaded into the 
IR automatically. 


Note that the tms and trst pins both have an internal 
weak pull-up device to keep them to be logic 1 leve:. 
Therefore the user can simply apply 5 clocks at the 
tck input to reset the TAP controller. If the TAP con- 
troller is not reset properly, 82489DX may not func- 
tion because the boundary scan logic might be ac- 
tive which will impact the signals flow in and out to 
the chip. 
• 
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11.0 
ELECTRICAL 
CHARACTERISTICS 


NOTICE: This data sheet 
contains 
information 
on 
products in the sampling and initial production phases 
of development. 
The specifications 
are subject 
to 
change 
without 
notice. 
Verify with your local 
Intel 
Sales office that you have the latest data sheet be- 
fore finalizing a design . 


• WARNING: Stressing the device beyond the "Absolute 
Maximum 
Ratings" 
may 
cause 
permanent 
damage. 


These are stress ratings 
only. Operation 
beyond 
the 
"Operating 
Conditions" 
is not recommended 
and ex- 
tended 
exposure 
beyond 
the 
"Operating 
Conditions" 


may affect device reliability. 


Case Temperature 
Under Bias 
- 65·C to + 11O·C 


Storage Temperature 
- 65·C to + 150·C 


Voltage 
on Any Pin 
with Respect 
to Ground 
-0.5 
to VCC + 0.5 


Symbol 
Parameter 
Mln (ns) 
Max 
Units 
Notes 


VIL 
Input LOW Voltage 
(TIL) 
-0.3 
+0.8 
V 


VIH 
Input HIGH Voltage 
(TIL) 
2.0 
VCC + 0.3 
V 


VOL 
Output LOW Voltage 
(TIL) 
+0.45 
V 
(Note 1) 


VOH 
Output HIGH Voltage 
(TIL) 
2.4 
V 
(Note 2) 


Icc 
33 MHz Power Supply Current 
200 
mA 


III 
Input Leakage Current 
15 
IJ-A 


ILL 
Input Leakage Current 
-600 
IJ-A 
(Note 5) 


ILH 
Output Leakage Current 
600 
IJ-A 
(Note 4) 


ILO 
Output Leakage Current 
15 
IJ-A 
(Note 3) 


CIN 
Input Capacitance 
3 
pF 


Co 
1/0 or Output Capacitance 
6 
pF 


CCLKIN 
Clock Capacitance 
3 
pF 


IMLO 
ICC Bus Output 
Low Current 
4 
mA 
(Note 6) 


CMC 
ICC Bus Total Capacitance 
100 
pF 


VMH 
ICC Bus Input High (TIL) 
2.0 
VCC + 0.3 
V 


VML 
ICC Bus Input Low (TIL) 
-0.3 
+0.8 
V 


NOTES: 
1. This parameter is measured with current load of 4 mA. 
2. This parameter is measured with current load of 1.0 mA. 
3. This parameter is for output without pulldown. 
4. This parameter is for tri-state output with pulldown and VOH = 3.0V. 
5. This parameter is for input with pullup at VIL = OV. 
6. ICC bus output low current is measured at 0.6V. 
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A.C. Parameters Referencing 33 MHz System Clock 
vcc 
= 5V ±5%; 
Tc = O°C to +85°C 


Symbol 
Parameter 
Ref. 
Load 
Mln 
Max 
Notes 
Fig. 
(pF) 
(ns) 
(ns) 


te 
CLKIN Period 
30 
30 
100 
(Note 1) 


t1 
CLKIN High Time 
30 
5 


t2 
CLKIN Low Time 
30 
5 


t3 
CLKIN Rise Time 
30 
3 
(Note 2) 


t4 
CLKIN Fall Time 
30 
3 
(Note 2) 


t5 
ADS, SGT, OLE, MilO, 
DIG, WIR, CS 
31 
8 
Setup Time 


t6 
031-00, 
OP3-0PO, 
A9-A3 
Setup Time 
31 
8 


t8 
ADS, SGT, OLE, MilO, 
DIG, WIR, CS 
31 
5 
Hold Time 


t10 
031-00, 
OP3-0PO, 
A9-A3 
Hold Time 
31 
5 


t11 
031-00, 
OP3-0PO, 
Valid Delay 
30 
50 
18 


t12 
031-00, 
OP3-0PO, 
Low-Z Delay 
32 
50 
3 
(Note 7) 
When OLE is Not Used 


t13 
031-00, 
OP3-0PO, 
High-Z Delay 
32 
50 
14 
(Note 7) 
When OLE is Not Used 


t14 
031-00, 
OP3-0PO 
Enable Delay 
33 
50 
3 
42 
When OLE is Used 


t15 
031-00, 
OP3-0PO 
Disable Delay 
33 
50 
3 
14 
When OLE is Used 


t20 
ROY Valid Delay 
30 
50 
3 
18 


t21 
PRST, PNMI, PINT Valid Delay 
30 
50 
3 
34 


t22 
RESET Setup Time 
31 
8 
(Note 5) 


t23 
RESET Hold Time 
31 
5 
(Note 5) 


RESET Cyele Time 
5 te 
(Note 3) 


1tie 
(Note 3) 


t24 
INTIN[15:0J, 
L1NTIN[1:0] 
Low Time 
10 
(Note 6) 


All parameters 
are given 
in nanoseconds. 


TIL 
Level timing 
is measured 
at 1.5V for both "0" 
and "1" 
levels. 


NOTES: 
1. ICC bus clock 
ICLK 
period 
must 
be at least 
5 ns longer 
than 
system 
clock 
CLKIN 
for proper 
synchronization 
of the 
internal 
asynchronous 
signals. 
2. System 
clock 
CLKIN 
measured 
from 0.8V-2.0V. 


3. Minimum 
Reset 
cycle 
is the greater 
of the two cycle times. 
4. Minimum 
pulse width 
must be met for valid level to be attained 
on the DATA 
or ADDRESS 
output. 


5. Set up and hold time is required 
for RESET 
to start at the next rising edge of the clock. 


6. INTIN and UNTIN 
low time is measured 
from 
1.5V of the falling 
edge to 1.5V of rising edge. 
7. Not 100% 
tested. 
Guaranteed 
by design 
characterization. 


• 
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Time Base A.C. Parameters 
vcc 
= 5V ±5%; 
Tc = O°C to +85°C 


Symbol 
Parameter 
Ref. Fig. 
Mln (n8) 
Max (n8) 
Note 


tmc 
TMBASE 
Period 
35 
40 
10000 


t30 
TMBASE 
High Time 
35 
10 


t31 
TMBASE 
Low Time 
35 
10 


t32 
TMBASE 
Rise Time 
35 
8 


t33 
TMBASE 
Fall Time 
35 
8 


Symbol 
Parameter 
Ref. Fig. 
Mln (n8) 
Max (n8) 
Note 


ttc 
TCK Period 
35 
40 
1000 


t50 
TCK High Time 
35 
10 


t51 
TCK Low Time 
35 
10 


t52 
TCK Rise Time 
35 
8 


t53 
TCK Fall Time 
35 
8 


t54 
TDI, TMS, TRST Setup Time 
34 
10 


t55 
TOI, TMS, TRST Hold Time 
34 
5 


t56 
TOO VALID Delay 
34 
5 
24 
(Note 1) 


t57 
Output Delay in EXT est in EXTEST Mode 
34 
5 
27 
(Note 1) 


t58 
TRST Minimum 
Low Time 
10 
(Note 2) 


All parameters are given in nanoseconds. 
TIL level timing is measured at 1.5V for both "0" and "1" levels. 


NOTES: 
1. These parameters are specified for 50 pF load. 
2. This parameter is measured at 1.5V between the rising and falling edges. 
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A.C. Parameters for ICC Bus 
vcc = 5V ±5%; Tc = O°Cto +85°C 


Symbol 
Parameter 
Ref. Fig. 
Mln (n8) 
Max (n8) 
Notes 


tic 
IClK Period 
35 
60 
(Note 1) 


t40 
IClK High Time 
35 
20 


t41 
IClK low Time 
35 
20 


t42 
IClK Rise Time 
35 
10 


t43 
IClK Fall Time 
35 
10 


t44 
MBI3-MB10 Setup Time 
36 
8 


t45 
MBI3-MB10 Hold Time 
36 
5 


t46 
MB03-MBOO VALID low Delay 
36 
50 
(Note 2) 


t47 
MB03-MBOO VALID High·Z Delay 
36 
5 
15 
(Note 3) 


t48 
MB03-MBOO VALID low-Z Delay 
36 
12 
25 
(Note 3) 


All parameters 
are given 
in nanoseconds. 
TIL 
level timing 
is measured 
at 1.5V for both "0" 
and "1" 
levels. 


NOTES: 
1. MB13-0 
and MB03-0 
timing 
is tested 
at 150 ns cycle time. 
2. This parameter 
is specified 
for 50 pF load. 
3. Not 100% 
tested. 
Guaranteed 
by design 
characterization. 


82489DX registers can be located at any 1 Kbyte 
boundary in either memory or I/O space for as far as 
the 82489DX architecture itself is concerned. From 
a platform standard point of view, it is recommended 
to locate all 82489DX local Units in memory space 
at address OxFEEQ-OOOO.It is further recommend- 
ed that all 82489DX I/O Units also be located in 
memory space; I/O Unit 1 at address OxFECo- 
0000, I/O Unit 2 (if present) at address OxFECo- 
1000, and so on. Chip select for the 82489DX 
should be based on a full decode of address pins 
A31-A10. 


All directly accessible 82489DX registers are 32 bits . 
wide and are aligned at 128-bit boundaries. The reg- 
ister being accessed is determined by bits 4 through 
9 of the address. This is listed in the tables below. 


Addresses not listed are reserved by the architec- • 
ture. The tables also show whether the register is 
readable and/or writable by software, and what the 
side effects are of software accessing the register. 


After reset, all registers are initialized to all zeroes 
with the following exceptions, The local Unit ID field 
is initialized with data present on the 8 lSB address 
pins. The Mask bit is initialized to 1 ("masked" state) 
in all entries in both the local vector table and the 
redirection table. 


For the I/O Unit, only the I/O register select and I/O 
window registers are directly accessible in the ad- 
dress space. The other I/O unit registers are ac- 
cessed indirectly through the select and window reg- 
ister. 
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Register 
Address (9:4) 
SW 
Side Effects 


I/O Register 
Select 
000000 
W 


I/O Window 
Register 
000001 


Register 
I/O Reg Select (7:0) 
SW 
Side Effects 


I/O Unit 10 Register 
00000000 
rw 


Version 
Register 
00000001 
r 


Redirection 
Table [0] (31 :0) 
00010000 
rw 


Redirection 
Table [0] (63:32) 
0001 0001 
rw 


Redirection 
Table [1] [31:0] 
0001 0010 
rw 


Redirection 
Table [1] [63:32] 
0001 0011 
rw 


Redirection 
Table [2] (31 :0) 
0001 0100 
rw 


Redirection 
Table [2] (63:32) 
0001 0101 
rw 


Redirection 
Table [3] [31 :0] 
00010110 
rw 


Redirection 
Table [3] [63:32] 
00010111 
rw 


Redirection 
Table [4] (31 :0) 
0001 1000 
rw 


Redirection 
Table [4] (63:32) 
0001 1001 
rw 


Redirection 
Table [5] [31 :0] 
0001 1010 
rw 


Redirection 
Table [5] [63:32] 
00011011 
rw 


Redirection 
Table [6] (31 :0) 
00011100 
rw 


Redirection 
Table [6] (63:32) 
0001 1101 
rw 


Redirection 
Table [7] [31 :0] 
00011110 
rw 


Redirection 
Table [7] [63:32] 
00011111 
rw 


Redirection 
Table [8] (31 :0) 
00100000 
rw 


Redirection 
Table [8] (63:32) 
00100001 
rw 


Redirection 
Table [9] [31 :00] 
00100010 
rw 


Redirection 
Table [9] [63:32] 
00100011 
rw 


Redirection 
Table [10] (31:00) 
00100100 
rw 


Redirection 
Table [10] (63:32) 
00100101 
rw 


Redirection 
Table [11] [31 :0] 
00100110 
rw 


Redirection 
Table [11] 
[63:32] 
00100111 
rw 


Redirection 
Table [12] (31:0) 
00101000 
rw 


Redirection 
Table [12] (63:32) 
00101001 
rw 


Redirection 
Table [13] [31:0] 
00101010 
rw 


Redirection 
Table [13] [63:32] 
00101011 
rw 


Redirection 
Table [14] (31:0) 
00101100 
rw 


Redirection 
Table [14] (63:32) 
00101101 
rw 


Redirection 
Table [15] [31:0] 
00101110 
rw 


Redirection 
Table [15] 
[63:32] 
00101111 
rw 
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Registers 
Address 
(9:4) 
SW 
Side Effects 


Local Unit ID Register 
000010 
rw 


Version 
Register 
000011 
r 


Reserved 
000100 


Reserved 
000101 


Reserved 
000110 


Reserved 
000111 


Task Priority Register 
001000 
rw 
mask intr dispense 


Reserved 
001001 


Reserved 
001010 


EOI Register 
001011 
rw 
prioritization 
cycle 


Remote 
Register 
001100 
r 


Logical Destination 
Reg. 
001101 
rw 


Destination 
Format Reg. 
001110 
rw 


Spurious 
Vector Register 
00 1111 
rw 


ISR (31:0) 
010000 
r 


ISR (63:32) 
010001 
r 


ISR (95:64) 
010010 
r 


ISR (127:96) 
01 0011 
r 


ISR (159:128) 
010100 
r 


ISR (191:160) 
01 0101 
r 


ISR (223:192) 
01 0110 
r 


ISR (255:224) 
01 0111 
r 


TMR (31:0) 
011000 
r 


TMR (63:32) 
01 1001 
r 


TMR (95:64) 
01 1010 
r 


TMR (127:96) 
01 1011 
r 


TMR (159:128) 
01 1100 
r 


TMR (191:160) 
01 1101 
r 


TMR (223:192) 
01 1110 
r 


TMR (255:224) 
01 1111 
r 


IRR (31:0) 
100000 
r 


IRR (63:32) 
100001 
r 


IRR (95:64) 
100010 
r 


IRR (127:96) 
100011 
r 


IRR (159:128) 
100100 
r 


IRR (191:160) 
100101 
r 


• 


Registers 
Address (9:4) 
SW 
Side Effects 


IRR (223:192) 
100110 
r 


IRR (255:224) 
100111 
r 


Intrpt Comnd Reg. (31 :0) 
110000 
rw 
send interrupt 


Intrpt Comnd Reg. (63:32) 
11 0001 
rw 


Local Vector Table [timer] 
11 0010 
rw 


Reserved 
11 0011 


Reserved 
11 0100 


Local Vector Table [local int 0] 
11 0101 
rw 


Local Vector Table [local int 1] 
11 0110 
rw 


Reserved 
11 0111 


Initial Count Register 
11 1000 
rw 


Current Count Register 
11 1001 
r 


Reserved 
11 1010 


Reserved 
11 1011 


Reserved 
11 1100 


Reserved 
11 1101 


Divider Configuration 
Reg. 
111110 
rw 


Reserved 
111111 


NOTE: 
Address 
space 
101000 
to 101111 
and 111111 
are reserved 
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132-Lead 
PQFP 
Package 
Type KU (See Packaging 
Specification. 
Order Number 
240800) 


Vee 
011 
010 
09 
Vss 
Veep 
DB 
07 
06 
Vssp 
05 
04 
03 
02 
Veep 
01 
Vssp 
DO 
Reserved 
Reserved 
Al0 
A9 
VSSP 
AB 
Veep 
A7 
A6 
A5 
A4 
Vssp 
A3 
Vee 
Vss 


Vss 
Vee 
INTINO 
INTINl 
INTIN2 
INTIN3 
INTlN4 
INTlN5 
INTIN6 
INTIN7 
INTIN8 
INTIN9 
INTIN 10 
INTlN11 
INTIN12 
INTIN13 
INTIN14 
INTIN 15 
L1NTINO 
L1NTIN 1 
MB10 
MB11 
MB12 
MB13 
Reserved 
CS 
OLE 
Reserved 
Reserved 
Reserved 
Vee 
VSS 
Vss 


NOTE: 
See pin description 
section 
for appropriate 
pin-strapping 
of the reserved 
pins. 
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15.0 
PACKAGE THERMAL 
SPECIFICATION 


any environment, to determine whether the device is 
within the specified operating range. 


The 82489DX is specified for operation when the 
case temperature is within the range of O·C to 
+ 85·C. The case temperature may be measured in 


The PQFP case temperature should be measured at 
the center of the top surface opposite the pins, as 
shown in Figure below. 


Thermal Reslstance-·C/W 


Parameter 
Air Flow Rate (Ft./Min) 


0 
200 
400 
600 
800 
1000 


8 Junction to Case 
8 
8 
8 
8 
8 
8 


8 Junction to Ambient 
32.5 
25.5 
20 
18.5 
16 
15 


NOTES: 
1. Table 
above 
applies 
to 82489DX 
PQFP plugged 
into a socket 
or soldered 
directly 
into the board. 


2. IJJA = IJJC + IJCA· 


• 


16.0 
GUIDELINES FOR 82489DX 
USERS 


This section outlines one possible initialization sce- 
nario. Other scenarios are certainly possible, and 
one would be selected as part of a platform stan- 
dard initialization scheme. The intent of this section 
is to illustrate that the initialization support provided 
by the 82489DX is adequate to support MP (Multi- 
processor) system initialization. 


Each 82489DX has a RESET input pin connected to 
a common Reset line. Upon system reset, this com- 
mon 
reset 
line 
is 
activated, 
causing 
all 
the 
82489DXs to go through reset. All 82489DX local 
units (note: only local units and not I/O units) latch 
their ID from their address bus on reset. The ID can 
be provided by the bus control agent based on slot 
number. 


The local units next assert their processor's Reset 
pin, holding the processor in reset, and next perform 
their internal reset, setting all registers to their initial 
state. The initial state of all 82489DX Units (both 
local and I/O units) is "all masks set" and all Local 
Units disabled; registers are otherwise initialized to 
zero. Note that the PINT and PNMI output pins are in 
tri-state mode when the local unit is disabled. After 
this, each 82489DX local unit will deassert its proc- 
essor's Reset pin, allowing the processors to come 
out of reset and perform self test and start executing 
initialization code. 


Note that while connecting PRST pin it should be 
noted that whenever PRST pin is activated by 
82489DX either because of software reset message 
or hardware reset, the 82489DX itself is reset. It 
should be taken care in the cases of Warm reset 
where only processors need to be reset and not the 
interrupt controller. In brief, the usage of PRST de- 
pends upon the system requirement on various re- 
set. 


Somewhere in this code sequence, the processors 
that are "alive" will enable their 82489DX local units, 
and attempt to force all the other processors back 
into Reset. Forcing the other processors into reset is 
performed by sending them the inter-processor in- 
terrupt with Destination Mode = "Physical", Deliv- 
ery Mode = "Reset", 
Trigger Mode = "Level", 
Level = "1", and Destination Shorthand = "All Excl 
Self". Only the first processor to get the ICC bus will 
succeed in sending this signal and reset all other 
82489DXs and their processors. The other proces- 
sors are kept in reset until such time that an MP 
operating system decides they can become active 
again. The only running processor next performs the 
rest of system initialization. 


intel~ 


Eventually, an MP operating system will be booted 
at which time the operating system would send 
"deassert reset" interprocessor signals to activate 
the other processors in the system. A mechanism 
must be provided by the platform that allows the 
added processors to differentiate the very first reset 
from a subsequent one. 


16.2 Compatibility 


COMPATIBILITY 
LEVELS 


The 82489DX can be used in conjunction with stan- 
dard 8259A-style interrupt controllers to provide a 
range of compatibility levels. 


At the lowest level we have "PC shrink-wrap" com- 
patibility. This level effectively creates a uniproces- 
sor hardware environment within the MP platform 
capable of booting/running 
DOS shrinkwrap soft- 
ware. In this mode, only the 8259A generates inter- 
rupts and the 82489DX becomes a virtual wire. The 
interrupt latency can be minimized by connecting the 
8259A interrupt to local unit directly. 


The next level preserves the software compatible 
view of an 8259A but it allows more than one proc- 
essor to be active in the system. This results in an 
asymmetrical arrangement, with one processor field- 
ing all 8259A interrupts but with added inter-proces- 
sor interrupt capability. In this 
mode, 82489DX 
"merges" 8259A interrupts with inter-processor in- 
terrupts. Existing I/O drivers would be bound to the 
compatible CPU and interface directly with the 
8259A. 


At the next compatibility level, 8259A compatible 
drivers can be mixed with native 82489DX drivers. 
Devices can generate interrupts at either 8259A or 
an 82489DX. This provides for partial symmetry as 
individual drivers migrate from the 8259A to native 
82489DXs. 


Another 8259A compatible point can be defined for 
MP systems. Each processor could have its own 
compatible 
8259A 
controllers, 
allowing 
multiple 
processors to run compatible I/O drivers, but stati- 
cally spreading the load across the available proces- 
sors. 


The 
principle 
of 
compatible 
operation 
is 
very 
straightforward; the 82489DX(s) become a virtual 
wire connecting the 8259A's INT output through to 
the processor, while at the same time making 8259A 
visible to the processor. 


The two connection schemes described only differ 
in the number of 82489DX(s) (one or two) that are 
located in the path from the 8259A to the processor. 
In the one 82489DX example illustrated in Figure 37, 
the INT output of the 8259A connects to one of the 
Interrupt Input pins of the 82489DX through an edge 
generation logic. This could be an interrupt pin on 
the 82489DX's I/O unit or local unit; assume a local 
interrupt input is used. The Local Vector Table entry 
for the interrupt pin that connects to the 8259A is set 
up with a Delivery Mode of "ExtINT" and edge trig- 
ger mode. This indicates that the interrupt is gener- 
ated by an external controller. The processor's INT 
pin connects to the 82489DX PINT pin. 


This setup enables the 82489DX local unit to detect 
assertions (up-edges) of the 8259A's INT output pin 
and pass this on to the processor's INT input. 
82489DX asserts ExtlNTA pin along with (one clock 
prior to) PINT pin to indicate "8259" interrupt. When 
the processor performs its INTA cycle the 82489DX 
itself does not respond other than deasserting PINT 
to the processor. At the third clock after ADS in the 
second bus cycle of INTA cycle ExtlNTA is deassert- 
ed. External logic should make use of the ExtlNTA 
signal to make the INTA cycle visible to the 8259A 
and the 8259A should provide the vector. At the 
same time, the local unit considers the external re- 
quest as delivered, and need not wait for the exter- 
nal 8259A's INT to be deasserted. A new up-edge 
must be generated on the 8259A INT pin before the 
local unit will assert the processor's INT pin on be- 
half of the 8259A. External edge generation logic 
should be used for this. Compatible software inter- 
acts directly with the 8259A. 


The mechanism is essentially the same in the two- 
82489DX scheme. The difference is that the 8259A 
connects to an interrupt input pin of the 82489DX 
I/O unit in the I/O system. The Redirection Table 
entry for this pin is again programmed with an 
"ExtINT" Delivery Mode, and the (single) 82489DX 
destination local 10corresponding to the compatible 
DOS processor. Capturing the up-edges of the 
8259A's INT pin by the 82489DX local unit now in- 
volves sending messages from the 82489DX I/O 
unit to the 82489DX local unit via the ICC bus. The 
"virtual wire" now includes messages over the ICC 
bus. 


Adding inter-processor ICC interrupts (or any other 
82489DX generated interrupts) to the compatible 
operation is accomplished by having the 82489DX 
internally OR the 8259A's INT request with any 
82489DX interrupt request. 


Before the 82489DX actually sends the interrupt sig- 
nal to the processor, the 82489DX decides whether 
it does this for an 82489DX interrupt or whether it 
does this on behalf of the external controller. When 
the processor performs the corresponding INTA cy- 
cle, only the 82489DX knows whether it should re- 


spond with a vector, or whether the external 8259A 
should. 


If the 82489DX needs to respond, then it will enable 
an externally implemented trap that prevents the 
8259A from seeing the INTA cycle. If the 8259A 
needs to respond, then the 82489DX will not enable 
the INTA trap, and the INTA will be allowed to reach 
the 8259A. 82489DX implements this by asserting 
its EXTINTA pin to indicate external 8259A should 
respond with the vector. The 82489DX local unit 
controls the INTA trap via its "ExtINTA" output pin; 
the 82489DX does not actually provide the trap it- 
self. 


In systems that can be booted either as a configura- 
tion with compatible 8259A or without, device inter- 
rupt lines are connected to both the Interrupt Re- 
quest pins of the 8259A and Interrupt Input pins of 
the 82489DX with all interrupts either masked at the 
82489DX or at the 8259A. Some EISA and Micro- 
Channel chip sets that include on-chip 8259As also 
have internally connected interrupt requests. For ex- 
ample, the 82357 (the ISP of the EISA chipset) gen- 
erates timer and DMA chaining interrupts internally. 
These are not available as separate interrupts out- 
side the ISP. In non compatible mode the ISP timers 
are not used, since each local 82489DX unit pro- 
vides its own timer. Therefore, the ISP's 8259A is 
configured to mask out all interrupts except the DMA 
chaining interrupt which is configured in level-sensi- 
tive, auto EOI mode. This causes the 8259A's INT 
output to track the state of the internal DMA inter- 
rupt request. The 8259A's INT output is then con- 
nected to one of the 82489DX interrupt input pins 
programmed 
to 
generate 
a 
regular 
(Le., 
not 


"ExtINT") level-sensitive interrupt. The ISP 8259A 
then no longer functions as an external interrupt 
controller; it has been logically disabled, and it 
needs no interrupt acknowledge or EOI. The INTA 
and EOI cycles occur only at the 82489DX. It should 
be noted that 82489DX accepts only active high lev- 
el/edge interrupt inputs. External programmable log- 
ic should take care of polarity reversal that may be 
needed in EISA system for sharing of interrupts. 


The 82489DX goes to reset state either by Hard- 
ware Reset state or' Software Reset message re- 
ceived on the ICC bus. On reset, 82489DX is dis- 
abled. The following 
is the 
hardware state 
of 
82489DX after reset. 


PRST 
Active (HIGH) 


PNMI 
TRI-STATED (Internal Pull-Down Provided) 


PINT 
TRI-STATED (Internal Pull-Down Provided) 


82489DX is disabled on Reset and unless specifical- 
ly enabled, it does not start its interrupt mechanism. 


The difference between hardware reset and soft- 
ware reset message is that during hardware reset 
82489DX samples the address bus and stores the 
last sample in local 
Unit 10 whereas for software 
reset it does not sample and store the unit 10. In 
addition, during the hardware reset pulse should be 
wide enough to accommodate for at least one rising 
and falling edge of IClK. On hardware reset ExtlN- 
TA is held high. 


PNMI, PINT are tri-stated at power on and they are 
maintained in tri-state condition till the unit is en- 
abled. Eventhough internal pull down resistor is pro- 
vided on PNMI and PINT external additional pull 
down resistor may be needed depending upon the 
loading on these pins by external logic. The DC 
characteristics gives the control specification from 
which the value of resistor, if needed, can be calcu- 
lated. 


It should be kept in mind that the ICC bus being 
electrically open drain bus requires pull up resistors 
at the MBO pins. ICC bus output low current is just 
4mA. 


PINT and ExtlNTA 
Timings 


It should be noted that for ExtlNTA type of interrupts 
PINT gets activated 
one clock after ExtiNTA 
gets 
activated. 
When getting deactivated, both PINT and 
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ExtlNTA 
Timings 


In the interrupt acknowledge cycle for External Inter- 
rupt control, 82489DX asserts ExtiNTA. It decodes 
the type of cycle from CPU control signals like M/IO, 
D/G and W/A. CPU does two bus cycles back to 
back for interrupt acknowledge cycle. 82489DX 
maintains ExtlNTA active throughout the first cycle. 
For next cycle (when the vector will be given by ex- 
ternal 8259) after 82489DX senses the start of the 
cycle (by ADS) 82489DX deactivates ExtINTA. Ex- 
ternal control logic may be inserting wait states to 
match the 8259 timings. Since 82489DX has no way 
of finding out the cycle completion, 82489DX deas- 
serts ExtlNTA before the second bus cycle gets 
completed. This should be kept in mind While using 
ExtlNTA for external interrupt control logic. 


The 82489DX is a 32-bit high performance interrupt 
controller. It allows the CPU to do 32-bit read and 
write to it. By memory mapping 82489DX the system 
performance can be enhanced. It should be noted 
that 82489DX does not support pipelining. Even- 
though 82489DX can be memory mapped, its func- 
tionality as an interrupt controller should be kept in 
mind while programming the virtual memory man- 
agement control data structure. The caching policy 
for the page where 82489DX is mapped should also 
be done with the functionality of 82489DX in mind. 
For example, the reads to 82489DX should not be 
cached and writes should be write-through. Since 
82489DX registers are aligned at 128-bit bounda- 
ries, memory mapping 82489DX with interleaved 
memory system should not be a problem. 


The JTAG circuit is used for boundary scan test. The 
JTAG pins has a TCK, (JTAG clock), TRST, (JTAG 
Reset), TDI, (Test Data Input), TMS, (Test Mode Se- 
lect) and TOO, (Test Data Output). 


The JTAG circuitry, if not used, should be properly 
deactivated so that it will not interfere in the normal 
functional operations. The JTAG can be inactivated 
in anyone of the following ways: 
1. Jl'AG inactivation through TRST: The TMS, Test 
mode Select should be either left open (internal 
pull up is provided) or tied to Vcc. The TRST can 
be pulsed low (bring it low and after meeting the 
pulse width requirement bring it back high again) 
to keep the JTAG circuitry to idle state. The 
TRST pulse brings the JTAG circuitry to idle state 
and TMS being kept high maintains the JTAG cir- 
cuitry in idle state. 


2. JTAG inactivation through TCK: The TMS, Test 
mode select should be either left open (internal 
pull up is provided) or tied to Vcc. The TCK within 
5 clocks brings the JTAG circuitry to idle state. 
The TMS, being held at logic high level, maintains 
the JTAG circuitry in idle state. 


16.4 
Programming 
Guidelines 


The 824890X register data structure contains differ- 
ent fields to specify the mode of operations and the 
options available within each mode. Since certain 
options are applicable to specific modes only (for 
example "Remote Read" mode applies only to inter- 
rupt command register, it does not have relevance 
to I/O unit's redirection tables) the following pro- 
gramming guidelines are provided. 


All the local units and I/O units hooked in a ICC bus 
should have unique 10before they can use the bus. 
This should be ensured by the programmer since for 
ICC bus arbitration the units (whether it is local unit 
or I/O unit) arbitrate with their unit 10. 


For future compatibility, the Units should be as- 
signed IDs starting with 0, 1, 2 etc. with the highest 
10 in the system being number of units minus 1. So 
in a four 824890X system there are four local units 
and four I/O units. The 10starts with 0 and the high- 
est 10 in the system will be 7. Note that each unit 
should have different 10 in the system. 


ATOMIC 
WRITE 
READ TO TASK 
PRIORITY 
REGISTER 


Normally, the task priority register is written with 
highest priority to mask certain low level interrupts 
before entering into critical section code. In a sys- 
tem where 82489DX is memory mapped the CPU 
may buffer this task priority register write to its on 
chip write buffer. The following scenerio can happen 
in such situation: CPU posts task priority register 
write to its on chip write buffer and enters into the 
critical code. A lower priority interrupt (which should 
not enter the critical code) interrupts the CPU before 
the write buffer gets flushed into task priority regis- 
ter). The CPU accepts the lower priority interrupt. To 
avoid the situation atomic write read to task priority 
register should be done. The read following write en- 
sures that the write buffer is flushed to task priority 
register and the atomicity ensures that no interrupt 
will be accepted by the CPU during its write to task 
priority. 


It should be noted that if the CPU does interrupt 
acknowledge cycle only after flushing the write buff- 
ers then the above situation may not arise. 


Each 82489DX has a single Interrupt Command 
Register that it uses to send interrupts to other proc- 
essors. The programmer should make sure to syn- 
chronize access to this register. Specifically, 1). writ- 
ing all fields of the register, 2). Sending the interrupt 
message (by writing the LSB register), and 3). wait- 
ing for Delivery State to become Idle again, should 
occur as a single atomic operation. For example, if 
interrupt handlers are allowed to send inter-proces- 
sor interrupts, then interrupt dispensing to the proc- 
essor must be disabled for the duration of these ac- 
tivities. 


INTERRUPT 
COMMAND 
REGISTER 
PROGRAMMING 
SEQUENCE 


The interrupt command register (31:0) has a side 
effect of sending interrupt once it is written. The des- 
tination is provided in the interrupt command register 
(63:32). So 
always 
interrupt 
command 
register 
(63:32) should be programmed before programming 
interrupt command register(31:0). 


Two different interrupts should not be programmed 
with the same interrupt vector. 


Only Interrupt command register supports "Remote 
Read" Delivery mode. Local and I/O unit interrupts 
do not support "Remote Read". 


ICR (INTERRUPT 
COMMAND 
REGISTER) 
1. ExtlNTA delivery mode is not supported for all 
destination shorthands. 


2. "Remote Read" should always be programmed 
as "Edge" triggered interrupt. 


3. "Remote Read" should always be programmed 
with physical destination mode (and not with Log- 
ical Destination mode). Broadcast addressing 
should not be used for Remote Read. 
4. For "all incl Self" and "All exc. self" destination 
shorthands, "remote read" delivery mode should 
not be used. 
5. For "all incl self" and "self" 
destination short- 
hands "Reset" 
delivery mode should not be 
used. 


• 


ICR (INTERRUPT 
COMMAND 
REGISTER) 


(Continued) 
6. For "all exc self" destination shorthand if "Re- 
set" delivery mode is used, it should be ensured 
at system level that only one processor executes 
this instruction at any time. 


7. Messages could be sent out in "Logical" 
or· 


"Physical" mode with destination ID of all 1's de- 
pending on the way Destination mode entry is 
programmed. In brief, "All incl self" and "All exc. 
Self" support both "Logical" and "Physical" ad- 
dressing mode. 


8. When destination shorthand (Le., broadcast) is 
used with lowest priority destination mode, then 
even though all participates in arbitrating for des- 
tination, only the lowest priority gets the mes- 
sage. So even though the addres!!ing is broad- 
cast since the destination mode is lowest priority 
only one gets the message. 


9. When destination shorthand (Le., broadcast) is 
used with "Fixed" destination mode, then all the 
units get the message. 


ISRIIRR/TMR 


Bits 0-15 
of IRRIISR/TMR 
do not track interrupt. 


No interrupt of vector numbers from 0-15 
can be 


posted; The total interrupt supported are 240. When 
reading the lowest 32 bits of these registers, 0 will 
be returned for the lower 16 bits. 


Focus processor is applicable only within the ad- 
dressed units. 


ExtlNT 
Interrupt 
Posting 


The external interrupt has no priority relationship 
with the 82489DX priority. But when posting an inter- 
rupt to the processor, if both an external interrupt 
and a 82489DX interrupt are pending, 82489DX 
could post either one to the processor. In 82489DX 
implementation, 
it would 
post 
external interrupt 


whenever there is no other 82489DX interrupt that 
can be posted to the processor. It should be also 
noted that External interrupts can not be masked by 
raising task priority. However, they can be masked 
by the mask bit in the table entry for the ("ExtINTA") 
external interrupt. 


The extlNT interrupts are specific in their character- 
istics in that they do not have any priority relation- 
ship with the rest of the interrupt structure. ISR and 
IRR bits in 82489DX are used to do the housekeep- 
ing functions for interrupt priority. Since extlNT inter- 
rupts do not have any priority relationship, ISR and 


IRR bits are not maintained for external interrupts. 
As far as interrupt acceptance is concerned, if more 
than one extlNT interrupts are directed towards a 
local unit, that local unit treats all the extlNT inter- 
rupts directed to it as only one extlNT interrupt. This 
leads to an important point that in a system not more 
than one interrupt should be programmed as extlNT 
interrupt type with the same destination. It should be 
noted that there can be more than one extlNT type 
of interrupt in a system with each having different 
local unit as destination. 


Synchronizing 
Arb IDs 


Initialization of an 82489DX's local unit ID is imple- 
mentation dependent. In some platforms, power-on 
reset will latch the right values into the 82489DXs; in 
other platforms, unique IDs may be assigned by ini- 
tialization firmware. In both cases the 82489DX I/O 
unit should be assigned unique ID by initialization 
firmware. The important point is that the 82489DXs 
are required to have unique IDs before they can use 
the bus, and in addition, all their Arb IDs must be "in 
sync". Synchronizing Arb IDs is accomplished as a 
side effect of a "deassert reset" interrupt command. 
This resets the (rotating) Arb ID to the (constant) unit 
ID; it assumes that all 82489DXs have their unique 
ID. 


"Only once delivery" semantics for a group destina- 
tion is guaranteed only if multiple fixed delivery of 
the same interrupt vector are not mixed. For lowest 
priority arbitration to work, all the arbitration ID of 
local 82489DXs in the system should be in sync. 
This means after local unit IDs are written in all local 
units (each ID should be different from other IDs) a 
RESET DEASSERT message should be sent in ALL 
INCLUSIVE mode. The RESET DEASSERT mes- 
sage should be sent before system is used for low- 
est priority arbitration. This ensures that all ARB IDs 
are also different. (Arb IDs are copied from local unit 
IDs during RESET DEASSERT message.) 


The RESET DEASSERT message, if not sent, only 
one delivery semantics may not be guaranteed in 
the cases where lowest arbitration is used in the sys- 
tem. 


DISABLING 
LOCAL 
UNIT 


Once the 82489DX is enabled by setting bit 8 of 
spurious vector register to 1, the user should not 
disable the local unit by resetting the bit to O. The 
result will put the local unit in an inconsistant state. 
The local unit can be disabled by sending "reset" 
interrupt message to the local unit. 


EOI, End of Interrupt issuing indicates end of service 
routine to 82489DX. The ISR bit which is set during 
INTA cycle gets cleared by EOI. This section dis- 
cusses the relevence of EOI to the specific types of 
interrupts and its timing related to interrupt deasser- 
tion. 


External Interrupts should be programmed as edge 
type. INTA cycles to external interrupts are taken 
automatically as EOI by 82489DX. This is similar to 
AEOI, Automatic End of Interrupt of 8259A. So there 
is no need to issue EOI to 82489DX for external 
interrupt servicing. This is done to achieve software 
transparency in the compatible mode. 


Spurious interrupts do not have any priority relation- 
ship to other interrupts in the system. So IRR is not 
set for spurious interrupts. EOI should not be issued 
for spurious interrupts. It is advisable not to share 
the spurious interrupt with any vector. If spurious in- 
terrupt vector is shared with some other interrupt 
then while servicing issuing EOI depends on the 
source of interrupt. If the source is spurious interrupt 
(for which the corresponding IRR is not set) then 
EOI should not be used. If the source is a valid inter- 
rupt sharing the spurious interrupt vector (for which 
the IRR is s.et)then EOI should be issued. 


For NM type of interrupt no IRR bit is set. So EOI 
should not be issued while servicing NMI type of in- 
terrupts. 


Task priority register is used to specify the priority of 
the task the processor is executing. In 8259 the 
priority is defined only among the interrupts that it 
handles. 82489DX goes farther ahead in handling 
priority. In multitasking system, in addition to device 
interrupts various tasks have different priority and 
82489DX allows consideration of the priority at sys- 
tem level. The processor specifies the priority of the 
task it executes by writing to task priority register. 
Now any interrupts at and below the task priority will 
be masked temporarily till the task priority gets low- 
ered. The masking granularity is at priority level. Out 


of 256 interrupt vectors 16 priority levels are speci- 
fied and 16 vectors share one priority level. Since 
the masking granularity by the task priority register is 
at priority level, group of 16 vectors get masked 
when task priority register is increased by one level. 


When task priority is at its minimum level of 0, inter- 
rupt vectors having level 1 to 16 are passed to CPU. 
Stated in other words, even when the task priority 
register is at its minimum (of level 0), interrupt vec- 
tors at level 0 will be masked. This means that the 
interrupt should not be programmed with vectors 0 
to 15. So out of 256 interrupt vectors, only 240 inter- 
rupt vectors (vector 16 to 255) can be used in 
82489DX. 


ExtlNT interrupt does not have any priority relation- 
ship with other interrupts or task priority register. So 
ExtlNT interrupt can not be masked by raising task 
priority. They can be masked by writing to the vector 
table entry which corresponds to ExtlNT interrupt. 


When enabling units and removing Mask bits in situ- 
ations where a device may already be injecting inter- 
rupts into the 82489DX system, the Mask in the Re- 
direction Table should be removed last to ensure 
proper initial state (e.g., Remote IRR bit matching 
IRR in local unit). 


It is software's responsibility to make sure that Deliv- 
ery Mode and Trigger Mode are set to meaningful 
combinations as listed below. 


Delivery 
Mode 


Fixed 
Lowest Priority 
Remote Read 
NMI 
Reset 
ExtlNT 


Trigger 
Mode 


edge/level 
edge/level 
edge 
level 
level 
edge 


Software is also responsible for not using meaning- 
less Delivery Modes in Redirection Table entries and 
local Vector Table entries (e.g., use of Remote Read 
delivery mode). 


• 


Software has total control over the assignment of 
interrupt vectors to interrupt sources. The operating 
system writer should be aware of a number of things 
when doing this assignment. 


Some processor architectures assign a predefined 
meaning to some of the vectors (Le., entries in the 
interrupt table) as entry points to certain trap and 
exception handlers (e.g., divide error, invalid op- 
code, page fault, etc.). The programmer is strongly 
advised not to reuse these vectors. 


The programmer must also be careful when using 
the same vector number to represent different inter- 
rupt sources (sharing vectors). This is especially true 
for level triggered interrupts. When multiple sources 
with different Redirection table entries share an in- 
terrupt vector, any of the sources deactivating its 
level signal will remove the interrupt request for all 
sources. Giving each interrupt source its interrupt 
vector in any case is the preferred approach. 


Each 82489DX has a single Interrupt Command 
Register that it uses to send interrupts to other proc- 
essors. It is software's responsibility to synchronize 
access to this register. Specifically, 1) writing all 
fields of the register, 2) sending the interrupt mes- 
sage (by writing the low register), and 3) waiting for 
Delivery State to become Idle again, should occur as 
a single atomic operation. For example, if interrupt 
handlers are allowed to send inter-processor inter- 
rupts, then interrupt dispensing to the processor 
must be disabled for the duration of these activities. 


When a level triggered interrupt source deasserts its 
interrupt input, the destination will clear the inter- 
rupt's IRR bit only after receiving the message from 
the ICC bus. This introduces a small delay between 
the removal of the interrupt at the source and the 
removal of the interrupt at the processor. To avoid 
generating unnecessary interrupts, the interrupt han- 
dier should remove the interrupt at the source (at the 
device) as early as possible in the handler. 


In any case, handlers should be able to deal with 
unnecessary interrupts. 


A side effect of a reset deassertion message broad- 
cast in the ICC bus is that all 82489DX local units 
reset their Arb ID to their unit ID. Interrupt com- 


mands that use the "self" Destination Shorthand do 
not generate a message on the ICC bus. If software 
only wants to generate the side effect of resetting 
Arb IDs, it should use a command with Logical Desti- 
nation Mode and a Destination field containing all 
zeroes. 


There are a number of levels at which interrupts can 
be masked, each resulting in a different behavior on 
interrupt delivery. 


• First, 
interrupt 
injection 
(or 
deliver) 
can 
be 
masked by setting the Mask bit in the interrupt's 
Redirection Table or local Vector Table entry. 
These interrupts are ignored, no message is sent 
for them. Granularity is an individual interrupt. 


• Second, each 82489DX can individually mask in- 
terrupt dispensing by raising its Task Priority to 
some level. This 82489DX will not dispense inter- 
rupts to its processor of this and lower priority 
unless it is currently the focus of the interrupt. 
Note again that the 82489DX is designed to oper- 
ate as fully nested with non-specific EOI (to use 
existing 8259A terminology). There is no explicit 
interrupt mask (such as MR) and there is no no- 
tion of specific EOI. 


• Third, each processor may provide a mechanism 
that masks all interrupt dispensing to it using the 
processor supplied instructions or status bits to 
do so. This does not interfere with lowest priority 
arbitration of the processor's 82489DX local unit. 


Redirection Tables are typically set up at initializa- 
tion time. When modifying a Redirection Table entry 
"on the fly" the programmer must be aware of state 
kept at other 82489DXs relative to the interrupt be- 
ing modified. 


It is strongly recommended to read the device status 
registers before servicing the device. This is be- 
cause if an edge triggered device deasserts its inter- 
rupt before interrupt acknowledge cycle (it should 
NOT) 82489DX will NOT give spurious vector. It will 
give genuine interrupt vector corresponding to the 
device. So, interrupt service routine should validate 
the interrupt request before servicing the device. 


SYSTEM 
HARDWARE 
AND 
SOFTWARE 
DESIGN 
CONSIDERATIONS 


Design Consideration 
1 


Description: The following design consideration has 
to be taken care of when using ISP (82357) as exter- 
nal interrupt controller. 82489DX allows connecting 
external 8259 type interrupt controller at one of its 
inputs. The mode associated with the interrupt input 
which has 8259 connected to it is called ExtlNTA 
mode. 82489DX allows only EDGE TRIGGERED 
programming option for ExtlNTA mode. But in the 
case of 82357, the INT output from ISP stays high in 
case more than one interrupt is pending at its inputs. 
It does not always inactivate its INT output after 
INTA cycle. This will lead to a situation where ISP 
keeps the interrupt at high level continously and 
waits for INTA cycle. But since 82489DX expects an 
edge for interrupt sensing (for ExtlNTA interrupts) it 
does not pass the interrupt to CPU and further inter- 
rupts are lost. So External circuitry should monitor 
the end of SECOND CYCLE of INTA cycle and force 
an inactive state at 82489DX's input. To avoid glitch- 
es at 82489DX input, this external logic should clear 
its output only at the end of second INTA cycle. It 
should be set by high going 82357 output. It should 
never be cleared by low going 82357 output. That is 
it should not follow 82357 output. 


Design 
Consideration 
2 


Description: The following design consideration has 
to be taken care of when using 82489DX in EISA 
systems. EISA ISP(82357) chip integrates 8259A. It 


additionally allows sharing of interrupts. To facilitate 
this sharing it has a programmable register, ELCR 
(Edge / Level trigger control register) by which cer- 
tain interrupt inputs can be programmed as edge 
(low to high except for RTC) or level (the level is 
active low). The determination of edge or level is 
done during initial configuration of EISA system by 
reading EISA add in boards from the interrupt de- 
scription data structures. The solution is to have pro- 
grammable logic at the interrupt inputs so that 
82489DX is compatible with EISA ISP. This will intro- 
duce one more register and logic to support this. 
This should be an 11-bit programmable register and 
an array of ExOR logic (12 ExOR gates or equivalent 
PLD). The ISP allows programmability of the follow- 
ing interrupts. 


INT3 INT4 INTS INT6 INT7 INT9 INT10 INT11 
INT12 INT14 INT1S.In addition to the above 11 in- 
terrupts, it fixes INT8 to be active low edge triggered 
interrupt. INT8 is the only case where it is active low 
edge triggered type. So the following logic can be 
used to add programmability in 82489DX based 
EISA system. Before connecting these 11 interrupt 
lines directly (#INT8 which is from Real Time Clock 
is always active low edge triggered. # INT8 can be 
4 


passed through an inverter since there is no need 
for programmability) to the 82489DX they should 
pass through an array of 11 Ell-OR 
gates. One in- 


put of Ell-OR 
gate connects to the corresponding 


INT pin and other input connects to a bit of program- 
mable register. The output of Ell-OR 
gate is con- 


nected to 82489DX. The idea of Ell-OR 
is to use as 


a controlled inverter.. 


DO 


01 
INTlN6 


02 


03 
INTIN9 
04 


05 


06 
INTlNll 


07 


08 


09 
INTlN14 


010 


INTIN are the interrupt inputs to the 82489DX and 
INT are the system interrupt. The Ex-OR 
gating 
register is programmed after EISA configuration is 
found from add in boards as how these interrupt 
lines are going to be used in that particular configu- 
ration. If a particular input is edge triggered, then the 
corresponding bit in the register is written with O.If a 
particular input is level triggered, then the corre- 
sponding bit in the register is written with 1. 


8259 by itself does not have polarity control where- 
as 8259 when implemented in EISA chipsets have 
the polarity control. Similarly APIC does not have by 
itself polarity register. So polarity register should be 
programmed as a part of system BIOS and not APIC 
BIOS. 


Design Consideration 
3 


lee bus drive is an open drain bus with drive capacity 
of 4 mA only. Since data is transmitted at each lee 
clock, the "charging" 
of lee bus should be fast 
enough to ensure proper logic level at each clock 
edge. The lee bus needs pull up resistors since it is 
open drain bus. Since the drive is only 4 mA, the pull 
up resistor value can not be less than 5V/ 4 mA. This 
being the limit of the resistor value, the length and 
the characteristics of the lee trace forces a capaci- 
tance value. Both the resistor and capacitance 
brings a RC time constant to the lee bus waveform. 
So, Electrical consideration has to be given to and 
practice of controlled impedence should be exer- 
cised for layout of the lee bus. The length of the 
trace should be kept as minimum as possible. If the 
length of the lee bus can't be kept less, than say 6 
inch, because of mechanical design of the system, 
the external line drivers should be added to lee bus 
and lee bus should be simulated with the added driv- 
er characteristics. 


~BO 


8H89DX 


~BI 


NOTE: 
R1 can be typically 
1K. 
R2 is designed 
from the simulation 
results. 


Design Consideration 
4 


This is related to ADS#, BGT# and CS# timings. 
For bus cycles not intended for 82489DX, (CS# = 1 
where 82489DX is supposed to sample it), any 
change in CS# line while the ADS# is still active, 
may erroneously cause a RDY# 
returned from 
82489DX. Anomolous behavior may result if for 
BGT# ties low cases 
a) BGT# goes away just one clock after ADS# or 
b) ADS# is still active, and CS# changes during 
this period. 
For other cases anomolous behavior results if CS# 
changes when ADS# is still active. The following 
considerations are important from timing point of 
view. Always limit the pulse width of 82489DX 
ADS# to one CLKIN. Also avoid changing levels on 
BGT#/CS# 
line, when ADS# is active for cases 
being identified as BGT# tied low (BGT# sampled 
low when ADS# goes active). Also avoid changing 
levels on CS# line when BGT# is active. 


Design Consideration 
5 


82489DX does not recognize the interrupt when an 
edge occurs at the interrupt input pin while interrupt 
is masked. When later it is unmasked there is no 
further edge and so 82489DX never passes that for- 
gotten edge and that interrupt channel becomes un- 
usable after that. 


The recommendation is that first 82489DX should 
be unmasked and then the device interrupt should 
be enabled in the device register. By this, software 
can ensure that always an edge will occur after an 
interrupt is unmasked. 


Design Consideration 
6 


Description: Edge triggered interrupts should not 
deassert their output till they are acknowledged by 
INTA cycle from CPU. 


82489DX employs glitch detection logic for edge 
triggered logic. To make sure the detected edge in- 
terrupt is not a glitch, 82489DX samples the input 
again before sending the interrupt message. The 
time difference between the first sampling of inter- 
rupt to be active and second sampling Oust before 
sending the interrupt) is not a constant number. This 
is because the ICC bus might have been occupied 
by other messages. So, for example if during first 
sampling it was detected that INTINOand INTIN15 
are both active and after sending INTINOit samples 
INTIN15 
again 
before 
sending 
message 
for 


INTIN15. But between this time ICC bus might have 
been occupied by other messages. So even if an 
edge triggered interrupt is held active high for a real- 
ly long time and then brought low before INTA cycle, 
it is considered as a glitch. Because it may happen 
that the second sampling occurred just when the in- 
terrupt line got low. 


Once 
the 
glitch 
detection 
circuitry 
found 
this 


"glitch", it goes back to the state where it will start 
sampling and waiting for an active edge to occur. 
This takes more than one clock cycle (ClK) and if 
the "glitch interrupt" generates an edge before that 
time after the second sampling of low level is done, 
then the edge is lost forever. 


t.4/IO# 


D/C# 
W/R# 
CYCLE# 


Since the time when the second sampling is done is 
unknown, the best way is to make sure the edge 
triggered interrupts do not deassert their outputs till 
they are acknowledged by INTA cycle from CPU. It 
is found that in some cases 8259 can generate brief 
active low pulses on its output. So the glue logic 
between 8259 and 82489DX input pin should make 
sure that 82489DX input pin is clear only after get- 
ting second interrupt acknowledge cycle. The glue 
logic should not just follow the 8259 output. Put in 
other words, after interrupt acknowledge cycle to 
8259, if the 8259 input is seen active high, it should 
generate an edge at 82489DX input. Moreover, even 
if 8259 output goes low the glue logic should not 
lower its output since the only time when the glue 
logic can deassert its output is when it finds an inter- 
rupt acknowledge cycle for 8259. The following PlD 
equations and schematics serves as an example for 
the glue logic between 8259 and 82489DX. 


INTAset 
8259Cyc 
set 8259Cyc 


setTimer 


BCLK is the standard 
EISA BCLK. 


CYCLEii' 
is an external 
signal to indicate 
AOSiI' 
is asserted 
earlier 
and ROY low is not yet returned. • 


82489DX 
.-n+_I 
'eI@ 


+ ITO' 
IReset' 
APICinput 
• INTAset 


+ IINTAset 
• 82591NT 


set 8259Cyc 
- 
IReset' 
INTA • ExtlNTA 
; This INTA cycle is for 8259 


- 
set8259Cyc' 
ITO • IReset 
; Set 8259cyc 
will set 8259 cycle and TO will clear It 


+ ITO • lSet8259Cyc 
• 8259Cyc 
• IResat; 
Hold 8259 cycle till TO clears it 


= IReset * IINT"et 
* INTA ; wait for very first INTA cycle after reset 


+ IReset * INT"et 
; once first INTA cycle after Reset is found, 
set the INT"et 


T2 :- Tl 
• IReset 
• 1T5 
; Follow TO after two clock for setting 
but clear along with TO 


T3 :- T2 • IReset 
• 1T5 
; Follow TO after three clock for setting 
but clear along with TO 


T4 :- T3 • IReset 
• 1T5 
; Follow TO after four clock for setting but clear along with TO 


NOTES: 
T1, T2, T3, T4 and T5 are clocked 
signals 
and others 
are combinatorials. 


This circuit 
and PLD equations 
are given for concept 
clarification 
purpose. 
They are not tested. 


INTAset 
is needed 
so that 
some 
8259 
logic at power 
on activates 
its INT output 
to 1 and it deactivates 
its output 
after 
only 
8259 
initialization 
(which 
should 
happen 
after 
APIC 
initialization) 
and 
since 
APIC 
needs 
to detect 
rising 
edge 
at 
8259, 
it is essential 
to follow 
the 8259 until first interrupt. 
This is the only occasion 
8259 output 
will be just followed. 


DIRECTIONS 
FOR EASY MIGRATION 
TO FUTURE 
INTEGRATED 
APIC 


The following are the software programming direc- 
tions Intel strongly recommends for easy migration 
from 82489DX to integrated APIC. The audience to 
this portion of the document are both hardware de- 
signers and firmware developers for APIC based 
systems. In the following discussions, the APIC 
BIOS is viewed functionally as two subsections 1) 
APIC BIOS which are all interrupt vector, priority, in- 
terrupt distribution related functions and the remain- 
ing portion of BIOS which is referred to part of sys- 
tem BIOS which is responsible for interrupt polarity 
programming, starting next processor, etc. 


Note that the names APIC BIOS and APIC DRIVER 
are interchangeably used in the following discussion. 
Different Operating systems refer such functional 
module differently. 


Question: The logical destination register in future 
implemented APIC may have only 8 MSBs defined 
and 82489DX has 32 bits specified. Will this hinder 
binary level compatibility? 


Response: 
In logical destination (flat addressing 
mode) 82489DX can go up to 32 CPUs whereas fu- 
ture APIC can go up to 8 CPUs with flat logical ad- 
dressing mode. For binary compatibility, 
It is 
strongly recommended that 82489DX software 
use ONLY8 MSB of logical destination register. 


Consideration 
2 


Question: The present day MP systems with exter- 
nal control ports for starting next processor may pro- 
gram those external control ports for starting next 
processor. APIC DRIVER may use external control 
ports for starting next processor. In future implemen- 
tations of APIC, the starting of next processors may 
use more refined mechanisms which may not use 
external control ports. Will this introduce compatibil- 
ity problem? 


Response: Again, the starting of next processor is 
really part of MP system DRIVER and depending of 
the mechanism used to start next processor it will 
vary. In future implementation if starting next proces- 
sor is done using new mechanisms, the starting next 
processor portion of MP DRIVER will be changed 
accordingly. Even though this will not result in any 
change in the APIC DRIVER which deals with inter- 
rupt priority, distribution, etc., the corresponding 
change will be needed in the starting application 
processors portion of DRIVER. 


One possible method of implementing software is 
using version register. Version register is different in 
489DX and future implementations of APIC. Taking 
care of these differences, such as mechanism for 
starting next processor, should be possible using 
version register. 


Question: APIC architecture, by its nature, seems to 
misinterpret spurious interrupts as genuine inter- 
rupts. That is, if an edge triggered interrupt goes in- 
active before interrupt acknowledge cycle, APIC, in- 
stead of giving spurious interrupt vector, gives genu- 
ine interrupt vector. Is it true that this is not the case 
with 8259? If that is the case, drivers which do not 
check device status registers for servicing the de- 
vice may work with 8259 but may not work with 
APIC. Is this a compatibility problem? 


Response: No, this is not true. Even with 8259 there 
is a time window in which a similar thing can happen. 
For example if interrupt goes inactive just after first 
INTA cycle but before second INTA cycle 8259 will 
also signal this spurious interrupt as genuine inter- 
rupt. So drivers which do not check device status 
registers may also fail with 8259. 


Our strong recommendation to device drivers is to 
read device status register before servicing the de- 
vice. If the device status register indicates that there 
is no valid source of the interrupt, the service routine 
should just issue EOI and return. It should not serv- 
ice the device. This should take care of the new driv- 
ers that will be written for APIC. To coexist with 
8259, the APIC interrupt input connected to 8259 will 
be programmed for virtual wire mode. In virtual wire 
mode, the time window of 8259 will apply. So the 
driver will behave same way as it was behaving with 
8259. 


Consideration 
4 


Question: EISA system has active low level polarity. 
82489DX itself does not have polarity control regis- 
ter to support this EISA feature. Implementations us- 
ing external polarity register may implement the po- 
larity register at different address. Will this introduce 
a problem for achieving the goal of single binary? 


Response: 8259 by itself does not have polarity 
control whereas 8259 when implemented in EISA 
chipset has the polarity control. Similarly APICdoes 
not have by itself polarity register. When implement- 
ed in ESC chipset, it will have polarity control regis- 4 
ter. So polarity register should be programmed as a 
part of EISA BIOS and not APIC BIOS. Since system 
BIOS or EISA BIOS should be able to take charge of 
changes, if any, to polarity control register. APIC 
BIOS should not be affected by differences in the 
address for polarity register. 


Question: 
8259 recognizes the interrupt when an 
edge occurs at the interrupt input pin even if the 
interrupt was masked. So when the interrupt input is 
later unmasked, the interrupt is posted to the CPU. 
82489DX does not register this edge and if interrupt 
happens when the interrupt is masked 82489DX just 
ignores the interrupt. When later it is unmasked 
there is no further edge and so 82489DX never 
passes that forgotten edge and that interrupt chan- 
nel becomes unusable after that. 


Response: When the interrupt is masked, logically 
interrupt controller should ignore whatever happens 
there. It is strongly recommended that first 82489DX 
should be unmasked and then the device interrupt 
should be enabled. By this sequence, software can 
ensure that always an edge will occur at the APIC 
input only after the interrupt is unmasked. 


Please contact Intel for platform level specification 
in Multiprocessor system design using APIC. 


UPI-41AH/42AH 


UNIVERSAL PERIPHERAL INTERFACE 


8-BIT'SLAVE MICROCONTROLLER 


• UPI-41: 6 MHz; UPI-42: 
12.5 MHz 
• Fully Compatible 
with all Intel and Most 
• Pin, Software 
and Architecturally 
Other Microprocessor 
Families 


Compatible 
with all UPI-41 and UPI-42 
• Interchangeable 
ROM and OTP EPROM 
Products 
Versions 


• 8-Bit CPU plus ROM/OTP 
EPROM, RAM, 
• Expandable 
I/O 


I/O, Timer/Counter 
and Clock In a 
• Sync Mode Available 
Single Package 


2048 x 8 ROM/OTP, 
256 x 8 RAM on 
• Over 90 Instructions: 
70% Single Byte 
• UPI-42, 
1024 x 8 ROM/OTP, 
128 x 8 
• Available 
In EXPRESS 


RAM on UPI-41, 8-Bit Timer/Counter, 
18 
- 
Standard 
Temperature 
Range 
Programmable 
I/O Pins 
• inteligent 
Programming 
Algorithm 
• One 8-Blt Status and Two Data 
- 
Fast OTP Programming 


Registers 
for Asynchronous 
Siave-to- 
• Available 
In 40-Lead 
Plastic and 44- 
Master 
Interface 
Lead Plastic Leaded 
Chip Carrier 


• DMA, Interrupt, 
or Polled Operation 
Packages 


Supported 
(See 
Packaging 
Spec., 
Order 
#240800..0(1) 


Package 
Type 
P and N 


The Intel UPI-41 AH and UPI-42AH 
are general-purpose 
Universal 
Peripheral 
Interfaces 
that allow the designer 
to develop 
customized 
solutions 
for peripheral 
device 
control. 


They are essentially 
"slave" 
microcontrollers, 
or microcontrollers 
with a slave interface 
included 
on the chip. 
Interface 
registers 
are included 
to enable the UPI device to function 
as a slave peripheral 
controller 
in the MCS 
Modules 
and iAPX family, as well as other 8-, 16-, and 32-bit systems. 


To allow full user flexibility, 
the program 
memory 
is available 
in ROM and One-Time 
Programmable 
EPROM 


(OTP). 
All UPI-41 AH and 
UPI-42AH 
devices 
are fully 
pin compatible 
for easy transition 
from 
prototype 
to 
produ~tion 
level designs . 
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210393-2 
Figure 1. DIP Pin Configuration 
210393-3 


Figure 2. PLCC Pin Configuration 


UPI-C42/UPI-L42 
UNIVERSAL PERIPHERAL INTERFACE 
CHMOS a-BIT SLAVE MICROCONTROLLER 
• Pin, Software and Architecturally 
• One 8-Bit Status and Two Data 
Compatible with all UPI-41 and UPI·42 
Registers for Asynchronous 
Siave-to- 
Products 
Master Interface 
• Low Voltage Operation with the UPI· 
• Fully Compatible with all Intel and Most 
L42 
Other Microprocessor 
Families 
- 
Full 3.3V Support 
• Interchangeable 
ROM and OTP EPROM 
• Integrated 
Auto A20 Gate Support 
Versions 
• Two New Power Down Modes 
• Expandable I/O 
-STANDBY 
-SUSPEND 
• Sync Mode Available 
• Security Bit Code Protection 
Support 
• Over 90 Instructions: 
70% Single Byte 
• 8-Bit CPU plus ROM/OTP EPROM, RAM, 
• Quick Pulse Programming Algorithm 
I/O, Timer/Counter 
and Clock in a 
- 
Fast OTP Programming 
Single Package 
• Available in 40-Lead Plastic, 44-Lead 
• 4096 x 8 ROM/OTP, 256 x 8 RAM 8-Bit 
Plastic Leaded Chip Carrier, and 
Timer/Counter, 
18 Programmable 
I/O 
44-Lead Quad Flat Pack Packages 
Pins 
(See Packaging Spec., Order ;('240800, Package Type P, N, 


• DMA, Interrupt, or Polled Operation 
and S) 


Supported 


The UPI-C42 
is an enhanced 
CHMOS 
version 
of the industry 
standard 
Intel UPI-42 family. 
It is fabricated 
on 
Intel's CHMOS 
III-E process. 
The UPI-C42 is pin, software, 
and architecturally 
compatible 
with the NMOS UPI 
family. The UPI-C42 has all of the same features 
of the NMOS family plus a larger user programmable 
memory 
array (4K), integrated 
auto A20 gate support, 
and lower power consumption 
inherent 
to a CHMOS 
product. 


The UPI-L42 
offers 
the same functionality 
and socket 
compatibility 
as the UPI-C42 
as well as providing 
low 
voltage 
3.3V operation. 


The UPI-C42 is essentially 
a "slave" 
microcontroller, 
or a microcontroller 
with a slave interface 
included 
on the 
chip. Interface 
registers 
are included 
to enable the UPI device to function 
as a slave peripheral 
controller 
in the 
MCS Modules 
and iAPX family, as well as other 8-, 16-, and 32-bit systems. 


To allow full user flexibility, 
the program 
memory 
is available 
in ROM and One-Time 
Programmable 
EPROM 
(OTP). 


UPI-C42 
UPI-L42 
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290414-1 
Figure 
1. DIP Pin 
Configuration 


290414-3 
Figure 
3. QFP Pin Configuration 
290414-2 
Figure 
2. PLCC Pin Configuration 


Refer 
to the 1994 Peripheral 
Components 
Handbook 
for the complete 
data sheet on this device. 


October 
1993 


Order 
Number: 
290414-003 
4-1 09 


• 


8XC51SL/LOW 
VOLTAGE 8XC51SL 
KEYBOARD CONTROLLER 


80C51SL -CPU 
with RAM and I/O; VCC = 5V ± 10% 


81C51SL - 
16K ROM Preprogrammed 
with SystemSoft 
Keyboard Controller 
and Scanner 
Firmware. Vcc = 5V ± 10%. 


83C51SL - 
16K Factory Programmed ROM. Vcc = 5V ± 10%. 
87C51SL - 
16K OTP ROM. Vcc = 5V ± 10%. 


Low Voltage 80C51SL-CPU 
with RAM and I/O; Vcc = 3.3V ±0.3V 


Low Voltage 81C51SL-16K 
ROM Preprogrammed 
with SystemSoft 
Keyboard 
Controller 


and Scanner Firmware. Vcc = 3.3V ±0.3V. 


Low Voltage 83C51SL-16K 
Factory Programmed ROM. Vcc = 3.3V ±0.3V. 


Low Voltage 87C51SL-16K 
OTP ROM. Vcc = 3.3V ±0.3V. 


• Proliferation 
of 8051 Architecture 
• 4-Channel, 8-Blt A/D 


• Complete 8042 Keyboard Control 
• Interface for up to 32 Kbytes of 


Functionality 
External Memory 


• 8042 Style Host Interface 
• Slew Rate Controlled 
I/O Buffers Used 
• Optional Hardware Speedup of 
to Minimize Noise 


GATEA20 and RCL 
• 256 Bytes Data RAM 


• Local 16 x 8 Keyboard Switch Matrix 
• Three Multifunction 
I/O Ports 
Support 
• 10 Interrupt Sources with 6 User- 
• Two Industry Standard Serial Keyboard 
Definable External Interrupts 
Interfaces; Supported via Four High 
• 2 MHz-16 MHz Clock Frequency 
Drive Outputs 


5 LED Drivers 
• 100-Pln PQFP (8XC51SL) 
• 
100-Pin SQFP (Low Voltage 8XC51SL) 
• Low Power CHMOS Technology 


The 8XC51SL, 
based 
on Intel's 
industry-standard 
MCS~ 
51 microcontroller 
family, 
is designed 
for keyboard 
control 
in laptop and notebook 
PCs. The highly integrated 
keyboard 
controller 
incorporates 
an 8042-style 
UPI 
host interface 
with expanded 
memory, 
keyboard 
scan, and power management. 
The 8XC51SL 
supports 
both 
serial and scanned 
keyboard 
interfaces 
and is available 
in pre-programmed 
versions 
to reduce time to market. 


The 
Low Voltage 
8XC51SL 
is the 3.3V version 
optimized 
for even 
further 
power 
savings. 
Throughout 
the 
remainder 
of this document, 
both devices 
will generally 
be referred 
to as 51SL. 


The 
8XC51SL 
is a pin-for-pin 
compatible 
replacement 
for the 8XC51SL-BG. 
It does, 
however 
have 
some 
additional 
functionality. 
Those 
additional 
functions 
are as follows: 


1. 16K OTP ROM: The 8XC51SL-BG 
had only 8K of ROM. 


2. New Register 
Set: The 8XC51SL 
adds a second 
set of host interface 
registers 
available 
for use in support- 
ing power 
management. 
This required 
an additional 
address 
line (A1) for decoding. 
To accommodate 
this, 
one Vcc 
pin was removed. 
However, 
in order to maintain 
compatibility 
with the -BG version, 
an enable 
bit 
for this 
new register 
set was added 
in configuration 
register 
1. This allows 
the 8XC51 SL to be drop 
in 
compatible 
to existing 
8XC51SL-BG 
designs; 
no software 
modifications 
required. 


NOTE: 


The changes 
made to the Vcc 
pins require 
that all three 
Vcc 
pins be properly 
connected. 
Failing to do so 
could result in high leakage 
current 
and possible 
damage 
to the device. 


For the complete 
data sheet on this device, please 
call Intel's 
Literature 
Distribution 
Center 
(800) 548-4725. 


October 1993 
4-110 
Order Number: 272271-002 


iPLD610 
FAST1~MACROCELLCMOSPLD 


Function, 
Pin, and JEDEC Compatible 
with EP600, EP610, 


EP610A, EP630, PALCE610, 
85C060 and 5C060 PLDs 


• tpD 10 ns, 100 MHz Counter Frequency 
• Programmable Clock System with 2 
(wllnternal 
Feedback) 
Synchronous 
Clocks and Asynchro- 
• Icc = 105 mA max. @ 1 MHz 
nous Clocking Option on all Registers 
• Programmable 
Low-Power Option for 
• Extensive Software and Programming 


"Standby" 
Operation; 20 I-'-ATyp. in 
Support via Intel and Third Party Tools 


Standby Mode 
• 1-Micron CMOS· III-E EPROM 


• Clocking Speed Same as -7 
ns PAL· 
Technology 


(74 MHz w/External 
Feedback) 
• Programmable "Security 
Bit" Allows 
• 16 Macrocells with Programmable 
I/O 
Total Protection 
of Proprietary 
Designs 


Architecture 
(Register/Combinatorial). 
• 100% Generically Tested Logic Array 
Registers Configurable 
as D/T/JK/RS 
• Available in 300-mil 24-Pin PDlP, CerDIP 
Types 
and 28-Pin PLCC Packages 
• Up to 20 Inputs (4 Dedicated and 16 
(See Packaging Specifications Order Number #240800-001, 


I/O) 
Package Type Nand P) 
• 8 P-Terms, Selectable SOP Invert, Clear 
•• 


and OE P-Terms for Each Macrocell 
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Figure 1.IPLD610 Pin Configurations 
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to the 1994 Programmable 
Logic Handbook 
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data sheet 
on this device. 
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iPLD910 
FAST 24-MACROCELL 
CMOS PLD 


Function, Pin, and JEDEC Compatible 
with 
EP900, EP910, EP910A, 8SC090 and SC090 
• tpD 12 ns, 62.5 MHz w/Feedback, 
Clock 
• Programmable 
Clock System 
with 2 
to Output 
8 ns 
Synchronous 
Clocks and Asynch- 
• lee = 150 mA Max 
@ 1 MHz 
ronous Clocking 
Option on all 
Macrocells 
• Programmable 
Low-Power 
Option for 
Extensive 
Software 
and Programming 
"Standby" 
Operation; 
60 p.A Typ. in 
• 
Standby 
Mode 
Support 
via Intel and Third-Party 
Tools 
• 24 Macrocells 
with Programmable 
1/0 
• 1-Mlcron 
CMOS IIIE" EPROM 


Architecture 
(Register IComblnatorlal). 
Technology 


Registers 
Configurable 
as D/T/JK/RS 
• Programmable 
Security 
Bit Allows Total 
Types 
Protection 
of Proprietary 
Designs 
• Up to 36 Inputs (12 Dedicated 
and 24 
• 100% Generically 
Tested 
Logic Array 
1/0) 
• Available 
in 40-Pin PDIP, CERDIP 
and 
• 8 P-terms, 
Selectable 
SOP Invert, Clear 
44-Pin PLCC Packages 
and OE P·terms 
for Each Macrocell 
(See Packaging 
Spec., 
Order 
Number 
240800, Package 


Type 
Nand 
P) 
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• 
35 
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Figure 1. iPLD910 
Pinout Diagrams 
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iPLD22V10-10, 
-15, -25 
HIGH PERFORMANCE 
10-MACROCELL 
CMOS PLD 


• 
tpD 10 ns, 95.2 MHz with Feedback, 
100 MHz with No Feedback 


• 
Typical lee = 90 mA 
@ 15 MHz 


• 
12 Dedicated Inputs and 10 I/O Pins 


• 
10 Macrocells with Programmable 
110 
Architecture 
(Register/Combinatorial) 


• 
Variable P-terms-Up 
to 16 per 
Macrocell, Selectable Output Polarity, 
Separate Output Enable P·term 


• 
Global Asynchronous 
Clear and 
Synchronous 
Preset P·terms 


• 
1·Mlcron CHMOS IIIE EPROM 
Technology 


• 
Programmable 
"Security 
Bit" Allows 
Total Protection 
of Proprietary 
Designs 


• 
100% Generically Tested Logic Array 


• 
Available In 300·mll 24·Pln PDIP and 
28-Pln PLCC Packages 


(See 
Packaging 
Spec., 
Order 
Number 
240800. 


Package 
Type 
Nand 
P) 
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Refer 
to the 
1994 Programmable 
Logic Handbook 
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data sheet on this device. 
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IrA' "u 
10 ns FLEXlogic 
FPGA Family WITH SRAM OPTION 


• 
High Performance 
FPGA (Field 
Programmable Gate Array) 
- 
Deterministic 
10 ns Pln-to-Pln 
Propagation Delays 
- 
80 MHz System Clock Frequency 


• 
2,500 Equivalent Logic Gates or up to 
5,120 Bits of SRAM 


• 
0.8,....CMOS Technology 
- 
Power Management Options 
Minimize Active Power Consumption 
(1 mA/MHz) 
-1 
mA Standby Version Available 


• 
JTAG 1149.1 Compatible Test Port 
- 
Supports Boundary Scan and In- 
Circuit Reconfigurationl 
Programming 


• 
Four Conflgurable 
Function Blocks 
(CFBs) Linked by a 100% Connectable 
Matrix 
-Improves 
Fitting of Complex Designs 


• 
Any CFB Can Be Either 24V10 Logic or 
SRAM Block 
- 
Up to 40 Complex Macrocells 
-128 
x 10 SRAM Configuration 
- 
CFB Selectable 3.3V or 5V Outputs 
- 
Open-Drain Output Option 


• 
24V10 Macrocell Features 
- 
Dual Feedback on All 110 Pins 
- 
Allocation 
Supports up to 16 Product 
Terms per Macrocell with No 
Performance Penalty 
- 
12 Clocking Options 
- 
Flexible Preset/Clear 
Options 
- 
Selectable D/T Flip-Flops 
- 
Fast 12-Blt Identity Compare Option 


• 
Supported by Industry Standard Design 
and Programming Tools 


43 
290460-1 


Pins 
Package 
Macrocells 
I/O 
Inputs 
Clocks 
JTAG/Vpp 
Vcc 
GND 


44 
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40 
30 
0 
2 
5 
3 
4 


68 
PLCC 
40 
40 
10 
2 
5 
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The 89CI24FX Data/Fax 
Modem Chip Set Applica- 
tion Note provides the end user with applications and 
layout guidelines to reduce power consumption to a 
minimum when in the Power Down Mode. 


When the 89CI24FX is in the power down mode, the 
microcontroller (89CI26FX) consumes very little pow- 
er (less than 0.5 mW). However, the external memory, 
voltage regulators and peripherals draw excess current 
that makes the overall system power consumption more 
than 80 mW. This application note describes a method 
to reduce overall system power consumption to less 
than I mW when the 89CI24FX is in the power down 
mode. Adding a power down feature in the microcon- 
troller and reducing power sink to a minimum accom- 
plishes this goal. 
. 


Three steps are required to reduce the overall system 
power consumption in the power down mode: 


1. Detect power down in the microcontroller 
and 
isolate the power source from potential current 
sink from other components. 


2. Inhibit 
the DC path from the power supply, 
through other components, to ground when the 
microcontroller is powered down. 


3. Solve current drift problems due to floating inputs 
when power is removed from peripherals. 


Monitoring the clock output (CLKOUT, pin 65) from 
the microcontroller detects the power down condition. 
The CLKOUT pin is held high when the controller is 
in the power down mode. When power down is detect- 
ed, the detector shuts off the + 5V supply to all compo- 
nents except the microcontroller, 
RAM, 
and logic 
gates. The detector also shuts off power to the voltage 
regulators. 


When the power down detector shuts off the bipolar 
switches, some of the device input pins become current 
sinks and drain current from the controller output pins 
when the output pins are high. These controller output 
pins are the CLKOUT and SCLK outputs. To elimi- 
nate these DC paths in the power down mode, add an 
inverter from the controller clock output (CLKOUT) 
to the AFE clock input (CLKIN) 
and use an AND 
gate to change the SCLK output to the AFE to a low. 


When the controller is in the power down mode, the 
SDATA pin becomes a floating input that can draw 
current in excess of 300 fJ-A.Placing a 510 Kn resistor 
between SDATA pin and ground solves this problem. 
Using a 100 Kn resistor or lower may impede circuit 
functionality. 


89C126FX 
89127 


SDATA 
SDATA 


51DK 


-::.=- 


Figure 1. Placement 
of Current 
Drain 
Blocking 
Resistor 


The reduced power drain feature design trade-off, be- 
sides adding circuit complexity, is an additional 20 mW 
power consumption when the 89CI24FX is active. 


• 


};' 


0.33 J'F 
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The evolution of the floppy has been marked in little 
over a decade by a significant increase in capacity ac- 
companied by a noticeable decrease in the form factor 
from the early 8 inch floppy disks to the present day 3.5 
inch floppy disks. This decade will also be remarkable 
as OEMs adopt "Super" dense floppies. 


The most commonly seen floppies today are invariably 
one of the form factors - the 5.25" or the 3.5" . Each 
form factor has several associated capacity ranges. The 
5.25" floppies available are: 180 KB (single density), 
360 KB (double density) and 1.2 MB (high density). 
The 3.5" floppies available are: 720 KB (double densi- 
ty) and 1.44 MB (high density). The emerging super 
dense floppies will evolve on the installed base of 3.5" 
floppies. The latest member of this set is the 2.88 MB 
(extra density) floppy, pioneered by Toshiba. The cor- 
nerstone of market acceptance of newer drives is com- 
patibility to the older family. The 2.88 MB (formatted) 
floppy drive allows the user to format, read from and 
write to the lower density diskettes. 


As programs and data files get bigger, the demand for 
higher capacity floppies becomes obvious. There are 
several 3.5" higher density drives available from vari- 
ous vendors with capacities well into the 20 MB range. 
NEC has introduced a 13 MB drive and companies 
such as Insite have introduced 20 MB drives. Both 
drives require servo-mechanisms to accurately position 
the head over the right track. NEC's drive has the stan- 
dard floppy drive interface whereas Insite's interface is 
SCSI based. The market for these floppy drives will 
remain a niche unless they receive more OEM support. 


Initiated by Toshiba's research and innovation of the 
higher density 4 MB floppy disk media, the market is 
headed towards the super dense floppy drive. After 


IBM's endorsement of the 4 MB (unformatted) floppy 
disk drives on their PS/2 model 57 and PS/2 model 90, 
several OEMs have shown a growing interest in "su- 
per" dense floppy disk drives. The latest DOS 5.0 sup- 
ports the new 4 MB floppy media and BIOS vendors 
like Pheonix, AMI, Award, Quadtel, System Soft, and 
Microid all support the newer 4 MB floppy media. 


An important consideration to implement the 4 MB 
floppy drive is the floppy disk controller. Intel's highly 
integrated floppy disk controller, 82077AA/SL, has led 
the market in supporting the 4 MB floppy drive. Two 
ingredients are necessary to fully support these drives: 
1 Mbps transfer rate and the perpendicular recording 
mode. This paper deals with a discussion of what the 
perpendicular mode is and how can a 4 MB floppy disk 
drive 
be 
implemented 
in 
a 
system 
using 
the 
82077AA/SL. 


Toshiba has taken the 2 MB floppy and doubled the 
storage capacity by doubling the number of bits per 
track. Toshiba achieved this by an innovative magnetic 
recording mode, called the vertical or the perpendicular 
recording mode. This mode utilizes magnetization per- 
pendicular to the recording medium plane. This is in 
contrast to the current mode of longitudinal recording 
which uses the magnetization parallel to the recording 
plane. By making the bits stand vertical as opposed to 
on their side, recording density is effectively doubled, 
Figure 1. The new perpendicular mode of recording not 
only produces sharp magnetization transitions neces- 
sary at higher recording densities, but is also more sta- 
ble. 


The 4 MB disks utilize barium ferrite coated substrates 
to achieve perpendicular mode of magnetization. Cur- 
rent disks use cobalt iron oxide (Co-g-Fe203) coating 
for longitudinal recording. The barium ferrite ensures 
good head to medium contact, stable output and dura- 
bility in terms of long use. High coercivity is required 
to attain high recording density for a longitudinal re- 
cording medium (coercivity specification of a disk re- 
fers to the magnetic field strength required to make an 
accurate 
record on the disk). A conventional 
head 
could not be used in this case; however, the barium 


ferrite disk has low coercivity and the conventional fer- 
rite head can be used. The new combination heads in- 
clude a pre-erase mechanism, i.e., the ferrite ring heads 
containing erase elements followed by the read/write 
head. These erase elements have deep overwrite pene- 
tration and ensure complete erasure for writing new 
data. The distance between the erase elements and the 
read/write 
head is about 200mm. This distance is im- 
portant from the floppy disk controller point of view 
and will be discussed in later sections. 
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PERPENDICULAR 
DRIVE 
FORMAT 
AND SPECIFICATION 


Figures 2a and 2b show the IBM drive format for both 
double density and perpendicular modes of recording. 
The main difference in recording format is the length of 
Gap2 between the ID field an" the Data field. The 
main reason for the increased Gap2 length is the pre- 
erase head preceding the read/write head on the newer 
4 MB floppy drives. The size of the data field is main- 
tained at 512 KBytes standard. The increase in the ca- 
pacity is implemented by increasing the number of sec- 
tors from 18 to 36. Table I shows the specifications of 
the various capacity 3.5" drives. 


The 
current 
82077AA/SL 
parts 
contain 
the 
"en- 
hanced" perpendicular 
mode command as shown in 
Figure 3. This is a two byte command with the first 
byte being the command code (OxI2H). The 2nd byte 
contains the parameters required to enable perpendicu- 
lar mode recording. The former command (in the older 
82077 parts) included only the WGATE and GAP bits. 
This command is compatible to the older mode where 
only the two LSBs are written. The enhanced mode 
allows system designers to designate specific drives as 
perpendicular recording drives. The second byte will be 
referenced as the PR[0:7] byte for ease of discussion. 
The following discusses the use of the enhanced perpen- 
dicular recording mode. 
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PERPENDICULAR MODE COMMAND 
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0 
1 
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Figure 3. Perpendicular Mode Command 
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Various 
Parameters 
Used 
5.25" 
5.25" 
3.5" 
3.5" 
3.5" 
In the Different 
Kinds of FDDs. 
360 KB 
. 1.2MB 
720 KB 
1.44 MB 
2.88 MB 


Number of Cylinders 
40 
80 
80 
80 
80 


Sectors/Track 
9 
15 
9 
18 
36 


Formatted 
Capacity 
354.KB 
1.2 MB 
720 KB 
1.44 MB 
2.88MB 


Unformatted 
Capacity 
360 KB 
1.6MB 
1 MB 
2MB 
4MB 


Rotation 
Speed (rpm) 
XT 
300 
360 
300 
300 
300 
AT 
360 


Track Density (tpi) 
48 
96 
135 
135 
135 


Recording 
Density (bpi) 
5876 
9870 
8717 
17432 
34868 


Data Transfer 
Rate (Mbps) 
XT 
0.25 
0.5 
0.25 
0.5 
1 
AT 
0.30 


Gap Length for Read/Write 
42 
42 
27 
27 
56 


Gap Length for Format 
80 
80 
84 
84 
83 


Sector Size 
512 KB 
512 KB 
512 KB 
512 KB 
512 KB 


Density Notation 
DO/OS 
HD/DS 
DO/OS 
HD/DS 
ED/OS 


The following describes the various functions of the 
programmed bits in the PRo 
OW 
If this bit is not set high, all PR[2:S] are 
ignored. In other words, if OW = 0, only 
GAP and WGATE are considered. In order 
to select a drive as perpendicular, it is neces- 
sary to set OW = I and select the Dn bit. 


Dn 
This refers to the drive specificationbits and 
corresponds to PR[2:S]. These bits are con- 
sidered only if OW = 1.During the READ/ 
WRITE/FORMAT 
command, the drive se- 
lected in these commands is compared to 
Dn. If the bits match then perpendicular 
mode will be enabled for that drive. For ex- 
ample, if DOis set then drive 0 will be con- 
figured for perpendicular mode. 


GAP 
This alters the Gap2 length as required by 
the perpendicular mode format. 


WGATE Write gate alters timing of WE to allow for 
pre-erase loads in perpendicular drives. 


The VCOEN timing and the length of the Gap2 field 
(explained above) can be altered to accommodate the 


unique requirements of the 4 MB floppy drives by GAP 
and WGATE bits of the PRoTable 2 describes the ef- 
fects of the GAP and WGATE bits for the perpendicu- 
lar command. 


82077AA/SL's 
PERPENDICULAR 
MODE SUPPORT 


The 82077AA and 82077SLboth support 4 MB record- 
ing mode. The 82077SL has power management fea- 
tures included as well. Both AA and SL product lines 
have three versions each out of which two of the ver- 
sions support the 4 MB floppy drives. The 82077AA-I, 
82077AA, 82077SL, and 82077SL-I all support the 
4 MB floppy drives. A single command puts the 
82077AA/SL into the perpendicular mode. This mode 
also requires the data rate to be set at I Mbps. The 
FIFO that is unique to Intel's 82077AA/SL parts may 
become necessary to remove the host interface bottle- 
neck due to the higher data rate. The 4 MB floppy disk 
drives are downward compatible to I MB and 2 MB 
floppy diskettes. The following discussion explains the 
implications of the new 4 MB combination head and 
the functionality of the perpendicular mode command. 
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Length 
Portion 
Gap2VCO 
VCOLow 
of Gap2 


GAP 
WGATE 
Mode 
Time after 
of Gap2 
Written 
by 
Low Time 
Format 
for Read 
Index Pulse 
Field 
Write Data 
Operations 
Operation 


0 
0 
Conventional 
33 Bytes 
22 Bytes 
o Bytes 
24 Bytes 


0 
1 
Perpendicular 
33 Bytes 
22 Bytes 
19 Bytes 
24 Bytes 
(Data Rate = 500 kbps) 


1 
0 
Conventional 
33 Bytes 
22 Bytes 
o Bytes 
24 Bytes 


1 
1 
Perpendicular 
33 Bytes 
41 Bytes 
38 Bytes 
43 Bytes 


The implementation 
of 4 MB drives requires under- 
standing the Gap2 (see Figures 2a and 2b) and yea 
timing requirements unique to these drives. These new 
requirements are dictated by the design of the "combi- 
nl!-tionhead" in these drives. Rewriting of disks in the 
4 MB drives requires a pre-erase gap to erase the mag- 
netic flux on the disk preceding the writing by the 
read/write gap. The read/write gap in the 4 MB drive 
does not have sufficient penetration (as shown in Figure 
4a) to overwrite the existing data. In the conventional 
drives, the read/write 
gap had sufficient depth and 
could effectively overwrite the older data as depicted in 
Figure 4b. It must be noted that it is necessary to write 


the conventional 2 MB media in the 4 MB drive at 
500 Kbps perpendicular mode. This ensures proper era- 
sure of existing data and reliable write of the new data. 
The pre-erase gap in the 4 MB floppy drives is activated 
only during format and write commands. Both the pre- 
erase gap and read/write gap are activated at the same 
time. 


As shown in Figure 4a, the pre-erase gap precedes the 
read/write 
gap by 200mm. This distance translated to 
bytes is about 38 bytes at a data rate of 1 Mbps and 19 
bytes at 500 Kbps. Whenever the read/write gap is en- 
abled by the Write Gate signal the pre-erase gap is acti- 
vated at the same time. 


GAP2 = 41 
bytes 
of 4E 
j.-- 200 }'m = 38 
bytes 
@ 1 t.4bps 


In conventional drives, the Write Gate is asserted at the 
beginning of the sync field, i.e., when the read/write is 
at the beginning of the data field. The controller then 
writes the new sync field, data address mark, data field 
and eRe 
(see Figure 2a). With the combination head, 
the read/write gap must be activated in the Gap2 field 
to ensure proper write of the new sync field. To accom- 
modate both the distance between the pre-erase gap and 
read/write 
gap and the head activation and deactiva- 
tion time, the Gap2 field is expanded to a length of 41 
bytes at I Mbps (see Figure 2b). Since the bit density is 
proportional to the data rate, 19bytes will be written in 
the Gap2 field at SOOKbps data rate in the perpendicu- 
lar mode. 


On the read back by the 82077AA/SL, the controller 
must begin the synchronization at the beginning of the 
sync field. For conventional mode, the internal PLL 
veo 
is enabled (VeOEN) 
approximately 
24 bytes 
from the start of the Gap2 field. However, at 1 Mbps 
perpendicular mode the veOEN 
goes active after 43 
bytes to accomodate the increased Gap2 field size. For 
each case, a 2 byte cushion is maintained from the be- 
ginning of the sync field to avoid write splices caused 
by motor speed variation. 


It should be noted that none of the alterations in Gap2 
size, veo 
timing or Write Gate timing affect the nor- 


mal program flow. Once the perpendicular command is 
invoked, 82077AA/SL behaviour from the user stand- 
point is unchanged. 


PROGRAMMING 
PERPENDICULAR 
MODE 


Figures Sa and Sb show a flowchart on how the perpen- 
dicular 
recording 
mode 
is 
implemented 
on 
the 
82077AA/SL. The perpendicular mode command can 
be issued during initialization. As shown in Figure Sa 
the perpendicular command stores the PR value inter- 
nally. This value is used during the data transfer com- 
mands for configuration in order to deal with the per- 
pendicular drives. Table 2 shows how the Gap2 length, 
VeOEN timing or Write Gate timing is affected. The 
OW bit is also tested for in this part of the loop. The 
enhanced perpendicular mode is enabled by setting the 
OW = 1, setting the Dn bits corresponding to the in- 
stalled perpendicular drive high and leaving PR[O:1l = 
'00'. 


As shown in Figure 5b, the Gap2 length is initially set 
to the conventional 
length of 22 bytes. Next 
the 
PR[O:I] bits (GAP, WGATE) are checked if they are 
set to '00'. If the PR[O:1l bits are set to '10' then, per- 
pendicular mode is disabled and conventional mode is 
retained. If the PR[O:I] = '01' or '11' the VeOEN is 
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set to activate 43 bytes or 24 bytes from the start of the 
Gap2 field, depending on the value as shown in Table 2. 
After this, PR[O:1] = '11' is checked; if not true (pro- 
grammed '01') the program is exited with only the 
VCOEN timing being set for perpendicular mode. If 
true, however, the Gap2 length is set up for perpendic- 
ular mode (note: this is done independent of the data 
rate). It must be noted that if the PR[O:I] bits are set to 
'II' then it is up to the user to disable precompensation 
before accessing perpendicular drives. The other branch 
of the flowchart refers to setting of PR[O:I] to '00'. In 
this case, the perpendicular command will have the fol- 
lowing effect: 
I. If any of the On bits in PR[2:S] programmed high, 
then 
precompensation 
is automatically 
disabled 


(0 ns is selected for the specified drive regardless of 
the data rate) and VCOEN is set to activate appro- 
priately. All the bits that are set low will enable the 
82077 to be configured for conventional mode, i.e., 
exit the program without modifications (shown Fig- 
ure Sb). 


2. Next the data rate is checked for 1 Mbps. If the data 
rate is at 1 Mbps, then Gap2 length is set to 41 
bytes, otherwise, the program is exited without set- 
ting up the Gap2 to 41 bytes. 


It must be noted that if PR[2:S] are to be recognized in 
the command the OW bit must be set high. If this bit is 
low, setting of On bits will have no effect. Setting the 
OW bit will enable the storage of the On bit. Also set- 
ting PR[O:I] to any other value than '00' will override 
anything written in the On bits. In other words, setting 
PR[O:I] to a value other than '00' enables the effect of 
that for all drives. It must be noted that if PR[O:1] bits 
are set to a value other than '00' then it is recommend- 
ed not to use the enhanced command mode, i.e., all 
other bits should be zero. Consider the following exam- 
ples: 
a. PR[O:7] = Ox84; This is the way to use the com- 


mand in the enhanced mode. In this case, the OW 
= 1 and DO is set high. During the data transfer 
command, if DO is selected it will be automatically 
configured for perpendicular 
mode. If 01 is ac- 
cessed, however, it will be configured for conven- 
tional mode. Similarly, if PR[O:7] = Ox88then 01 
is configured for perpendicular mode and DOis con- 
figured for conventional mode. Software resets do 
not clear this mode. 


b. PR[O:7] = Ox03; This is the way to use the com- 


mand in the old mode. If the user decides to use this 
mode, then it must be noted that the command has 
to be issued before every data transfer command. 
Also when used this way, all the drives are config- 
ured for perpendicular 
mode. The user must also 


remember to disable precompensation 
and set the 


data rate to 1 Mbps while accessing the perpendicu- 
lar drive in the system. Any software reset clears the 
command. 


c. PR[O:7] = Ox87;In this case, the OW = I, DO = 1 


and PR[O:I] 
= 11. This may be called a mixed 


mode and should be refrained from usage. This is 
similar to setting PR[O:7] = Ox03, because setting 
PR[O:I] high overrides automatic configuration. In 
this case the user has to be aware that precompensa- 
tion must be disabled and the data rate must be set 
to 1 Mbps while accessing drive o. After software 
reset, bits GAP and WGATE will be cleared, but 
OW and DO will retain their previously set values. 
In other words, after software reset, the part will see 
PR[O:7] = Ox84.Evidently, this would cause prob- 
lems and, therefore, it is recommended this mode 
not be used. 


d. PR[O:7] = Ox80;In this case, the OW = I, On = 


o and PR[O:I] = 00. This has the effect of clearing 
the perpendicular mode command without doing a 
hardware reset. Another way to do this would be to 
set PR[O:7] = Ox02;this can then be used to tempo- 
rarily 
disable perpendicular 
mode 
configuration 


without affecting the previously programmed 
On 


values. Software reset following this will reenable 
the previously programmed enhanced mode com- 
mand. 


Using the enhanced perpendicular command removes 
the requirement of issuing the perpendicular command 
for each data transfer command and manually setting 
the perpendicular configuration. 


"Software" RESETs (via OOR or OSR registers) will 
only clear the PR[O:I] values to '0'. On bits will retain 
their previously programmed values. "Hardware" 
RE- 


SETs will clear all the programmed bits including OW 
and On bits to '0'. The status of these bits can be deter- 
mined by issuing the dumpreg command and checking 
the 8th result byte. This byte will contain the pro- 
grammed values of the On and PR[O:I] bits as shown 
in Figure 6. The OW bit is not returned in this result 
byte. 
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There is currently no industry-wide standard for the 
FDC to FDD interface. There are numerous floppy 
drive vendors, each with their own modes and interface 
pins to enable 4 MB perpendicular mode. The drive 
interface not only varies from manufacturer to manu- 
facturer but also within a manufacturer's product line. 
The differences on the interface mainly originate from 
configuring the floppy drive into the 4 MB mode. De- 
pending on the drive, the differences can create prob- 
lems of daisy-chaining a 4 MB drive with the standard 
I MB and 2 MB drives. Of course, for laptops this is 
not a problem since most of them use a single floppy 
drive. Lack of an,industry standard makes it necessary 
to look at each drive and build a interface for that par- 
ticular drive. 


The following is a brief discussion about some of the 
floppy drives availablein the market and how these can 
be interfaced with the 82077AA/SL. It is important to 
note that although a manufacturer's name may be given 
in connection with the interface described, Intel does 
not guarantee that the interface discussed will apply to 
all the drives from that manufacturer. The main goal is 
introduce 
to 
the 
reader 
how 
to 
interface 
the 
82077AA/SL with a 4 MB floppy drive. 


Previously, for the conventional 1 MB and 2 MB AT 
mode drives, a single Density Select input was used by 
floppy drives to select between high density and low 
density drives. A high on this input enabled high densi- 
ty operation (500 Kbps) whereas a low enabled low 
density operation (300 Kbps/250 Kbps). This signal 


was asserted high or low by the floppy disk controller 
depending on the data rate programmed. For the 4 MB 
operation, there are two inputs defined by the floppy 
drive manufacturers. The polarity of these inputs en- 
ables the selected density operation. Implementing this 
requires at least I new pin to be defined on the FDC- 
FDD interface. Most floppy vendors have elected to 
take pin 2 (originally density select) and redefine the 
polarity to conform to one of these new density select 
inputs and another pin to be the other density select 
input. However, the new density s~lect on pin 2 is not 
compatible to the old density select input in many of 
the floppy drives. This precludes the user from daisy 
chaining 4 MB drives with conventional drives. Anoth- 
er problem is that the second density select pin varies 
on its location on the FDC-FDD interface from drive 
to drive. 
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The way that the BIOS determines what type of disk- 
ette is in what type of drive is by trial and error. The 
system tries to read the diskette at 250 Kbps; if it fails 
then it will set the data rate to higher value and retry. 
The BIOS does this until the right data rate is selected. 
This method will still be implemented for the 4 MB 
drives by some BIOS vendors. However, the 4 MB 
drives available today also have two media sense ID 
pins that relate to the user what type of media is pres- 
ent in the floppy drive. This information will also re- 
quire two pins on the FDC-FDD interface. The loca- 
tion of these pins is once again variable from drive to 
drive. 


dardization problem by including an auto configuration 
in the drive. In these cases, the type of floppy put into 
the drive is sensed by the hole (each 4/2/1 MB diskette 
has a hole in different locations identifying it) on the 
diskette. Then the drive automatically sets itself up for 
this mode. The BIOS must obviously set up the floppy 
disk controller for the correct data rate which could be 
done if the media sense ID was read and decoded as to 
the data rate. Due to lack of extra pins on the even side 
of the floppy connector the newer locations of some of 
the functions are migrating to the odd pins (previously 
all grounded). 
Some drive manufacturers 
have even 


made this configurable via jumpers. For instance, the 
new TEAC drives have a huge potpourri of configura- 
tions that would satisfy the appetite of some of the most 
finicky system interfaces. 


DRATEO and DRATEI 
(pins 28 and 29 respectively) 


which directly reflect the data rate programmed in the 
DSR and CCR registers. These two pins can be used to 
select the correct density on the drive. These two can 
also be used with the combination ofDENSEL 
to select 


the 
correct 
data 
rate. 
At 
the 
present 
time 
the 


82077AA/SL does not support media sense ID. How- 
ever, the user could easily make it readable directly by 
BIOS. The following is a discussion on what combina- 
tion of DRATEO, DRATEI, 
and DENSEL could be 


used to interface to some of the currently available flop- 
py drives. 
. 


These were among the first 4 MB drives available in the market. Each of them has a mode select input on pins 2 and 
6. The polarity required for each different data rate is as shown below: 


Data 
Capacity 
DRATE1 
DRATEO 
MODSELO 
MODSEL1 
Rate 
pin2 
pin6 


1 Mbps 
4MB 
1 
1 
1 
0 


500 Kbps 
2MB 
0 
0 
0 
1 


300 Kbps/ 
4MB 
0 
1 
1 
1 
1 Mbps 


250 Kbps 
1 MB 
1 
0 
0 
0 


It is clear from the above that DRA TEO = MODSELO and MODSELI = DRATE 1#. This would mean taking 
the drate signals onto pins 2 and 6 of the FDC-FDD interface. Unfortunately this solution requires an inverting gate. 
TEAC has recently, however, come out with a new version called TEAC 235J-3653. On this drive there are a 
number of possible configurations into which the drive can be put into, however, only the best way to interface to the 
82077AA/SL will be discussed. The requirements are as shown below. This shows that HDIN = DENSEL (original 
signal for conventional drives) and EDIN = DRATEO. As suggested in the TEAC spec for method I, the straps 
connected are MSC, HI2 (setS"HDIN on pin 2), DC34 and EI6 (sets EDIN on pin 6). Pins 4, 29, and 33 are left 
open. Since pin 2 has the same polarity as the conventional drive requirement and the secondary input is connected 
via pin 6 (no connect on the conventional drives) daisy chaining this TEAC drive with a conventional drive does not 
cause any incompatibility. Figure 7 shows how the TEAC can be connected to the 82077AA/SL. It also shows daisy 
chaining of the TEAC drive with a conventional drive. 


Data 
Capacity 
DENSEL 
DRATE1 
DRATEO 
HDIN 
EDIN 
Rate 
pin 2 
pin6 


1 Mbps 
4MB 
1 
1 
1 
X 
1 


500 Kbps 
2MB 
1 
0 
0 
1 
0 


300 Kbps/ 
4MB 
0 
0 
1 
X 
1 
1 Mbps 


250 Kbps 
1 MB 
0 
1 
0 
0 
0 
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This is Panasonic's new drive and has the HDIN signal on pin 2 and EDIN signal on pin 6. The requirements are 
shown below. This type of interface allows for daisy chaining the Panasonic drive with a conventional drive. The 
DENSEL signal can be connected to pin 2 and the DRATEO should be connected to pin 6. 


Data 
Capacity 
DENSEL 
DRATE1 
DRATEO 
HDIN 
EDIN 
Rate 
pin 2 
pin 6 


1 Mbps 
4MB 
1 
1 
1 
1 
1 


500 Kbps 
2MB 
1 
0 
0 
1 
0 


300 Kbps/ 
4MB 
0 
0 
1 
0 
1 
1 Mbps 


250 Kbps 
1 MB 
0 
1 
0 
0 
0 


There are two models of this drive. The 252UG has DENSEL! 
on pin 2 and DENSELO on pin 33, whereas the 
788UG has DENSELO located on pin 2 and DENSEL! located on pin 6. Via jumpers, it is possible to configure the 
drives to different polarity for the density select line. The following table shows the configuration for the 252UG in 
which jumper setting is 2MS = IIF and 4 MS = IfF. 


Data 
Capacity 
DENSEL 
DRATE1 
DRATEO 
DENSEL1 
DENSELO 
Rate 
pin 2 
pin 33 


1 Mbps 
4MB 
1 
1 
1 
1 
1 


500 Kbps 
2MB 
1 
0 
0 
1 
0 


300 Kbps/ 
4MB 
0 
0 
1 
0 
1 
1 Mbps 


250 Kbps 
1 MB 
0 
1 
0 
0 
0 


The correct connection requirement is: DENSEL (from 82077AA/SL) = DENSEL! and DRATEO = DENSELO. 
Although there are other configurations, this provides the best one, since daisy chaining is possible without any 
problem. 


This drive has 3 different modes of operation. Mode B is the best and is similar to Mitsubishi's drives as described 
above. In this mode, HDI signal is connected to pin 2 and EDI is connected to pin 33. Mode B is enabled by 
inserting jumpers across 3-4 and 7-8 (SSOI B block) and 1-2 and 3-4 (SS03 block) for the drive with the power 
separated type (i.e., a connector for the floppy signals and another one for power supply) of 34-pin connector. 


Data 
Capacity 
DENSEL 
DRATE1 
DRATEO 
HDI 
EDI 
Rate 
pin 2 
pin 33 


1 Mbps 
4MB 
1 
1 
1 
1 
1 


500 Kbps 
2MB 
1 
0 
0 
1 
0 


300 Kbps/ 
4MB 
0 
0 
1 
0 
1 
1 Mbps 


250 Kbps 
1 MB 
0 
1 
0 
0 
0 


As demonstrated by the table, HDI = DENSEL and EDI = DRATEO. These connections would ensure daisy 
chaining capability without any problems. 


The dash 14 and 15are two drives from Sony that handle 4 MB requirements. The MP-F40W-14 has the DENSITY 
SELECT I, DENSITY SELECT 0 on pins 2 and 33 respectively, whereas the MP-F40W-15 has the DENSITY 
SELECT I, DENSITY SELECT 0 on pins 2 and 6 respectively. As it is obvious from the table below, daisy chaining 
is easily done if the 82077AAlSL is connected in the PS/2 mode (by tying IDENT low) with either type of drive, the 
only difference being the location of DENSITY SELECT O. 


Data 
DENSEL 
DENSITY DENSITY 


Rate 
Capacity 
PS/2 mode 
DRATE1 DRATEO SELECT1 SELECTO 
(IDENT = 0) 
pin 2 
pin 6/33 


1 Mbps 
4MB 
0 
1 
1 
0 
1 


500 Kbps 
2MB 
0 
0 
0 
0 
0 


300 Kbps/ 
4MB 
1 
0 
1 
1 
1 
1 Mbps 


250 Kbps 
1 MB 
1 
1 
0 
1 
0 


If the drive is used in the PS/2 mode, then DENSITY SELECT I = DENSEL and DENSITY SELECTO = 
DRATEO. To use the drive in AT mode, DENSITY 
SELECT I = DRATEI 
and DENSITY 
SELECTO = 
DRATEO, as shown below. However, daisy chaining is not possible. 


Data 
DENSEL 
DENSITY DENSITY 


Rate 
Capacity 
PS/2 mode 
DRATE1 DRATEO SELECT1 SELECTO 
(IDENT = 0) 
pin 2 
pin 6/33 


1 Mbps 
4MB 
0 
1 
1 
1 
1 


500 Kbps 
2MB 
0 
0 
0 
0 
0 


300 Kbps/ 
4MB 
1 
0 
1 
0 
1 
1 Mbps 


250 Kbps 
1 MB 
1 
1 
0 
1 
0 


Toshiba MB drive has the HD mode selection on pin 6 and ED mode selection on pin 2. This causes daisy chaining 
problems with conventional drives as shown in the figure below: 


Data 
Capacity 
DENSEL 
DRATE1 
DRATEO ED Mode 
HDMode 
Rate 
pin 2 
pin 6 


1 Mbps 
4MB 
1 
1 
1 
1 
1 


500 Kbps 
2MB 
1 
0 
0 
0 
1 


300 Kbps/ 
4MB 
0 
0 
1 
1 
0 
1 Mbps 


250 Kbps 
1 MB 
0 
1 
0 
0 
0 
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This section presents a design application of a PC/AT 
compatible floppy disk controller. The 82077SL inte- 
grates the entire PC/AT controller design with the ex- 
ception of the address decode on a single chip. The 
schematic for this solution is shown in Figure 8. The 
chip select for the 82077SL is generated by a 85C220 
JoLPLDthat is programmed to decode addresses 03FOH 
through 03F7H when AEN is low. The programming 
equations for the JoLPLDis in the Intel's .ADF format 
and can be processed using the IPLSII compiler (avail- 
able from Intel). 


A floppy disk interface is provided by on-chip output 
buffers with a 40 mA sink capability. The outputs from 
the disk drive are terminated at the floppy disk control- 
ler with a I Kn resistor pack. The 82077SL disk inter- 
face inputs contain a Schmitt trigger input structure for 
higher noise immunity. The host interface is a similar 
direct connection with on-chip 12 mA sink capable 
buffers on DBO-7, INT and DRQ. 
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The schematic 
shows eleven jumpers 
numbered 
JI 
through Jll. The table below describes the functions of 
these jumpers as well as their normal connections. The 
normal connections allow the BIOS to work without 
modification. In the normal mode, the 82077SL re- 
sponds to DRQ2 and DACK2# 
as well as IRQ6. De- 
pending on the type of drive interfaced to this board, 
the DENOUTO and DENOUTI 
signals can be tied. 


With the setting to 2-3 on J8 and J9, the default setting 
is 
DENSEL 
on 
DRVDENO 
and 
DRATED 
on 
DRVDENI. 
PIN6/33 SELECT is used to set for pin 6 
as the EDIN input. The JlI should always be closed. It 
can be used to measure the current consumption of 
82077SL. J7 selects between the primary and secondary 
address spaces. There are two resistor packs used for 
pullups on input signals from the floppy drive interface. 
These resistors are rated at IK. Please note that if using 
older 5.25· drives, the pullup on some of them is l5on. 
Most modem 5.25· drives use a IK value. In order to 
ensure the correct value please refer to the floppy drive 
specification manual. 


For further information, please contact your local Intel 
sales office. 


Jumper 
Description 
Normal 
Connection 


J1 
DA01: 
DMA request 
1 used with DACK1 # to allow for DMA transfers 
Open 


J2 
DA02: 
DMA request 2 used with DACK2 # to allow for DMA transfers 
Closed 


J3 
DACK1: DMA acknowledge 
1 used with DA01 
to allow for DMA transfers 
Open 


J4 
DACK2: DMA acknowledge 
2 used with DA02 
to allow for DMA transfers 
Closed 


JS 
lAOS: Interrupt 
line S used to generate 
floppy interrupts 
Open 


J6 
IA06: 
Interrupt 
line 6 used to generate 
floppy interrupts 
Closed 


J7 
DAV2: Address 
selection 
(between 
3FX and 37X address 
ranges) 
Open 


J8 
DENOUTO: Used with DENOUT1 
to select the values of DAVDEN1,O 
2-3 


J9 
DENOUT1: 
Used with DENOUTO to select the values of DAVDEN1 ,0 
2-3 


J10 
PIN6/33 
SELECT: 
Used to select between 
pin 6 and pin 33 for EDIN input 
1-2 
or 2-3 


J11 
VsslVcc: 
Connection 
between 
two power layers 
Closed 
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Designer: 
K. 
Shah 
Company: 
Intel 
Corp. 
Dept: 
IMD Marketing 
Date: 
April 
'92 
Rev.#: 
% The ~PLD 
used 
in the 82077SL 
Evaluation 
board 
design, 
Rev.#l.O. 
% 
85C220 
dip package 


SA9@2, % System 
Address 
Inputs % 
SA8@3, 
SA7@4, 
SA6@5, 
SA5@6, 
SA4@7, 
SA3@8, 
AEN@9, 


DENOUTO@l, 
% Maps 
the DRVDENO 
and DRVDENl 
to appropriate 
polarity 
table % 
DENOUT1@18, 
% Maps 
the DRVDENO 
and DRVDENl 
to appropriate 
polarity 
table % 


DRATEO@12, 
% DRATEO 
signal 
from 
the 82077SL 
% 
DRATE1@13, 
% DRATEl 
signal 
from the 82077SL 
% 
DENSEL@14 
% DENSEL 
signal 
from the 82077SL 
% 


OUTPUTS: 
CS_@15, % 82077SL 
chip 
select 
signal % 


DRVDEN1@16, 
% Drive 
density 
signal 
connected 
to EDIN 
of the 
drive % 


DRVDENO@17 
% Drive 
density 
signal 
connected 
to HDIN 
of the 
drive % 


NETWORK: 
% Inputs 
% 


SA9 = INP(SA91 
SA8 = INP(SA8) 
SA7 = INP(SA7) 
SA6 = INP(SA6) 
SA5 = INP(SA5) 
SA4 = INP(SA41 
SA3 = INP(SA31 
AEN = INP(AENl 
ADDSEL = INP(ADDSEL) 
DRATEO = INP(DRATEO) 
DRATEl = INP(DRATE1) 
DENSEL = INP(DENSELl 
DENOUTO 
= INP(DENOUTO) 
DENOUTl 
= INP(DENOUT1) 


CS_ = CONF(CSeq, 
Vccl 


DRVDENO 
= CONF(DENOeq, 
Vccl 
DRVDENl = CONF(DENleq, 
Vcc) 
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% CS_is 
activated 
for 
3FO-3F7 
and 
370-377 
address 
spaces % 


CSeq = (AEN' * SA9 * SA8 * SA7' * SAG * SA5 * SA4 * SA3' 
* ADDSEL' 
+ AEN' 
* SA9 * SA8 * SA7 * SAG * SA5 * SA4 * SA3' 
* ADDSEL)' 
: 


% These 
are 
the 
interface 
DENOUTl 
DENOUTO 
o 
0 
o 
1 
1 
0 
1 
1 


DRVDENO 
DENSEL 
DENSEL' 
DRATEl 
DRATEO 


DRVDENl 
DRATEO 
DRATEO 
DRAT EO 
DRATEl 


DENOeq = DENSEL 
* (DENOUTO' 
* DENOUTl') 
+ DENSEL' 
* (DENOUTO 
* DENOUTl') 
+ DRATEl 
* (DENOUTO' 
* DENOUTl) 
+ DRATEO 
* (DENOUTO 
* DENOUTl) 
: 
DENleq = DRATEl 
* (DENOUTO 
* DENOUTl) 
+ ,DRATEO * (DENOUTO' 
+ DENOUTl') 
: 


The following changes have been made since revision 001: 
Table 2 
kBps was corrected to kbps. 


Page 4-132 3. Mitsubishi MF356C description modified to read: "There are two models of this drive. The 252UG 
has DENSELI on pin 2 and DENSELO on pin 33, whereas the 788UG has DENSELO located on pin 2 
and DENSELI located on pin 6. Via jumpers, it is possible to configure the drives to different polarity 
for the density select lines. The following table shows the configuration for the 252UG in which jumper 
setting is 2 MS = IIF and 4 MS = I/F." 


Arrow added to diagram. 
Columns corrected to line 'up properly. 
Figure 8 
Page 4-137 
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Flash Memory Components 
5 


• 


intel~ 
28F001 BX-T/28F001 
BX-B 
1M (128K x 8) CMOS FLASH MEMORY 
• High Integration 
Blocked 
Architecture 
• High-Performance 
Read 
-One 
8 KB Boot Block w/Lock 
Out 
-120 
ns Maximum 
Access 
Time 
- 
Two 4 KB Parameter 
Blocks 
- 
5.0Y ± 10% Yee 
-One 
112 KB Main Block 
• Hardware 
Data Protection 
Feature 
• 100,000 Erase/Program 
Cycles 
Per 
- 
Erase/Write 
Lockout 
during Power 
Block 
Transitions 
• Simplified 
Program 
and Erase 
• Advanced 
Packaging, 
JEDEC Pinouts 
- 
Automated 
Algorithms 
via On-Chip 
- 
32-Pin PDIP 
Write State Machine 
(WSM) 
- 
32-Lead 
PLCC, TSOP 
• SRAM-Compatible 
Write Interface 
• ETOX II Nonvolatile 
Flash Technology 
• Deep-Powerdown 
Mode 
- 
EPROM-Compatible 
Process 
Base 


- 
0.05 p,A Ice Typical 
- 
Hlgh-Yolume 
Manufacturing 


- 
0.8 p,A Ipp Typical 
Experience 
• 12.0Y ± 5% Ypp 
• Extended 
Temperature 
Options 


Intel's 28F001BX-B and 28F001BX-T combine the cost-effectiveness of Intel standard flash memory with 
features that simplify write and allow block erase. These devices aid the system designer by combining the 
functions of several components into one, making boot block flash an innovative alternative to EPROM and 
EEPROM or battery-backed static RAM. Many new and existing designs can take advantage of the 
28F001BX's integration of blocked architecture, automated electrical reprogramming, and standard processor 
interface. 


The 28F001BX-B and 28F001BX-T are 1,048,576 bit nonvolatile memories organized as 131,072 bytes of 
8 bits. They are offered in 32-pin plastic DIP, 32-lead PLCC and 32-lead TSOP packages. Pin assignment 
conform to JEDEC standards for byte-wide EPROMs. These devices use an integrated command port and 
state machine for simplified block erasure and byte reprogramming. The 28F001BX-T's block locations pro- 
vide compatibility with microprocessors and microcontrollers that boot from high memory, such as Intel's 
MCS-186 family, 80286, i386™, i486™, i860TMand 80960CA. With exactly the same memory segmentation, 
the 28F001BX-B memory map is tailored for microprocessors and microcontrollers that boot from low memory, 
such as Intel's MCS-51, MCS-196, 80960KX and 80960SX families. All other features are identical, and unless 
otherwise noted, the term 28F001BX can refer to either device throughout the remainder of this document. 


The boot block section includes a reprogramming write lock out feature to guarantee data integrity. It is 
designed to contain secure code which will bring up the system minimally and download code to the other 
locations of the 28F001BX. Intel's 28F001BX employs advanced CMOS circuitry for systems requiring high- 
performance access speeds, low power consumption, and immunity to noise. Its 120 ns access time provides 
no-WAlT-state performance for a wide range of microprocessors and microcontrollers. A deep-powerdown 
mode lowers power consumption to 0.25 p.Wtypical through Vcc, crucial in laptop computer, handheld instru- 
mentation and other low-power applications. The RP# power control input also provides absolute data protec- 
tion during system powerup or power loss. 


Manufactured on Intel's ETOX process base, the 28F001BX builds on years of EPROM experience to yield the 
highest levels of quality, reliability, and cost-effectiveness. 


Refer to the 1994 Memory Products Handbook for the complete data sheet on this device. 


October 
1993 


Order 
Number: 
290406·005 


28F200BX· T/B, 28F002BX· T/B 
2·MBIT (128K x 16, 256K x 8) BOOT BLOCK 
FLASH MEMORY FAMILY 


• 
xS/x16 Input/Output 
Architecture 
- 
2SF200BX-T, 2SF200BX-B 
- 
For High Performance and High 
Integration 
16-bit and 32·bit CPUs 


• 
xS·only Input/Output 
Architecture 
- 
2SF002BX-T 2SF002BX-B 
- 
For Space Constrained 
S·bit 
Applications 


• 
Optimized High Density Blocked 
Architecture 
- 
One 16·KB Protected 
Boot Block 
- 
Two S-KB Parameter Blocks 
- 
One 96·KB Main Block 
- 
One 12S KB Main Block 
- 
Top or Bottom Boot Locations 


• 
Extended Cycling Capability 
- 
100,000 Block Erase Cycles 


• 
Automated 
Word/Byte 
Write and 
Block Erase 
- 
Command User Interface 
- 
Status Registers 
- 
Erase Suspend Capability 


• 
SRAM-Compatible Write Interface 


• 
Automatic 
Power SaVings Feature 
- 
1 mA Typical lee Active Current in 
Static Operation 


• 
Hardware Data Protection 
Feature 
- 
Erase/Write 
LOCkout during Power 
Transitions 


• 
Very High-Performance 
Read 
- 
60/S0 ns Maximum Access Time 
- 
30/40 ns Maximum Output Enable 
Time 


• 
Low Power Consumption 
- 
20 mA Typical xS Active Read 
Current 
-25 
mA Typical x16 Active Read 
Current 


• 
Reset/Deep 
Power-Down Input 
- 
0.2 ""A lee Typical 
- 
Acts as Reset for Boot Operations 


• 
Extended Temperature 
Operation 
- 
- 40·C to + S5·C 


• 
Write Protection 
for Boot Block 


• 
Industry Standard Surface Mount 
Packaging 
- 
2SF200BX: JEDEC ROM Compatible 
44-Lead PSOP 
56-Lead TSOP 
- 
2SF002BX: 40-Lead TSOP 


• 
12V Word/Byte 
Write and Block Erase 
- 
Vpp = 12V ± 5% Standard 
- 
Vpp = 12V ± 10% Option 


• 
ETOX III Flash Technology 
-5V 
Read 


• 
Independent 
Software Vendor Support 


28F200BX- TL/BL, 28F002BX· TL/BL 
2-MBIT (128K x 16, 256K x 8) LOW POWER BOOT BLOCK 
FLASH MEMORY FAMILY 


• 
Low Voltage Operation for Very Low 
Power Portable Applications 
-Vee 
= 3.3V ±0.3V 


• 
x8/x16 Input/Output Architecture 
- 
28F200BX·TL, 28F200BX·BL 
- 
For High Performance and High 
Integration 16·bit and 32·bit CPUs 


• 
x8·only Input/Output Architecture 
- 
28F002BX·TL, 28F002BX·BL 
- 
For Space Constrained 8·bit 
Applications 


• 
Optimized High Density Blocked 
Architecture 
- 
One 16·KB Protected Boot Block 
- 
Two 8·KB Parameter Blocks 
- 
One 96·KB Main Block 
- 
One 128·KB Main Block 
- 
Top or Bottom Boot Locations 


• 
Extended Cycling Capability 
- 
10,000Block Erase Cycles 


• 
Automated Word/Byte Write and Block 
Erase 
- 
Command User Interface 
- 
Status Registers 
- 
Erase Suspend Capability 


• 
SRAM·Compatible Write Interface 


• 
Automatic Power Savings Feature 
- 
0.8 mA Typical lee Active Current in 
Static Operation 


• 
Very High·Performance Read 
-150 
ns Maximum Access Time 
- 
65 ns Maximum Output Enable Time 


• 
Low Power Consumption 
-15 
mA Typical Active Read Current 


• 
Reset/Deep Power·Down Input 
- 
0.2 IJ-Alee Typical 
- 
Acts as Reset for Boot Operations 


• 
Write Protection for Boot Block 
• 
Hardware Data Protection Feature 
- 
Erase/Write Lockout during Power 
Transitions 


• 
Industry Standard Surface Mount 
Packaging 
- 
28F200BX·L:JEDEC ROM Compatible 
44-Lead PSOP 
56-Lead TSOP 
- 
28F002BX-L: 40-Lead TSOP 


• 
12V Word/Byte Write and Block Erase 
- 
Vpp = 12V ± 5% Standard 


• 
ETOX III Flash Technology 
-3.3V 
Read 


• 
Independent Software Vendor Support 


Refer to the 1994 Memory Products Handbook for the complete data sheet on this device. 
October 1993 
Order Number: 290449-003 


• 


28F400BX- T/B, 28F004BX- T/B 
4 MBIT (256K x16, 512K x8) BOOT BLOCK FLASH 
MEMORY FAMILY 


• 
x8/x16 Input/Output 
Architecture 
- 
28F400BX-T, 28F400BX-B 
- 
For High Performance 
and High 
Integration 
16-bit and 32-bit CPUs 


• 
x8-only Input/Output 
Architecture 
- 
28F004BX-T, 28F004BX-B 
- 
For Space Constrained 
8-bit 
Applications 


• 
Optimized High Density Blocked 
Architecture 
- 
One 16-KB Protected 
Boot Block 
- 
Two 8-KB Parameter Blocks 
- 
One 96-KB Main Block 
- 
Three 128-KB Main Blocks 
- 
Top or Bottom Boot Locations 


• 
Extended Cycling Capability 
- 
100,000 Block Erase Cycles 


• 
Automated 
Word/Byte 
Write and Block 
Erase 
- 
Command User Interface 
- 
Status Registers 
- 
Erase Suspend Capability 


• 
SRAM-Compatible Write Interface 


• 
Automatic 
Power Savings Feature 
- 
1 mA Typical lee Active Current in 
Static Operation 


• 
Very High-Performance 
Read 
- 
60/80 ns Maximum Access Time 
- 
30/40 ns Maximum Output Enable 
Time 


• 
Low Power Consumption 
- 
20 mA Typical x8 Active Read 
Current 
-25 
mA Typical x16 Active Read 
Current 


• 
Reset/Deep 
Power-Down Input 
- 
0.2 JJ-Alee Typical 
- 
Acts as Reset for Boot Operations 


• 
Extended Temperature 
Operation 
- 
- 40°C to + 85°C 


• 
Write Protection 
for Boot Block 


• 
Hardware Data Protection 
Feature 
- 
Erase/Write 
Lockout During Power 
Transitions 


• 
Industry Standard Surface Mount 
Packaging 
- 
28F400BX: JEDEC ROM Compatible 
44-Lead PSOP 
56-Lead TSOP 
- 
28F004BX: 40-Lead TSOP 


• 
12V Word/Byte 
Write and Block Erase 
-Vpp 
= 12V ±5% Standard 
- 
Vpp = 12V ± 10% Option 


• 
ETOX III Flash Technology 
-5V 
Read 


28F008SA 
8-MBIT (1-MBIT x 8) FLASH MEMORY 
• High-Density Symmetrically 
Blocked 
• Very High-Performance 
Read 
Architecture 
- 
85 ns Maximum Access Time 
- 
Sixteen 64-Kbyte Blocks 
• SRAM-Compatlble Write Interface 
• Extended Cycling Capability 
• Hardware Data Protection 
Feature 
- 
100,000 Block Erase Cycles 
- 
Erase/Write 
Lockout during Power 
-1.6 
Million Block Erase 
Transitions 
Cycles per Chip 


Automated 
Byte Write and Block Erase 
• Industry Standard Packaging 
• 
- 
40-Lead TSOP, 44-Lead PSOP 
- 
Command User Interface 
- 
Status Register 
• ETOX III Nonvolatile 
Flash Technology 
• System Performance 
Enhancements 
- 
12V Byte Write/Block 
Erase 


- 
RY/BY # Status Output 
• Independent Software Vendor Support 
- 
Erase Suspend Capability 
- 
MlcrosoW 
Flash File System (FFS) 
• Deep-Powerdown 
Mode 
- 
0.20 /-LAlee Typical 


Intel's 
28F008SA 
8-Mbit 
FlashFile™ 
Memory 
is the highest 
density 
nonvolatile 
read/write 
solution 
for solid 
state storage. 
The 28F008SA's 
extended 
cycling, 
symmetrically 
blocked 
architecture, 
fast access 
time, write 
automation 
and low power 
consumption 
provide 
a more reliable, 
lower power, 
lighter weight 
and higher per- 
formance 
alternative 
to traditional 
rotating 
disk technology. 
The 28F008SA 
brings new capabilities 
to portable 
computing. 
Application 
and operating 
system software 
stored in resident 
flash memory 
arrays provide 
instant- 
on, rapid execute-in-place 
and protection 
from obsolescence 
through 
in-system 
software 
updates. 
Resident 
software 
also extends 
system 
battery 
life and increases 
reliability 
by reducing 
disk drive accesses. 


For high density 
data acquisition 
applications, 
the 28F008SA 
offers a more cost-effective 
and reliable 
alterna- 
• 
tive to SRAM and battery. 
Traditional 
high density 
embedded 
applications, 
such as telecommunications, 
can 
take advantage 
of the 28F008SA's 
nonvolatility, 
blocking 
and minimal 
system 
code requirements 
for flexible 
firmware 
and modular 
software 
designs. 


The 28F008SA 
is offered 
in 40-lead 
TSOP (standard 
and reverse) 
and 44-lead 
PSOP packages. 
Pin assign- 
ments 
simplify 
board 
layout 
when 
integrating 
multiple 
devices 
in a flash 
memory 
array or subsystem. 
This 
device 
uses an integrated 
Command 
User Interface 
and state machine 
for simplified 
block erasure 
and byte 
write. The 28F008SA 
memory 
map consists 
of 16 separately 
erasable 
64-Kbyte 
blocks. 


Intel's 
28F008SA 
employs 
advanced 
CMOS circuitry 
for systems 
requiring 
low power consumption 
and noise 
immunity. 
Its 85 ns access time provides 
superior 
performance 
when compared 
with magnetic 
storage 
media. 


A deep powerdown 
mode lowers power consumption 
to 1 p.W typical thru Vcc, crucial 
in portable 
computing, 
handheld 
instrumentation 
and 
other 
low-power 
applications. 
The 
RP# 
power 
control 
input 
also 
provides 
absolute 
data protection 
during system 
powerup/down. 


Manufactured 
on Intel's 
0.8 micron 
ETOX 
process, 
the 
28F008SA 
provides 
the 
highest 
levels 
of quality, 
reliability 
and cost-effectiveness. 


Refer 
to the 
1994 Memory 
Products 
Handbook 
for the complete 
data sheet on this device. 


October 1993 
Order Number: 290429-004 


28F016SA 
16 MBIT (1 MBIT x 16, 2 MBIT x 8) 
FLASHFILETMMEMORY 


• 
Revolutionary 
Architecture 
- 
Pipelined 
Command 
Execution 
- 
Write During Erase 
- 
Command 
Superset 
of Intel 
28F008SA 


• 
1 mA Typical 
Ice in Static Mode 


• 
1 J-tATypical 
Deep Power-Down 


• 
32 Independently 
Lockable 
Blocks 


• 
State-of-the-Art 
0.6 J-tmETOX IV Flash 
Technology 


Intel's 28F016SA 
16-Mbit FlashFile™ 
Memory is a revolutionary 
architecture 
which enables 
the design of truly 
mobile, 
high performance, 
personal 
computing 
and communication 
products. 
With innovative 
capabilities, 
low 
power 
operation 
and very high read/write 
performance, 
the 28F016SA 
is also the ideal choice 
for designing 
embedded 
mass storage 
flash memory 
systems. 


• 
User-Selectable 
3.3V or 5V Vee 


• 
User-Configurable 
x8 or x16 Operation 


• 
70 ns Maximum 
Access Time 


• 
0.43 MB/sec 
Write Transfer 
Rate 


• 
1 Million Erase Cycles per Block 


• 
56-Lead, 
1.2mm x 14mm x 20mm TSOP 
Package 


The 
28F016SA 
is a very 
high density, 
highest 
performance 
non-volatile 
read/write 
solution 
for solid-state 
storage 
applications. 
Its symmetrically 
blocked 
architecture 
(100% 
compatible 
with 
the 
28F008SA 
8-Mbit 
FlashFile™ 
memory), 
extended 
cycling, 
low power 3.3V operation, 
very fast write and read performance 
and 
selective 
block 
locking 
provide 
a highly flexible 
memory 
component 
suitable 
for high density 
memory 
cards, 


Resident 
Flash Arrays and PCMCIA-ATA 
Flash Drives. The 28F016SA's 
dual read voltage 
enables 
the design 
of memory 
cards which can interchangeably 
be read/written 
in 3.3V and 5.0V systems. 
Its x8/x16 
architecture 
allows the optimization 
of memory to processor 
interface. 
The flexible block locking option enables 
bundling 
of 
executable 
application 
software 
in a Resident 
Flash Array or memory 
card. Manufactured 
on Intel's 
0.6 J.Lm 
ETOXIV 
process 
technology, 
the 28F016SA 
is the most cost-effective, 
high-density 
3.3V flashfile 
memory. 


